TTAATCGGCG TACCATTTGC AGCAGGTGTC GTTAACTTTG TCGTGCTAAC TGCCGCGGCC 1200

TCTGCTACAA ATAGTGGTAT CTATTCGAAT AGTCGTATCT TATTCGGACT GTCACAACAA 1260 

GGGTTAGGTC CTAAAGTTTT AAATAAAACG AATAGTCATG GCGTGCCTTA TTTATCAATG 1320 

TTAGTTTCAT CAATTGCATT ACTTATAGCA GCCTTGTTAA ACTACATTTT CCCTAATGCA 1380

ATTCAACTAT TCATATACGT TACAACGTTA tCAACTGTGT TGTTTTTAGT TGTtTGG^CA 1440

ATGATnATTG TCGCTTATCn AATGTATTTG GAAAAAGCAT CCTGAGGCA 1489

(2) INFORMATION FOR SEQ ID NO: 241: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 5000 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 241: 

TTTTCCATCA TrATcACCAT TTAAGACAAT AGCTGTATGA CCATATCCTC CACCAAATCT 6 0 

TCCACTAAAA ACCACTAAGT CCCCAGGTTC AGCTTTGAAA GTTGGTGTGT TGTGATAAAT 12 0 

TTTAGCTTCA CTATTAAAAT TATTTGCATA TGGTATATCT TTAGCTCCAT ATCCTTTtAA 180 

TCCAkGACCA TAAAGAkGAT TCCAATATAC ATTAACTAAA TCGAAACATT GCCATCCATA 240 

ACTACCGTCG AAATCCCATC CTCTATTCTC TAATGTTTTC AAATAATTTA ATGTACTGCT 300

ATTACTATTA CTTTTATTAT TTGAAGACAC TGTTTTTGGT TTTGGTTCTA CTAATGC-TGT 3 60 

CATTGGCACT TTTAATTTTT GACCAATAAA TATTAAATTA GGATTTGCTA TATTATTTGT 4 20 

ATTTTGAATA TTTGAAACTG TAGTTTTGTA TTTTAATGCT ATAGCACTAA GTGTGTCTCC 4 80 

TTTTlS'TACA GTATAGATTT GTGTTTTTGG AGCTTCTCTA AAACTGTAGT AACCAAAGTA 540 

ATTATTAGTA ACTTTATTCG TTTTCTTCTG ATTAGAATTT TGAGCTTCCA AGTTTGCAAT 6 00 

TTTAATTTCT TTAGTAAGTT CATTGTTATT AATAACTAGA TTGTTACCTT GGCTTGAGTT 660 

TTTCGGAGTA TTTGAAATCT TTATATCTTG ATTAATTTCA TTTCCGTTTG AAATTGCTGA 72 0 

TTTGTTGTCT AACTTTAAAC TTGTGTCCGA TGTTTTAACA GCACCTTCAT TTTTTATTTT 78 0 

GTCTTTTGTC GTATTTTTAT TAGCATrTAA CTCTGATTTC GCGAATACAT TTTGCTCATA 840

CCCTCTTGTA AAATCTTTAG ATTTATCAAT TTCATCTGCA TATGCTTTGT TCGACATACC 900 

CAATGCCAAA AACATACCTA TTGAAATTGA CAAAATTCCA ATACTAACTT TTCTAATTGA 9 60 

ATAGCGTACT TTACTTTGTT GTTTATTCAT GATGAAACAC TCCTTACAAT AATATACAAA 1020 
GTAGGAGGTA AAATAATTAA CTTGTCTTTC CAAAATATGA AAAGTGTACT AAAaTTCATC 1140

GCACGACAAA TAGCCCATTT CCGATACTTT TATAAAGTAT GGAaTGGGCT ATAGCCATTT 1200 

ATATCATCTT TTAACTTTAT TTATTAACAG TTAATAATGA TTCATAAATA CCTGCTTCTT 126 0 

TAGCAGCTTC AATTAATGTT GAACCAATTT CTGAAGGTGT TGCCGCTGTT TTCACACCAC 132 0 

AACTATTTAA TGTTTTAATT TTCTCTTCAG CAGTACCTTT ACCACCTGAA ATGATTGCAC 13 8 0 

CAGCATGTCC CATACGTTTT CCAGGAGGTG CTGTTTGTCC ACCGATAAAG CCTACAACTG 144 0 

GTTTTGTCAT ATTCGCTTTA ATCCATTCAG CTGCTTCTTC TTCAGCCGTA CCACCGATTT 1500 

CACCAATCAT AACAACTGCT TTCGTTTCGT CATCTTCATT GAATGCTTTT AAAACATCAA 156 0 

TAAAGTTTGT TCCGTTGACT GGGTCTCCAC CAATACCAAC AGCTGTAGTT TGACCAATAC 162 0 

CTTCTTCAGT CAATTGGTGC ACTGCTTCAT ATGTTAATGT ACCTGAACGA GATACTACAC 16 80 

CAACATGACC XnTTTGTGA ATATAGCCAG GCATAATACC AATTTTACAT TCATCTGCTG 1740 

TAATCACACC TGGACAGTTC GGACCAACTA AACGTGTTTT TCTACCTTGT AAGTAGCGTT 1800 

TAACTTTAAC CATGTCTAAT ACAGGAATAT GTTCAGTGAT ACAAATAACC ATATCTAAGT 1860

CTGCATCAGC TGCTTCTAAA ATTGAGTCTG CAGCAAATGG TGCTGGAACG TAAATGACTG 1920

AAACCGTTGC CCCAGTTTCA TTTTTAGCTT CTTCAACAGT GTTGAAAACA GGAACGCCTT 198 0 

CAACAACTTG ACCACCTTTA CCAGGCGTCA CACCTGCTAC TATTTTCGTA CCATAATCAA 2 04 0 

GCATTTGTTT TGTATGGAAA AGGGCAGTAG ACCCTGTAAT ACCTTGTACC ATTACTTTAG 2100

TATTCTTATC TATAAATACA CTCATCTTAG TGCTCCCATC CTTTCCTTAT GCTTCTTTGA 2160 

CTAGTTTAAC AATTTTTTGT GCACCTTCAG CCATTGTTGC TGCTGGTTCA ATTGCTAATC 2220 

CTGAGTCTTT TAAGATTTTT TTACCTAACT CAACATTTGT ACCTTCTAGG CGTACAACTA 22 80 

GTGGTAAAGT TAAATCTACT TCTTTTACAG CTTCAACGAT ACCTTCTGCG ATAACATCAC 234 0 

ATTTCATAAT GCCACCGAAA ATGTTTACAA AAATACCTTT AACATTTTCA TCACCTAAAA 24 00 

TGATTTTAAA TGCTTCAGTT ACTTTTTCTC TAGTAGCGCT TCCGCCTGCA TCTAAGAAAT 24 60 

TGGCTGGGTT TCCACCGAAA TGATTAATCG TATCCATTGT TGCCATGGCT AAACCTGCAC 2520 

CATTAACCAT ACATCCGATG TCACCATCTA ATGCAATGTA TGATAAATCA TGTTTAGACG 2580 

CTTCAATCTC TTTCGGATCT TCTTCTTCTA AATCACGTAA TTCTACAACA TCTTTATGTC 2 64 0 

TGAATAATGC ATTATCATCA AAATTAATTT TAGCATCTAA TGCCAATACA TCACCATCAG 2700 

CTGTTGTAAC TAATGGGTTG ATTTCTACGA TTGAACAATC TTTTTCAATG AATACATTAT 2760 

AAAGTGCTAA TAAGAATTTA GCAGCTTTGT TAACAGATTC TTTAGGAATA TTAATATTAA 2820

AGATCTTTTC AGGAGTCTTC GCAGCAACTT CTTCAATCTC AGTGCCCCCT TCTTCAGACG 2940

CCATCAATGT TACTTGGTCA GTCGCACGAT CAATAACGAA TCCAACGTAA TATTCTTTTT 3000

GAATAGCACA ACCTTCTTCG ATATATAAAC GCTTAATTTC TTTACCTTCT GGACCAGTTT 3060

GATGTGTCAC CAAAGTTTTC CCTAATAATT CTTTTGCATA TGTTTCTACC TCAGATAAAG 3120

ATTTAGCAAT TTTTACTCCG CCTGCTTTAC CTCTACCTCC AGCATGAATT TCTCCT-TTTA 3180

CAACATAAAC ATCAGAATTT AATTCTTTTG CTTTCTCCAC CGCTTCTTCA GCAGTAAATG 3240

CTACTCGTCC TTCTGGAACT GCAACGCCCA TTGAACGAAA TATTTCTTTA CCTTGATACT 3300

CGTGGATATT CATCTTCCAT CCTCCTGTTA CTTAGGTTAA GTTCCCTTAC AATTATAAAA 3360

AATGTAAGCG CTATTGTAAA CTTAAATGCT ACTTTTTTAT CATTTAATTG AATTTTACGA 3420

TTTACAGTAA CGATTTTATA GGTTCAAAGC TTTTTCTATG CTCTTTCATA ATGCCAATAT 3480

CATCGATTGC TAGTAAATGT TGTTTGGTAC CGTAACCCGC GTTTTTTTCA AAACCATATT 3540

CAGGATAATC TTTAGATAAC TGTGTCATAT AATCATCACG AAAAACCTTT GCCATGATAC 3600

TTGCAGCTGC AATGGACACA CTTCTTGCAT CACCCTTGAT TAAAGATACT TGAGGCAGTG 3660

CATTATCAAG CGTCATCGCG TCTATCAATA AATGCGTTGG TTGTACTGAT AATCCATCAA 3720

TAGCTCGCTG CATGGCGATT TGAGTAGCTT TATAAATATT AAATTCATCT ATTTCTTnCA 3780

GtGTCGCGAT CCCATATGCA AAAGCAGTAA CTTCAi-rn-i' TAGTGCTTCA TTTAATTCTA 3840

GACGTTTCGT AACAGGTACT TTTTTCGAGT CATCAAGGCC CAAATAATTG TGATTTGAAT 3900

TTAAAATTGT TGCGCATGCA ACGACTGGAC CTGCTAAAGG TCCTCTTCCA ACTTCATCAA 3960

TCCCACAAAT AATAGCATTA GGATGCTCTT TTAATATTTC ATTTTCAAAG TAAGTCATTT 4020

CAACATACTT TTCTTTTAAA GCTTGTTCTT TTTCTAACGC TTTTCTGCGC CTAGCTATGG 4080

CATTlTGAAC ACCTTTTCGC TCATCTAAAA AGCATTCATG ATTTTCTAAT TCTTCTATTG 4140

TATTAACCGC ATTAATCAAC TGCGTAACTT CTTTAATTGT TAGCGTCATT TGCTAATTCC 4200

TCAGTCATAT CTTTAAAAAT ATCAAAACAA TAATTTCCTA TTTTAGCATT TCGAATATCA 4260

TAAATAATCA GTTCAATGAC TGCTTCGTAA TCAATTTCAT TACCACGTCG AATTAAGCCC 4320

ACGTTTTTTC CCTATCGCAT CAAACCACGC TATGATTTCT GCAtCTTCAG GAACTTCAAT 4380

ATTATAATGT GACTTTAATC GCGCTAAATC Al'TiTGAATT AAAAAGTTTA ATCCATAGAT 4440

GGCAACTTCA TCTAAGTGCA CAATACTATC TTTTATCGCA CCAGTTAAAC TCAACTTCTT 4500

ACCGACTTCT TCATCTTCAA ATTTAGGCCA AAGTATCCCT GGTGTGTCTA ATAGTTGTAA 4560

TGCATTACCA ACTTTAATCC ATTGTTGTTG TTTGGTCACA CCTGGTTTAT TACCAGTCTG 4620

AACGATCATT GCTCTTATCG CTCTAGGTTT AAGTCCTTTC GCTTTTTCGC GTTCAAATTT 4740

TTCAGCAGTC GCCTTAATTG CTGCAGCTTC CACTTTCTTT AAATTTTTAC CGTGCTTAGC 4800

ATCCACTGAT ACAGGATAgT AtCCTTTATC AATAAAAAAT tGTTCCCATT TTGACATCTC 4860

ATTTAAATTA GACATATCTT TTTTATTTAA TATAACAACA CGTGGTTTTT GGTTAATAAC 4920

TTCATCTATC ATAGGGTTTC TTGAACTATA TGGAATTCTT GCATCTACTA GTTCAAACAC 4980

TACATCTACT TTTTTTAATT 5000

(2) INFORMATION FOR SEQ ID NO: 242: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1700 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 242: 





AACCCGnAGA CGAAACTnCA TAGTTGCCTG TTATGTTGAT TAGTGCGTTT GTAGCTATTT 60

nAAATCAAAC ATTATTAAAT ACAGCGTTaC CTAGTATAAT GAGAGAATTA AATATCAATG 120

AAAGTACATC GCAATGGCTA GTTACTGGGT TTATGCTTGT TAATGGCGTC ATGATACCTC 180

TGACGGCATA TCTAATGGAT AGAATTAAAA CTAGACCTTT ATACTTAGCG GCGATGGGGA 240

CAT'rrrrATT AGGTTCTATT GTTGCAGCCT TAGCTCCGAA TTTTGGAGTT TTAATGTTAG 300

CTCGTGTAAT TCAAGCGATG GGTGCAGGCG TACTTATGCC CTTAATGCAA TTTACGTTAT 360

TTACATTGTT CAGTAAAGAA CATCGAGGTy TtGCAATGGG ACTAGCAGGT TTAGTAATTC 420

AATTTGCACC AGCAATAGGA CCTACAGTTA CAGGATTAAT TATTGATCAA GCGAGTTGGC 480

GAGTTCCATT TATTATAATT GTAGGAATTG CTATACTTGC CTTTGTTTTC GGTTTGGTTT 540

CAATCTCGAG TTACAATGAA GTGAAATATA CGAAATTAGA TAAGCGTTCA GTAATGTATT 600

CAACTATTGG GTTCGGGTTA ATGCTATACG CATTTAGTAG TGCAGGAGAT TTAGGATTTA 660

CAAGTCCAAT AGTAATAGGT GCGTTGATAT TAAGTATGGT TATTATCTAT TTATTTATAC 720

GTAGACAATT TAATATTACT AATGCACTTT TAAATTTAAG GGTTTTTAAA AATAGAACAT 780

TTGCATTATG TACGATTAGT TCAATGATTA TAATGATGTC AATGGTTGGA CCTGCGCTGC 840

TTATACCGCT ATATGTTCAA AACAGTTTAT CTTTATCTGC CTTGTTATCA GGACTTGTTA 900

TCATGCCTGG TGCAATAATA AATGGTATTA TGTCAGTTTT TACAGGTAAA TTTTATGATA 960

AGTATGGTCC TAGACCATTG ATTTATACTG GTTTTACAAT TTTAACAATT ACTACAATTA 1020

GAATGTTTTC AGTTTCTTTA CTCATGATGC CGATAAATAC TACAGGAATT AATTCTTTGA 1140

GAAATGAAGA AATCTCACAT GGCACGGCTA TTATGAACTT TGGTCGTGTA ATGGCTGGTT 1200

CACTAGGCAC AGCTTTAATG GTTACATTAA TGAGTTTTGG TGCAAAAATA TTTTTATCTA 1260

CATCGCCATC GCATTTAACT GCAACTGAAA TTAAACAGCA ATCCATTGCT ATAGGGGTGG 1320

ATATCTCATT TGCTTTTGTA GCTGTGCTTG TTATGGCAGC TTATGTGATA GCACTTTTTA 1380

TAAGAGAACC TAAAGAAATA GAAAGTAATA GAAGGAAATT TTAAAATAAT TATAGTAGTT 1440

GGTCTATTTA AAATAATAGG CTAACTGCTT TTTTTATTTA ATAAAAAGTT TTATACTTTT 1500

AGTGATAGAC TAAGCAAAAA TTGTTATTTG CTATGATGTA GATGTCTTAA AATGATTAAG 1560

GGGGATTTGC TTTGTTAACG GTAGATCAAG TGAAAGAATT GGTAGGAGAA ATTAAAGATC 1620

CTATTATAGA TGTGCCTTTA AAAGAAACAG AAGGTATTGT TGAnGTTTCT ATTAAGGGAG 1680

AAnAAGAACA TGTGAGTGTT 1700

(2) INFORMATION FOR SEQ ID NO: 243: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10146 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 243:

TGCATCAACT TTCAAACAAT AAATCATCAC AATAACCACA CCTAATTCAA CACTTTTCAA 6 0 

ACATAAGTAT TGACACATTG AGCAAAATGA TTTTTAATTG TAACTAATAC AGTTACAATT 120 

ATGAGGTGAG AAACATTGAA TTTAGAATTT AACATTGCCG TGCATGTATT AGCTTTTTTA 180

ACTAAGCATC aTTCAGAAAA ATTCAATAGT AGTTCATTAG CAGAaTTAaC TTGTTTAAAT 24 0 

CCTGTTCAAT TACGACGCGT GACGACTCAA CTTGTCGATT TAAAAATGAT TGACACAATA 300 

CGAGGTAAAG ATGGCGGTTA TTTAGCAAAT GATCAAAGTG CTGATGTCTC TCTAGCAACA 360 

TTATaTAAAC ATTTTGTCTT AGAGAAAGAA CACCACACAC GTCTATTTAC TGGCGACGAA 42 0 

GGCAGTCACT GTCAAATTGC TCGTAATATT GCAACTACCA TGTCACATTA TCAGCAAGAC 4 80 

GAACAGAATA TCATTATTAA TTTTTATAAT GGAAAAACAA TCAAAGATGT CATTGAAGAC 54 0 

ATTCAAAAGG AGGATTTATG TCATGAAAAC ATATGATTTA ATTGTAATAG GATTTGGGAA 60 0 

AGCTGGTAAA ACTTTAGCGA AATACGCTGC ATCAACAGGT CAACACGTCG CAGTTATCGA 66 0 

ArAATCrCCG AAAATGTATG GAGGCACTTG TATAAACATA GGATGTATAC CTTCGAAGAC 72 0 
TGTTGTCAAT GCGCTAAACA ATAAAAATTA CCACTTATTA GCAGACGATA ACAACATTGA 840

TGTACTGGAT TTTAAAGCGC AGTTTAAATC TAATACTGAA GTGAATTTAT TAGATCAACA 900 

TGACGATATC GTCGATAGTA TTACTGCACC TCATATCATT ATTAATACCG GTGCTACCTC 960 

TGTCATTCCT AACATTAAAG GCCTTGATCA AGCTAAACAC GTCTTCGATT CGACAGGTTT 1020 

ATTAAACATT AGCTATCAAC CTAAGCACCT CGTTATTGTA GGTGGCGGTT ATATCGCCTT 10 80 

AGAATTTGCT TCAATGTTTG CGAATTTAGG TAGTAAGGTA ACAGTATTAG AACGTGGCGA 114 0 

ATCATTTATG CCACGCGAAG ATCAAGATGT CGTTGCATAT GGTATTACTG ACTTAGAAAA 1200 

TAAAGGCATT GCATTGCATA CAAATGTTGA AACGACTGAA TTGTCATCTG ACAATCATCA 126 0 

TACAACAGTC CATACCAACG TTGGTAACTT TGAGGCTGAT GCAGTACTTT TGGCTATCGG 132 0 

GCGCAAACCG AATACGGATT TAGCTTTAGA AAATACTGAT ATCGAATTAG GCGACAGAGG 13 80 

CGAAATTAAA GTCAATGCTC ATCTTCAAAC AACTGTGCCG CATATTTATG CTGCAGGTGA 144 0 

TGTTAAAGGC GGACTTCAAT TTACGTATAT ATCTTTAGAT GATTATCGAA TTATCAAATC 1500 

AGCGTTATAT GGTAATCAGT CACGTACGAC TGACAATAGA GGCAGCGTGC CTTATACAGT 1560 

ATTTATAGAT CCACCATTAT CACGTGTTGG ATTAACTAGT AAAGAAGCTG CCGCTCAACA 1620

TTATGATTAC ACTGAACATC AACTTTTAGT AAGTGCTATA CCTCGTCATA AAATTAACAA 1680

TGATCCAAGA GGTTTATTTA AAGTAGTCAT TAATAATGAA AATAATATGA TTTTAGGTGC 174 0 

TACATTATAT GGTAAGCAAT CTGAAGAATT AATTAATATA ATTAAACTTG CGATTGATCA 1800 

AAACATTCCA TATACCGTAT TACGAGATAA TATTTATACG CATCCTACGA TGGCCGAATC 18 6 0 

ATTTAATGAT TTATTTAATT TCTAGACAAA ACATAAAAAC CTGGTGGCAC GCATTGAATG 192 0 

ATGCTGCCAT CAGGCTTTAT TGTTGTGCTT TTCGCTTTTC TAATTTTTCT TTAAGCTTTC 198 0 

TATCTTGTTC TTCTTTACGA CGTTTACGTT CTTCATGTCG TTTTCTTAAA CGCTCTTCTT 204 0 

CTTCAGGATC ACGTGGTTTC TTTAATTGTT GAGAAACTTT TTCGATTAAT TCTTCTTCAG 210 0 

TAAGCGCagc CAGTGGGCGG TTATTAACAA AAGTGAATGT TTTTCGGCGT CCAGGTCCAC 216 0 

AATAAGATTG ACAACCTATC ACGATTTCAG CATCGGGATC TAATTTTTCC AACTTCTTTT 222 0 

GTAACGTTCT TATATTGACT GCCTGACATT CATCACAAAT AAGGAATGTA TTTTTCATAT 2280

TGCTACCCAC CTTTCTTTAT CATATCTATA TCGTCGATTT CATTAATTTT TTCGTTAACT 234 0 

CTATCTATTT TACTCTTTTT AATATTTTTT TCAAGATACG TAACACGGCT GaCAATAAAA 24 0 0 

AATGGAGCAT TTATCTTCTA ATTAAATTAG ATGaTTGCTC CCCTATCAAA TCATTTATTG 24 60 

CCCATGATAA ATATTAAATT TTAATGGTTT AATACCATGT TTTGTCCATT CATCATAAAT 2 52 0 
TGATGTTTTA GATGCGCCGT GATACTTTTC AGCAATATCA CACAAATATT TTAGCTTTTC 2640

AGTTTCTATA TCAACTGTAG CTTCTTTATC CATACGTTGA ATAATTGTAC GATTCTGACG 270 0 

CACCATCTTT TGCACACCTT TAATGTTATT TGTTTTAAAA GCATGAATAA GTTTTTCAAC 27 6 0 

ACAACGATGT GAATCTTCTA AGAAGTCACC GTAAAATGAA GGATCTGATT TCAAACGTTT 282 0 

CACTTCGCTA ACAAAGTGTG GTGATQACGC CGGTGAGCCA GTCCAACCGA TAAGTACTTC 28 8 0 

CATATTTTCA GGTGCTTGTA ATGGTTCGAT GTGCAATCCA GGCCAGTnT TGATTAAAAC 294 0 

TTCTTCAACC GTAGTATCTT CAATTTGATG CTTAACCCAT TCATGATCAA AAGTACTATA 3 000 

CGCTAGCCAT CCACTATATA CACTCACAGC AATATCTCCG CATGAACTTA AACTTTGTAA 3060 

CTTCATATTT GCAATCACTG CTAGTTTATA AATGTATAAA TTAGATAACT TCATATCATA 3120 

AAATTCATTT AATACTTTTA TAACTGACAC AAGTACTGCT GCACTTGAAC CTAATCCATA 3180 

TTTATGACCA TTTGAATCAT CTAAATTACT ATCAATAGTC AGATGAAAAT GCTTCATCGC 324 0 

TATATCGCAA CTTTTCGCGT ATTGTTCAAA TATTTCAATA GCTGTGaCCA CATAATTTAA 3 3 00 

TTGTTTTGCT GcATGTGGAT CTGAAATGAC AATACTATCT TCATCTCTAC TAAATGTAAC 3360 

TGGGTTATGA TGTAATGCTT TTGAATGAAT GGTACCTTTA TATTGGTCTG CTTCTTCAAT 3420

AGTAGCAGTT ACAAAACGAT CTAACGCAAT AAGTACAGAT TTATATCCTG GTTCTGTTAC 34 80 

AGCATATTCT CCAGCAATAT AAAGTTTTCC GGGTGCTTTG ACCTGAATCA TTTTATCTCT 3 54 0 

TCCTTACTCA ATTATTTCAA TTCCTGTGGC AATAATGTCA CTATCAATAA TTTGGTTATT 3 600 

ATCAAACTGT GTTAATAATT TATCTATAAT CTGTTGCTTG TTTTTCTTTT CTACAAGTAT 3660

TTTCACATTA GGTCCCGCAT CCATTGTAAA ATAACACGGA TACCCCGCTT CTCGGCATTC 3720 

GTGAACAAGC GCCATGACAT CATAACTTTC TTGCACAAGA TATGTGAACG GCGGTGTTGA 3 7 80 

TCCTAGATTC GTGGCATGCA TACGCAAACC ATTTTCTTCA ATTACTTCAC CAAGGCGTTT 3 84 0 

AAAATCTTTG TCTTGAATCG CTGCTTTTGC TTCAGCTAAA TCTTCATCAA TATGATCTAA 3 90 0 

CCAATATTGA TAAAACCTTG ATGTGTTTCG TGTCAATGAC ATACCATATC GACTAGGTAC 3 960 

CTTTTTAGAA TGTTGATTAA TCACAACAAA TATCATGGCA AGGTCATCTT CAAAATGATT 4 02 0 

CGATTCAAGT GGAACGGCAT ATGACGTCTC ATCACTATAC CCTTTTTCCC ATTCTGCAAA 4 080 

TCCACCATAA ATACTACGCG ACGCAGAACC CGAACCAATT CGCGCCAATC TCGATAAATC 414 0 

CTTATCTGAC AGCTGCATGT CTAGCGCTTG ATTACAAGCT GCTGCTAAAG CTGCATATGC 4 200 

GCTTGCCGAT GAAGCCAACC CTGCTGCTGT TGGTACAAAA TTGTCGCTTT CAATTTCTGC 4 26 0 

ATACCAATCG ATGCCAGCTC TATTTCTGAC AATATCCATA TATTTTGAAA TTTTCTCTAA 4 3 20 
AAAAGTGACT TTCGTTTCAG TGTAAAATTT TTCTAATGTA ACAGATATGC TATTATTCAT 4440

TGGAATGATT AGTGCTTCAT CTTTTTTACC CCAATATTTT ATAAGTGCAA TATTCGTATG 4 500 

TGCACGTGCT TTGCCACTTT TAATCAACGC ATTAACCTCC TAAATTCTCA ATCCAAGTAT 4 560 

GTGCTGCACC AGCTTTTTCT ACAGCTTTTA CAATATTTTT CGCTGTTGGT AAATCnTGG 4 620 

CAAGCAATAA CATACTTCCA CCACGACCAG CGCCAGTAAG TTTTCCAGCA ATCGCACCAT 4 6 80 

TTTCTTTACC AATTTTCATT AATTGTTCTA TTTTATCATG ACTAACTGTC AACGCCTTTA 474 0 

AATCCGCATG ACATTCATTA AAAATATCCG CTAAGGCTTC AAAGTTATGA TGTTCAATCA 48 00 

CATCACTCGC ACGTAAAACT AACTTACCGA TATGTTTTAC ATGTGACATG TACTGAGGGT 4 860 

CCTCACAAAG TTTATGAACA TCTTCTACTG CTTGTCTTGT TGAACCTTTC ACACCAGTAT 4 92 0 

CTATAACAAC CATATAGCCG TCTAAACTTA ACGTTTTCAA CGTTTCAGCA TGACCTTTTT 4980 

GGAACCAAAC TGGTTTGCCT GATACAATCG TTTGCGTATC AATACCACTT GGTTTACCAT 5040

GTGCAATTTG CTCTGCCCAA TTAGCCTTTT CAATGAGTTC TTCTTTCGTT AATGATTTCC 5100 

CTAAAAAATC ATAACTTGCA CGAACAAAAG CAACCGCGAC AGCTGCACTC GATCCTAATC 516 0 

CACGTGATGG TGGTAAATTC GTTTGGATCG TTACTGCTAG CGGCTCTGTA ATATTATTTA 5220

ATTCTACAAA ACGGTTCACC AAAGACTTAA GATGGTCAGG CGCATCATAT AACATACCAT 5280 

CGTAAACATC GCTTTTAATA GACGAATAGT TCCCGCTCTC TAAGGCTTCT ATTAAAACTT 534 0 

TGATTTTACC TGCGTTAAAC GGTACTGCAA TAGCAGGCTC TCCAAATGTA ACAGCATGTT 54 00 

CTCCTATTAA AATAATCTTA CCTGTCGATT CCCCATATCC TTTTCTTGTC ATGTCAATAT 54 60 

CACCTTTTAT ATTTATCCTA TACTTGATTC ATTATTTTTA TTTATTAGTA AAAGACATCA 552 0 

TATTCTAAGT TGCAtngCAT TCGCGTTAAA TTTCATTGCA GTCTTTATCT CACATTATTC 5580 

ATATTATGTA TAATCTTTAT TTTGAATTTA TATTTGACTT AACTTGATTA GTATAAAACT 564 0 

AACTTTCGTT TACTTCAAAG TTTAAATCTT ATCGAGTGAT ATTTCAGATT CTTTATCTTT 5700 

TTATAAAATA GCCCTACAAT TTATAATTTT CCACCCTAAC TATAATACTA CAAATAATAA 576 0 

TTGGAATATA TAGATTTACT ACTAAAGTAT TAGAACATTT CAATAGAAGG TCGTTTCTTT 5820 

CATAGTCATA CGCATTATAT ATACCCTATT CTCAATCTAT TTAATACGTA AAACATGAAA 58 80 

TTTTCTTATT AAATTTATTA TTTCCATCAT ATCATTACTT TTAATTTAAT GATGTTCAAT 5 94 0 

TTAAATATTA GGTCAATAAC ATATTTATGC TTTTTATGGA TACTTTCAAA AATAACAGCC 6000 

CCAAACGATA ACTTGAAAGG GGCTGTTAAA TATTTAACTA TTGCATTTGA TCTATCATTT 6060

TCTTGTTTCT TTCAATCATT TTATCAAAAT ACGTATCGTA TCTTTGCCAT TCTTCTTGAG 6120 
TTACATCTTG AACAGTAATC GTTTTGTTAA GCAATGTCTC TAATGAGGCC ATACAAGATG 6240

GTTCAATTTC AGGATATTTA AATTTAGTCA CTTCACCTTT TAAAGCATGT TCATAAAATG 63 00 

TTTGCATCAT CAATGCACGT TCTGAACCAG AGCCTTCAAC ACAAAGATAA ATTTGTACAG 6360

CAATACCGCC TCTAACTCTT CGTTGCGATA TGCCTGCAAA TTTCTTACCA TCGATACTTA 642 0 

AGTCAAATTT TCCTGGGCAA TAAGAATGTT CAATTTCCAT CGTATCAATA TCAACATTCT 6480

CATTTTCGAA CATTTTGCTA ATTAAGAGGT ACATCACAGT AAACGCTTCA TCAATCGTTG 6 54 0 

TTTCTGTTTG TCCTTTGAAC ATCAGCGATA TATTTAATAC ACCTTGATCT AGAACGACAC 66 0 0 

CTAAGCCACC AGAATTTCTA ACAATGGCAT TATAACCAAT CTCATTCGTT AAATAATCAA 66 6 0 

TGCCATCTTT TAAAAACGGC AATCTTGAAT CATGAATACC AAGAATAACA GTATGTTGAT 6720 

GAATCCAAGT ACGCACAACA TTATCTGATA TATCTTTGCC CACACTTTCG CAAAATGTAT 6 780 

CATCGAATGC GAAAGATTGC ATAGGTTCTA ATCCAGAAGA ATGATCGATA TATCGCCAGT 6 84 0 

TGACGCCATT AAAATATTTA CTCGCTAAAT CCATCGTTAT TGTAAGOCTT GCGCTGCTGT 6 900 

AATAATTGAA AGATTGTATA CATCTTCAAT TGAGCAGCCA CGTGATAAGT CATTTACTGG 6 96 0 

AGAATTTAAA CCTTGTAATA CTGGACCAAC TGCATCATAT CCACCTAAAC GTTGTGCAAT 7 02 0 

TTTGTAACCA ATATTACCAG CTTCTAAACT TGGGAATACA AAGACATTTG CATCACCTTG 7 080 

TAATTTAGCA CCTGGCGCTT TTTTCTrCAGC AACACCTGGT ACAATCGCAG CATCAAATTG 714 0 

GAATTCGCCA TCAATGATTG CTTCTAATTT TtCTTCTTCA GCTTTTtGTT GTGCTAATTT 7200

GACAGCTTCT TGAACTTTTG TCACGTCGTC TGATTTAGCA GACCCTTTTG TTGAAAAGCT 7260 

TAACATTGCA ACTTTTGGAT CCATGCCAAA GCTTAATGCT GATTTTGCAC TTTCTACTGC 7 3 20 

AATTTCTGCA AGTCCTTGTG AATCAAGTTC TGGATTGATT GCACAATCAC CAAAGATGTA 7380

TTGTTCATCA CCTTTAATCA TAAAGAAGAT ACCTGATGTT CTTGATACAC CTGGTTTCGT 7 44 0 

TTTGATGATT TGTAAAGCTG GACGCACAGT GTCGCCTGTT GAATGTGCTG CACCACTAAC 7 50 0 

TAAACCATCT GCTTTACCAG CATAAACAAG CATTGTACCG AAGTAGTTCA CATTGTTTAA 7 56 0 

TAATTCTTGT GCTTGTTCTT CAGTCGCTTT ACCTTTACGT CGTTCAACAA ATGATTGAAC 7 62 0 

TAATTCAGCT TTCAATTCAC TTGTCGCAGG ATTAATTAAT TCAATATTAG AAATATCAAG 76 80 

ATCAAGTTTT TGCGCTAAAG ATTGAACCTT AGTCTCATCA CCTAACACGA TTGGTGTAAC 7 74 0 

ATAATCTGTT GCTTGTAATT GTGTTGCAGC TGTTAGAACA CGTTCGTCCT CTCCTTCAGG 7 8 00 

TAATACGATT TTAACGTTTT TACCAGAAAG TTTGTCTTTT AATACATTTA ATAAATCAGC 7 86 0 

CATAATGTCC TCCTGTAATA TAAATCTTAT TAATCATTCA CGGTATAATT ATACGCCATT 7 92 0 
TATGATAAAA TTTATAAAGA ACTGATGATT TTTGAAAAGG AGCGATAAAC ATGAGTCAAG 8040

CAGCCGAAAC ATTAGATGGT TGGTATAGTC TACATTTATT TTATGCAGTT GATTGGGCAT 810 0 

CATTACGTAT AGTTCCAAAG GACGAACGCG ATGCACTTGT CACTGAATTT CAATCATTTT 8160 

TAGAAAATAC AGCAACTGTA AGATCATCAA AATCTGGTGA TCAAGCTATT TATAATATAA 822 0 

CTGGTCAAAA AGCAGATTTG TTATTATGGT TCTTACGTCC TGAAATGAAG TCTTTAAATC 8280

ATATTGAAAA TGAATTTAAC AAATTGCGCA TTGCTGACTT CCTAATCCCT ACATATTCAT 834 0 

ATGTATCAGT CATTGAATTG AGCAATTATT TAGCTGGTAA ATCTGATGAA GATCCTTATG 84 00 

AGAACCCTCA TATCAAAGCA AGATTATACC CAGAATTACC ACATTCTGAT TATATTTGTT 84 60 

TCTATCCAAT GAACAAACGT CGTAATGAAA CTTATAACTG GTACATGTTA ACTATGGAAG 8520 

AACGCCAAAA ATTAATGTAT GACCATGGTA TGATTGGTAG AAAATATGCT GGCAAAATCA 8580 

AACAATTTAT TACTGGTTCT GTAGGGTTTG ATGATTTCGA ATGGGGCGTA ACATTGTTCT 8 64 0 

CAGATGACGT ATTACAATTC AAAAAAATTG TATACGAAAT GCGCTTTGAT GAAACAACAG 8700 

CACGATACGG TGAATTCGGT AGTTTCTTTG TAGGACATAT TATTAACACA AACGAATTCG 876 0 

ATCAATTCTT TGCGATTTCT TAATACATTG GTACGTTTAT AAATTAATAA AAAAATTCCA 8820

AGCTTATCGG TTTAAGCTTG GAATTTTTCG TTTATCTTCA GTATATTCCC GTATACATAA 88 80 

GACGTGATTT GGTAAATAGT TGAAATCTGT ATGTTTAAAC TTATATATAT GTGCTAATGT 894 0 

ATTATCAATA ACAAAGtACA CTTTGCTCAT AGCAAgTsaC CCGAgTAGTC TTCCTTGGGA 9000 

GAACTTTAAC TACTATCACT ACATATAAAC GTTAACCTCA ATAGAAATTA TACAGTCGCT 90 6 0 

ACTCTATACA ATTTTTGTAA TGGTTAACTA ATATTATTTT AACCTATTTG AAATATTTGA 912 0 

AACATATTTT TGTCGAATTT TTTTCAATAA TTTTTCCTTT TTATACTTCA AGAGAATTTT 918 0 

AACTACTAAA AATTCCGATG ATTATTATTA CAATAGTATC AAATATTAGT TTTTTAAAAT 924 0 

CAATAACAAC TTATCAAAAA GCTCATGTGG TTATTTTATA GTGTATAAAC TATAATGAGT 93 0 0 

ATTAAATTCT TATAAACAAT GGTGATGAAA TGGACATAAA TTCAGAAGAA TACAAACAAG 93 6 0 

AGGTACTTAT CAAAGACGTT GTCATGCTTG CTGCTCGCAT ACTATTAGAA TCTGGTGCAG 94 2 0 

AAGGTACGCG TGTAGAAGAT ACCATGACAC GTATTGCAAA AAAACTTGGT TACAGTGAAA 94 80 

GTAACAGCTT TGTTACAAAC ACTGTCATCC AGTTTACGTT ACATTCGGAA TCGTTTCCTA 954 0 

GAATATTTAG AATTACCTCT CGAGATACAA ACTTAATAAA AATTTCTCAA GCTAATAAAA 960 0 

ITTCGCGTCA AATTACAAAC AATGAAATTT CTTTAGCCGA AGCAAAAACG CAACTTGAAA 966 0 

AAATATATGT TGCTAAGCGT GACAGCAGTC TTCCCTTTAA AGGTTTTGCT GCAGCAATGA 972 0 
TAGCAGGTAG TCTAGGATAC CTAGTCACTG AGATTTTAGA TCGTAAGTwA CACGCACAGT 9840

TTATCCCAGA ATTCaTTGGT TCaTTAGTTA tTGGGATTAT CGCCGTTATT GGACATACAC 9900 

TTATTCCAAC AGGTGACTTG GCAACTATTA TCATTGCGGC AGTCATGCCT ATTGTTCCTG 9960 

GTGTATTAAT AACAAACGCA ATACAAGATT TATTTGGTGG ACACATGTTG ATGTTCACAA 10020 

CGAAATCATT AGAAGCATTG GTTTnGCGTT TGGCATCGGT GCTGGCGTTG GTAGCGTATT 10080 

AATTTTAGTA TAGGAGTATC AGACTATGTT irGGATCTTA AACTTTATCT TTAGCTTTTT 1014 0 

AGCTTC 10146

(2) INFORMATION FOR SEQ ID NO: 244: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2022 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 244:

ATTTAATTGG TTGGTGGCGT ATTCTCnTTT CCAAGATTTG CCACGATTGG ATGTTCAGTC 6 0 

GACAAATATG TAACCAGTCC CGATAAATTA CGTTCCTCAA TCATTGATAG TATGTCTGGT 12 0 

GTTTGTAACG ATTTATCGAG TAAGGCATCA AGCAATTGAT AATGTCCCAA TACAACAAAA 180 

TGCACGTTGT CTCTCAACTG CTGTTGAATA AACTGAATAA AGAGCTTTAA GCTCTGTTGC 240

ACATTCGCTA ATGATGGTGC ATAGTTTTCC AAACCAACTT GTACAGCCGC TTCATTATTC 300 

CGAATGATTA AACCTGTGTA TGCCACTTTT GTTGCTGCAG TTGGATACAT TGAGTAATAA 36 0 

CGCAATAATT GATCTGTAAA ATCATTTCGA AGTGCATAAA TTTGATGCTC ATGTTGCCAA 420 

AAArCrCGCT CACCCATCTG CTgCAAATCC TCATGGTTCA ATTGTTTCCA GTCCAACTTT 4 80 

TCAACCACAC TAAAATCAAC TAACTCATAA TCCGCTTTAT TAAAATATTT TAAAAATGCT 54 0 

GTTTCCGATT CTTTTAACGC AATTAATTGT TCTGnATTAT TCACTCGACC ACCCTTTACT 6 00 

TTCAATACTG TATTTAAAAT CACTTGGTAT TTTCGTTGTT TGCTTTACTT CTCTACCACG 660 

CTAAAGTGTA ATATGATTAA TAACTTATCA TTTTTAGCAA TACATTACAA CCTTTTTCAG 72 0 

AAAATTCGGT GTATTGATTT TAAAATTTTT' TAAAATAnAA AAGGCAAGAC ATTTGTGCCT 7 80 

ATAAAAATGC TTAACCAAGA TTTTTATATT GaAGTTGTAC TTCTTGCACA TATTGTCCTT 84 0 

GCCTTATTAT GTAAAGTTAT TTTCTTTCTA TCTTTTTATT AAATTTAACT ATTCTTCATA 90 0 

ATCCCGATTC CCTTTAAAGT AACGTCTATC TTGTTTACTA TATACATTTT CAGGATTAAA 96 0 



EP 0 786 519 A2

TTGGTAAACG TTCGTTGCTG ATATATCTGT AAAATTGTTT GGACCGACAC CTGCAATAAA 108 0 

CTTAAACTCT GCTTCATCTA CCAAATAATC ATACGCTTGT GTATGTCTAT CCTGTGCGCC 1140 

ATGTGGAAAT ACAAACATAT CTGTTTTACC TACAATTGGT TCAACTTCAT CTTTCCATCT 1200 

TTTAGTATCA CGTTTAATAC CTTCTAAAGA TGTTTTTTCA AAATTAATGT GACCATATGA 126 0 

ATGACTCGCA AATGACCATC CATCCCGTTT CATTGCGCGA ACAACTTCCT CAGCTGCCTT 132 0 

TTTATTCTTT GTATAATCTT TACTCGTTAA TTCATTCGTG CGATAACCTA ATACGCCCTC 13 8 0 

ATAACCGGTT AAAGCAACAA CACCTTTTTC ACCATTTAAA GAAAAATCTG GATGCTCTTT 144 0 

TACAAATTTA TTTAAAATTG GCACGATATC ATTGTCATCA GAATAAGTAG CATGGCCTTT 150 0 

TTTGTCTGTA GTTTCAGAAA CAACATGTTT ATTTTTATCG AGTACTAAAC GGTCAGCATA 156 0 

ACCATGGTGT CTCATGTAAC TATAGTAATT CATATCATCA ATTGAGATGA TTAGTGGCTT 16 2 0 

TTTACCTTTC GGCAATTTTA TTTTTTTGGC TTTTACATGA TGAGATGATA AGTCGTATAC 1680

ATCATGTGGA TTAACGATGA TGTAATTATT TTTATATAAT TCGTTCAATG ATTTTTTAAA 174 0 

TTCACTTACA GTAATCATCC AATCATTGTT GCCCTTAGCT TGGTGTGTAT CTCCTGTAAA 18 00 

CGCAACTTTT GGGTCTGTAA TTAATGGGTG ATAAAACACA TGATAAACTT GGCCGTGATA 1860

TGTTTCCCAA TGTTCATCCA TTTTcGATTT aTGCTTTGCA TACTCATTTG GATTAACAGA 1920 

TTTATTkTGA GCTTTCTCAT TTTGCTTGGA ACAGCTATAT mACAATGCAA CTGATAATAA 198 0 

CAGAAAAAAT AGCAATAAAT ATTTTTTATG CATTAAACAT TC 2022

(2) INFORMATION FOR SEQ ID NO: 245: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1340 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 245: 

ATAGAGTGAT AAAGGATGGT TGTCATATGA TAAATGCAGT AGTAATAGCA GTAATTTTAA 6 0 

TGATTGTGCT ATGTTTATGT CGATTAAACG TnTTATTAAG CTTATTTATC AGTGCGCTAG 12 0 

TTGGTGGCTT AATTTCAGGC ATGAGCATTG AAAAAGTTAT AAATGTATTT GGGAAAAATA 18 0 

TAGTCGATGG TGCTGAGGTA GCATTAAGCT ATGCTTTATT AGGTGGATTT GCAGCATTAA 24 0 

TTTCATACAG TGGTATCACA GACTATTTAG TAGGAAAAAT TATAAATGCA ATTCACGCTG 300

AAAATAGTCG ATGGTCAAGA GTTAAAGTCA AAGTGACAAT AATCATTGCA TTATTAGCTA 36 0 
CACCATTGTT AAGTCTGTTT AATGACTTAA AAATAGATAG ACGTTTAATC GGTTTGATTA 480

TCGGTTTTGG TTTATGTTTC CCGTATGTGT TATTACCATA TGGATTCGGT CAAATTTTCC 54 0 

AGCAAATTAT TCAAAGTGGC TTTGCAAAGG CAAATCACCC AATTGAGTTT AATATGATTT 600

GGAAAGCAAT GCTTATTCCT TCAATGGGGT ATATTGTTGG CTTACTTATC GGTTTATATG 66 0 

TATATCGTAA ACCACGTGAA TATGAAACAC GTAAAATTTC AGATAGTGAC AATGTTACAG 720

AGTTAAAACC ATATATCTTA ATAGTAACAA TTGTAGCAAT ACTAGCTACA TTTTTAGTAC 780 

AAACATTTAC AGATTCAATG ATTTTTGGTG CACTGGCAGG GGTACTCGTA TTCTTTATTT 84 0 

CACGTGCATA TAATTGGTAT GAATTAGATG CTAAGTTTGT TGAAGGTATT AAAATTATGG 900 

CTTATATTGG TGTAGTTATT TTAACAGCAA ATGGATTTGC TGGTGTAATG AATGCTACTG 960 

GTGATATAGA TGAATTAGTT AAAACTTTAA CAAGTATTAC TGGTGATAAT AAATTATTTA 1020 

GCATTATCAT GATGTATGTG ATAGGTTTAA TTGTCACTTT AGGTATTGGA TCATCATTTG 108 0 

CAACAATTCC TATTATCGCA TCATTATTCA TTCCTTTTGG AGCGTCAATT GGACTAGATA 114 0 

CAATGGCATT AATCGCATTG ATTGGAACAG CGAGTGCATT AGGTGACTCA GGTTCGCCTG 1200 

CAAGTGATTC AACATTAGGA CCAACTGCGG GATTAAATGT TGATGGCCAm CATGATCATA 1260

TACGTGATAC ATGTGTACCA AACTTCTTGT TTTATAATAT TCCTTTAAAT GATTTTCGGT 1320 

ACTATTGCTG CTATGGTACT 1340

(2) INFORMATION FOR SEQ ID NO: 246:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3365 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 246: 

CAAAATCTAA GAGAATAAnA TTTGTTAATT TnAAATAGCA AGCAATTCAA AGTTATATGT 60 

GTAATAGATA AAATAGATAT CCCTATAGTO ATGCGTTACT AGCTAAACAT AATAACACAT 120 

TAGAAGATAA TGAAGTTAAG GAGTTACTGG ATTGTTTCGA CTATGTAATT AAGTATAAAA 180

ATATCCAACG ACAAAACGTA ATTATAAAAT GGTAAAAGCT ATGGTACAGT TTCAAATTGC 24 0 

TAATGACATG CGTATCGGTG AGCTACTTGC AATAAAGAGA GTAAATATAA ACTATGAAGA 3 00 

TAAAACGCTA GATATCGACG GTAAAGTTAA TTGGATAACT GAAAAAAGAC GGGAGCATTC 360 

GGAGTAAAGG AGACAACTGA AAGAAGTAAT AGCTATAAGG CCACAGGGCT CACTACCCAA 420 
TATTCACAAA TACGTCTGGT AGCCCTATCG ACTCGAACAA AATTAGCCAC ATTATTAAAG 540

GGGGGCGCTG ATATTAGTTC TATTAAGAAA CCTATAACGA CGCATACATT ACATCATTCG 60 0 

CATATATCTA CACTTGCTCA ATTAGGAATT AACTTAAAAG CAATGCAAGA GCATGTAGGT 660 

CATTCAGATT ATAAAArAAA TCTAGAGATA TACACACATG TTACTAATCA GATGGCGAAA 72 0 

GATATGATGA ATAAATTTGA ACGATTGGGG AGTTAAAATT GGAAAAAGAT GaTACACTAG 78 0 

CAGAAATTAA GCCTATGCTC AATTTTGATG AGCAAATAGC AAAATTAAAA CAGATGAATA 84 0 

TATTTTTTAA TATTATTGAC ACCGAAAAAG CAAATGAAAT TCTTAGAAAA AATAATTACT 900 

TCTTCAAACT wGcTTATTTC CGaAAAAATT TCGrAAAAAA GaATGGCGGC TATTTCATAG 960 

AATTTGCTTA TTTATCAGAT TTAGCAACTA TAGATATGaA ATTAAGATAC ACAATGTTGC 1020 

ATTTAACTTT AGATATTGAA CATAGTTTAA AGTATCTAGT CTTAAAACTA ATAACAGAAA 1080 

ATAACCAAGA AGATGGTTAT AAAATAATAG ATGAGTTCTT ATGTATTGaT AAATCATATA 114 0 

GCAATTCAAA TTTTGACACA AATTCAAGAA CACCAGAAGA AGTTATGGAA ACCAAAATCA 1200 

AAAATAAAAA CGAAATATTC AAGCATATGA ATAAACGAGG ACAACTACCC GAGAAGTTGa 126 0 

ATAAATACTA TCmAAATCCA CCCGCnnGGk TTTGCaTTGr ATTCATGCAA CTAGGTCAAT 1320

TCGTTTCGTT TCTCAACTTC TATTACAAGA AGTACAATGA CGAAGAATTG AGAGTTGCTA 13 80 

ATATTTTAAT GCCTTTAGTT AAAAATATAA GAAaCAAATC AGCTCATAAC CAACCCATCA 144 0 

TAGCAAATCT AAATTATGAC AGTAGATTAC CTCAATATTT ATTTGAAAAA GGGAATAATA 1500 

TAGGCATATC TAGAAACATG TTCGGAATAA AAAATTTCAT AGATACTkTC ksTACGCTAG 156 0 

AATTACATAA TCAAGTTTGT AGTAATGCAA TTATCCAAGC AAGATATCAC GATTTGGACC 1620 

AACTTCAAAA GCGATATAAA AGrAACGrAA GCTATTATAA TAATGCATTA GCTATCAAAA 16 8 0 

GATT^TTTAT AGCTTTAGAT AAAATTATTG ACTTCAACAG ACCAAAAGTA TAAACTATCT 174 0 

AGTGAGGAAA GAGACTTATA GGTCTCGCGA GTTATTTTAA TTCGTATGCA AGAAAAAGAA 1800 

GAGCTATGCA TTTTATTTAA AATGCGTAGT TCTTtTTTTA TGCATCTAAA TTCATATTAT 1860

TTTTGCAATA TAAACATATC TTTGTGCAAA TTCCGAACAC AAAACATTCA CATCATCCTT 1920 

TTTtGCCCTT TTTCTATACC CCAAAACACA AAAAGCCCCG TAAGCCTATG CcTACGGGgT 1980 

TTGACAATAA ATTATATATT ATTGTTCTTC TTTAACATAT GGTAATAATG CCATATGACG 2040 

AGAACGTTTG ATAGCTGTAG TCAaCATACG TTGATATTTA GCTGAAGTAC CAGTTACACG 210 0 

ACGTGGTAAA ATTTTACCGC GTTCTGAGAT AAAACGTTTT AATAATTCAG TGTCTTTGTA 2160

GTCGATATGT GTAATACCAT TTGCTGTGAA ATAGCATACT TTTTTACGAC GACGTCCGCC 222 0 
CGTTAAITTT TATTAGAATG GTAAGTCATC ATCACTTATA TCAATCGGTC CGTTTGCATT 2340

TGCAAATGGA TTATCAGATT GTTTCGTGTT TGATGAATTA TTGTACGAAT TGTTTTGTCC 24 00 

TGATTGTTGA CCACCGAATC CTTGACCGTA ATCTTGGAAT TCATTTTGTT GACGTTGGCC 24 60 

ACCATTTTGT TGCGCATTTT TAGGTTCAAG GAATTGAACG CTATCACACA CAACTTCAGT 2 520 

AACAAACACA CGACGACCTT CTTGATTTTC ATAATTACGG GATTGTAAGC GACCATCTAC 2580

ACCAGCTAAA CTACCTTTAG ATAAATAGTT ATTTACATTA TCTGCTTGTC TTCTAAAAAC 264 0 

AACACAGTTA ATAAAATctG CtTCGCGCTC CCCTTGAGCA TTCGTGAACG TACGATTTAC 2 7 00 

TGCAAGAGTG AATGtCGCTA CACTCACACC TGAGGGAGTG GTTCTGTATT CCGGATCTTT 2760 

CGTTAAACGA CCTACTAATA CAACTCTATT TAGCATTTAA ACGCCCCCTC TAATTATTAC 2 82 0 

TTGTCTTCGT CTTCACOAAT AACCATGTAA CGAATGATAT CGTCACTGAT TTTAGCTAGA 2880 

CGTTGGAATT CGTCAGTAGC TTTGTTGTTA TCAGATTTAA CACGTACGAT GTTGTAGAAG 294 0 

CCATCTTTGA AATCATTGAT TTCATAAGCT AGGCGACGTT TACCCCAGTC 1-rrTGCTTCT 3 000 

AAAACTTCTG CACCTTCAGT AGCTAAGATA CCGTTGAAAC GTTCAACTAA CGCTTTTTTA 3 06 0 

GCATCTTCCT CAATGTTTGG GCGTACGATG TACATAACTT CATATGTTCT CATTTTATAT 3120 

TTGCACCTCC TTGTGGTCTA TACGGCTTAT CAATCTTAAA ACAGATAAGC AAGGelATAAT 3180 

TTTCATTACT CACAATAAAG AATTaTATCA TGCGCCATTA CTTTTTACAA TaATAATTcA 3 24 0 

AACTACTCTT CATATCATTT TTGATATtAA TTCATTTGaA ACTTTCnATG ATATTTTnAA 3 300 

AAATACACTT CACAAAAGCG AACATATGTn CTATAAnAGT TGTGAGGTGG TAAGGAATGA 3 360 

ATTTA 3365

(2) INFORMATION FOR SEQ ID NO: 247:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1032 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 247: 

GTTAAAAGTA ATTGGTGGTA TTGATGATGA TTTTACAGCC AATGTTATGC ATCCAA_ATCA 6 0 

ATATCGAATT CGATATTCGT CTCAGAAACA GGACCTTAAT GAAGATATGA CAGTTTTTGA 12 0 

TGCAGTATTA AGTTCTGATA CAACAACTTT ACGCATCATC AAGCAATATG AGCAGGCAGT 180 

ACAAGCTTAT GCGGATGACC AAAGTGATAA ATTGTTCAAG CGAATGATGG ATGCGCAAGA 24 0 
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ACTAGGTATA CATGATACTA CTAAATACAT TAAAGAATTA TCCGGCGGAC AACAAAAACG 3 60 

TGTTGTACTT GCTAAAACAT TAATAGAACA ACCAGATTTA TTGTTATTAG ATGAACCTAC 420 

GAACCATTTA GACTTCGAAT CAATCAGCTG GTTGATCAAT TATGTGAAGC AATATCCTCA 480

TACTGTTTTA TTCGTAACCC ATGATCGATA TTTTTTAAAT GAAGTTTCCA CTAGAATTAT 54 0 

TGAACTAAAC AGAGGTAAGT TAGCGTCATA TCCTGGTAAC TATGAATCTT ATATTGAAAT 600 

GCGCGCTGPiA AGAGAAGTAA CACTTCAAAA GCAACAACAA AAGCAACGAG CTTTATATAA 66 0 

GGAAGAACTT GCTTGGATGA GGGCTGGgAG CTAaGGCTCG TACTACAAAG CAACAAGCTA 72 0 

GAATTAATCG ATTTAATGAC CTAGAmAATG AAGTTaACCA GCAATATAAA GACGATAAAG 780 

GTGAATTGAA TCTTGCTTAT TCaAGATTAG GTAAGCAAQT GTTCGAATTA GAAGACTTAT 84 0 

CAAAGGCTAT TAATGATAAA GTATTATTTG AACATCTGAC GGAAATTATT CAAAAmGGTG 900 

AGCGTATTGG TGTTGTTGGG CCAAATGGAG CTGGTAAAAC AACACTCTTA AATATTTTGA 96 0 

GTGGAGAAGA CCAACAATTC GAAGGTAAAT TGAAGACTGG GCAGACGGTT AAAGTAGCTT 102 0 

ATTTTAAGCA AA 1032 
(2) INFORMATION FOR SEQ ID NO: 248:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 852 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 248:

TGTGATTAAC GAAGCTTATT TTCGTACACC TTCAACAACT GATTACAACG GCGTTTATCA 60

AGGT-KVTTAT ATTGATTTTG AAGCAAAGGA AACTAAAAAC AAGACGTCCT TTCCTTTAAA 120

TAATATTCAT GACCATCAAG TCGAACATAT GAAAAATGCA TATCAACAAA AAGGTATTGT 180

GTTTTTAATG ATTCGTTTTA AAACGCTAGA TGAAGTTTAT CTTTTACCCT ATTCAAAATT 240

CGAAGTATTT TGGAAGAGAT ATAAAGATAA TATTAAAAAG TCTATAACAG TTGATGAAAT 300

ACGAAAAAAT GGTTACCATA TTCCTTATCA GTATCAACCA AGATTAGACT ATCTAAAAGC 360

AGTTGATAAG TTGATATTAG ATGAAAGTGA GGACCGCGTA TGACGGAAAA CAAAGGATCT 420

TCTCAGCCTA AGAAAAACGG TAATAATGGT GGGAAATCCA ACTCAAAAAA GAATAGAAAT 480

GTGAAGAGAA CGATTATTAA GATTATTGGC TTCATGATTA TTGCATTTTT CGTTGTTCTT 540

TTACTAGGTA TCTTATTGTT TGCTTATTAT GCTTGGAAAG CACCTGCTTT TACCGAAGCT 600
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TTAGATAATG GCCAAAGACA TGAGCATGTA AATTTAAAAG ACGTGCCGAA ATCAATGAAA 720 

GACGCAGTAC TTGCAACTGA AGACAATCGT TTCTACGAAC ATGGCGCACT TGATTATAAA 780 

CGTTTATTCG GTGCAATTGG TAAGAACTTG ACTGGTGGAT TTGGkTCtGA AGGtGCCTCA 84 0 

ACATTAACAC AA 852



(2) INFORMATION FOR SEQ ID NO: 249:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5804 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 249: 

CACTTTTTTC ATTAAAAATC TCATATTTAT ACACTGAACG TAATCTCGAA TATTTTTCAA 60

CCCAAGTTTT AACTTTAACT TTTTCTGGAT AAAAAATAGA CTTTTTATAA TTGACATTGA 120 

GGTCAGTCAC AGGTGAAATG ATTCCTTGTT TTTCCATATC AGCATAACTA AAACCTAACT 180 

TCGATATATA ATCCAACCGC GCAACTTCAA ACCAAGTTGC ATAATTCCCG TGATAAATTA 240

CACCCATCTT ATCAGTTTCA GCATAACGCG CTTCTATTTC TGTAATACTA TATATCATTT 300 

TAAGCCTTCT TTCAGTTTAA CTTTATATCT CATTCTAACA TAAAATACAA GAAGAGGCCG 360 

GCCAAGAACA CAAAGGkTTT GAACCGACCT ATTATATCAT AAaGTTTATA GAAGTATTTT 42 0 

TGAGCACTAT CAAAGTGCCT CAAATACCGA TTAAAATTTT ACTGTGATAT CTATTTTTTA 480 

TTGCGCTAAT TTATTTCTTA AAACCATTTG TAAAATTCCA CCGTGACGAT AGTAATCCAT 54 0 

TTCAACAAGT GAGTCAAAAC GAACCATAGC GTCAAATTCT ACCAAATCAC CATCTTGCTT 600 

CTTAffCAGTA ACTTTGACGT AGtCATGTGG TTGAACATTT TCATCAATAT TAACAGrAAT 6 60 

TTCTTCTGTA CCATCTAGAC CAAGAGAATC AGCTGATTCA CCTTTTTTAA ACTCTAATGG 720 

TAATACACCC ATCATAACTA AATTTGAACG ATGGATACGT TCATAACTTT GTGCAATAAC 78 0 

TGTTTTAACA CCTAATAAGT TTGTACCTTT TGCTGCCCAG TCACGAGATG AACCCATACC 84 0 

ATAATCGTTA CCAGCTAATA CAACTAAACC TGTACCATCT TCTTTATATT TCATTGCAGC 90 0 

ATCAAAGATA GGCATTACTT CATTTGTTGG CCAATAAGTT GTAAAACCAC CTTCAGTACC 960

TGGCGCTAAT TGGTTTTTAA TACGTATATT AGCAAACGTA CCTCGAACCA TTACTTCGTG 102 0 

ATTACCACGT CTTGAACCAT ATGAATTAAA TTCACGAATA GGCACTTGAT GATCTTGTAA 1080

A-^AT-^ACCA GCTGGCGTAT CTTTACCAAT TGCACCTGCT GGAGAGATGT GGTCAGTTGT 114 0 
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15 




TTCTTTAGAT AATCCTTGGA AGAATGATGG ATTTTGAATG TATGTTGAAT TAGGATCAAA 12 6 0 

GTCATATAGA GGTTGATCAG TTACATCAAT CTCATTCCAT AATTCGTTGT TATTGTATAC 1320 

GTTATTATAT TCTTCAATAA ATAATTCAGG TGTTACAACA CTATCAACGG TATCTGAAAC 13 80 

TTCTTTAATT GATGGCCAAA TATCTTTCAA ATATACATCT TCACCGTCAT TACCTTTACC 14 4 0 

AATAGGTTCA TTTTGTAAAT CAATATCAAC CGTTCCAGCT AATGCATAAG CAACAACTAA 1500 

CTGTGGTGAA GCTAGGTAAT TGGCTTTAAC AAGAGGATGG ATACGACCTT CAAAGTTACG 156 0 

GTTACCAGAT AATACAGATG TCACTAATAG GTCCTCATCA GCAATCGCTT TTTCAATTTC 1620 

TGGTAATAAA GGACCTGAAT TACCGATACA AGTTGTACAT CCATAACCAA CCAAGTTGAA 16 80 

GCCTAAATCA TCTAAATAAG GTTGTAAGCC AGCATCTCTT AAATATCCGG TAACAACTTT 174 0 

TGATCCTGGT GCTAGAGAAG TTTTAACGTA TTCAGGAACT TTCAAGCCTT TTTCAACTGC 1800 

TTTTTTAGCA ACTAAACCTG cACCTAACAT TACATAAGGG TTAGATGTAT TTGTACATGA 186 0 

TGTAATTGCT GCTATTGCAA TATCACCTGT TTTCATTGTA GCTTTTGATC CATCTTTAAA 192 0 

GTTAATTTCA GCTTTCTTAT CAAATTCACT TTTATCTAAA CCGTGTCCTT GGTTGCCTGC 1980 

TGGAGCTGTT ACAGAATTTT CAAATGATGA TTTCATATCA CTTAAGAAAA TTAAATCTTG 2040

AGGACGTTTT GGTCCTGAAA GCGATGCTTC AACTGTTGAT AAATCCAATT CGATAACATC 2100 

TGTATAATTA GGATCTTCTT TCTCAACATC AAAGAACATA TGGTTTTGTT TCAAATATTC 216 0 

TTTTACTAGC GCGATATGTT CGTCTGATCT ACCAGTTAAC TTCATATATT TAAGAGATTC 2 22 0 

ATCATCAACT GGGAAGAATC CGCAAGTTGC TCCATACTCT GGTGCCATGT TTGCAATTGT 228 0 

AGCACGGTCT GCTAGTGGTA AATGTTGTAC ACCTGGACCA AAGAACTCCA CAAATTTACC 234 0 

AACAACACCT TTTTTACGTA GCTCTTGAGT TACTCTTAAC GCTAAATCAG TTGCTGTTGC 24 0 0 

GCCTTGTGGT AATGAATTTA CTAGTCGTAC ACCAATAACC TCTGGAATTG GGAAATAAGA 24 6 0 

AGGTTGTCCA AGCATTCCAG CTTCAGCTTC AATACCACCA ACACCCCATC CTAGTACGCC 252 0 

AATACCATTT ATCATTGTTG TATGTGAATC AGTACCAACT AATGTATCTG GAAATGCAGT 25 80 

TTTTTCACCA TCTACATCAC GAACATGTAC AACACTTGCT AAATATTCTA AGTTAACTTG 264 0 

GTGAACTATT CCAGTTGCAG GAGGAACTGC ATTGTAATTA TCAAATGCTT TCGTTGCCCA 270 0 

ATTTAAAAAC TGATAACGTT CATAGTTACG TTCAAATTCT AATTTCATAT TACGTTCAAG 276 0 

AGCTTCTGGA TTTGCATAGC TATCCACTTG AACTGAGTGG TCAATAACTA AATCCACCGG 282 0 

TACTTCTGGA TTAATTTTAG TAATATCTCC CCCAACGTCA TCCATTGCTT TACGTAAAGA 2880

AGCTAAATCA ACTACGGCTG GTACACCTGT GAAATCTTGT AAAATAACAC GAGAAGGTTT 2 94 0 
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GTCTGTAATT ACAAAATCAT CTTCTTGACG AAGTAAAGAT TCTAACAAAA CACGAATTGA 3060

ATAAGGTAAA TTGGAAACTT TAGTAATACC TTGCTCTTCT ACAGCTTTTA AATCATAGTA 3120

AGTATAACTT TGGCCATTCA AGTCAAAATG TTTTTTTGAT TGCTCTTTAA AATTTGCAGC 3180

CATTTAATGA TCCCCCTTGA TACATTTTTA TATTTATATG CCTTGATTAA ATTGTATTAT 3240

TATATTTATT GATAAACAAC TCATCATGCT TAGAAAACGC TTAATTTAGG TTTTGACTTT 3300

TTAATCAGAG TATATAAGCA AAACTTATCA TACAGGTAAG GTGTAATAAG TATTTTTTAT 3360

TAATTGAGAA TAATTATCAA TTTCGCGAAT GATTCAATTC AATTTTTAAA CGTATTATTT 3420

CATTGAGCAG AAAGAAAATT ATGGCACCAA ACTTTAATAT TTTTTTCAAT GTCATTCTTT 3480

TGATGGGAGT GGGACAGAAA TGATATTTTC GCAAAATTTA TTTCGTCGTC CCACCCCAAC 3540

TTGCATTGTC TGTAGAAATT GGGAATCCAA TTTCTCTTTG TTGGGGCCCA TCCCCAACTT 3600

GCACATTATT GTAAGCTGAC TTTTCGTCAG CTTCTGTGTT GGGGCCCTCA CCCCAACTCG 3660

CATTGCCTGT AGAATTTCTT TTCGAAATTC TCTGTGTTGG GGCCCCTUAC TAGAATTGAA 3720

AAAAGCTTGT TACAAGCGCA TTTTCGTTCA GTCAACTACT GCCAATATAA CTTCGTAGAG 3780

CATAGAATAT TGATTTATGT CCCAGCCTGA GTTAATTTTC TATAAAAGTA TATTTAATTT 3840

GCGTTTATAC CGTCAAACTT CACTTTAGCT TTGTCAAACC CCTTTCTATT AAGTTTTCAG 3900

AAATAAACCT ATCTTAAAAT ATAAAAAAAT CGAGAATTCG TAGTTTAATA ACGAAATTCT 3960

CGTTCTTATC CTTTTGAATA TACTCAATTT TCCACAAAAA CAAACAAGTA GTATATCTGT 4020

TCTAGCTACT AGAATGACAT ACTACTTGTT ATTAAAATAC TTAACTAAAC TTTATTAGTT 4080

ATCTTTTTTC TCTATATTTC TACGTGACTG ACGCTTTTCA AGAATGTCAG ATTCATAATC 4140

TTCTTGTTGA CTCTTGATAT ATTCTTGTAA GCGATGTTTA TTCGGAGTCA ATGTTAAACC 4200

TAGGAATTTA CGTTCCTGGT TCGCATCCTT GTAGAAACTT ACCATCATGA GTATGACGAC 4260

AAAGGAGAAT GGGAATGCAC TTATAATTGC AGCACTTTGA ATCGCATTTA AAGCTTCAGC 4320

GCCGTTACCG CCACCAGCTA ATAAAAGTAC AAATGCTATT AAGGCCTGTG AAATTCCCCA 4380

AACAACTTTT ACCATACTAG ATGGATTTAA TGAACCAAAT GTTGTTTGCA TTCCTAATAC 4440

AAATGTTGCT GAGTCAGCAG ATGTAATAAA GAATGATGCA ATTAATAATA ATGCAATCAA 4500

CGATAAAACA ATGCCAAATG GCACATGAIT AAACACTCCA AATAGCTGTG TTTCAGGAGT 4560

CATATCAAAA ATTTCTTTGT GTTTCTTACC TGTCTCGATG CCTAATACAC CAAAGACACT 4620

AAACCAAACA AAACTAACAA TTGCTGGAAC TAGCAAGACA CCAGAAATGA ACTCTCTAAT 4680

TGAACGTCCT TTTGAAACTC GTGCAATAAA CACTCCAACG AATGGACTCC AACTTAACCA 4740
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TGCTGTATCA AAACTATTAA ACAAGAATGT GTTTAGTAAA CTACCCGTAG AGCTAGTTAA 4 860 

CATATTTAAA ATAAGAACAG TTGGTCCAAC AATTAAAGCA GCTACCATTA AAATAGTACC 4 92 0 

TAAACCAATG TTCAAGTTAC TTAAGTATTG AATACCTTTA CTTAATCCAG ACCATGCACT 4980

TGCTATAAAT AAGATAGTAA CAACAATGAT GATAATCGCT TGTACAAACG TATTGTTTGG 504 0 

AACATTGAAC AAGTAATGTA AACCACCATT AATTTGTAGA GCACCCATAC CTAACGAAAC 5100 

GGCTACCCCA ACGATTGTCG CAAATACAGA TAAAACGTCA ATAAAAATCC CAATAGGACC 516 0 

TTCTACTTTA TCACCTAAAA GAGGACGTAA AGTTCTAGAT AATAAACCTG GTTCACCTTT 522 0 

ACGGAATTGC GAATATGCCA ACGTAACGCA ACAACACCAT AAACAGCCCA AGCATGGAAT 52 80 

CCCCAATGGA AAAATGTTGA ACGTAGAGCT TCAGTATAAG CTTCAGTAGT TTTGGGATCT 534 0 

GCTGTAGGTG GCGTAGCAAA GTGCGCCATC GGTTCAGCTG CACCATAAAA CACCAAACCT 54 00 

ATCCCCATAC CAGCACTAAA CAACATAGCA AACCATGAAA TTGTATTAAA CTCAGGTTTG 546 0 

TCATTTGGTT TACCTAGTTT AAGTTTTCCA ATAGGACTAA AAATAAGGAA TATACAGAAG 5520 

AACACGATAA TCGTAGTAAG AATAAGATAA TACCAACCTA ACTTTTCTGT AATCCACATT 558 0 

TTAATATTAT TGGTAACATA GTTGAATTGT TCAGGTAAAA ATGCACCAAG TAATACGACT 5640

ATAGCAACAA CAATTGCACT ATAGATGAAG ACTGGTGAAT ACTTCTTTCC ATTTGGATTC 5700 

TCTGGTGAAG AAGAATTCAT AATTAATTAC TCCCTTCAAT TCTATATTTA ATTTTATGTA 5760 

GTAGAATAAA AATATTATCT AAACATTTTA TTCAATAACT CACG 5804

(2) INFORMATION FOR SEQ ID NO: 250: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 400 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 250: 
ACCCGCGAAT ATGGTCCATC CTATCGATTT ATTTTTAACT GGTTTGACAA TATTTAATTT 6 0 

TTCATAATCA TTCTTAGTGA TTTTGACATA TGTTTTCGGT ATGAGCCAGT TAATAAATGG 12 0 

AAAGAAGAAG ACAATCCAAT TACTTGCCAA ATCaATCATT AAATATTCAC TATCGTATTT 18 0 

GATTATTCGA TATTTAGGGT TTTTATTAAT AACTTTAGAT TCGCAAAGCA ATGTCTCCAC 24 0 

ATCCCTTTAA TTTTATGTGT AATACATTTT TCGATACTTC AAAAGACATT CAAATACTAT 300

CAAGTTACTG TCATCAAAGG TTTTATTAAC TGATATTtTC ATATTTTTAa TCTGAATTTA 360 

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(2) INFORMATION FOR SEQ ID NO: 251: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 964 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 251:
CCAGGGTGCG GAAAGCTTTA AAArTTGGAC TAATAAAGAT GCTGATATTA AITCTATGAA 60
AACAGCAGTT TTACAACAAT TAAAAGGAGA ATAACATATG CTTACTGGcA AACAAAAAAG 120
ATACTTAAGA AGTrXAGCAC ACAATAITGA TCCGATTTTT CAAATTGGAA AAGGCGGTAT 180
CAACGAAAAT ATGATTAAAC AAATAGATGA TACGTTAGAA AACAGAGAAT TGATTAAAGT 240
;,CATGTACTA CAAAATAACT TTGATCATAA AAAAGAATTA GCTGAAACAT TAAGCGAAGC 300
TACTCATAGT GAATTAGTGC AAGTGATTGG ATCTATGATA GTGATTTATA GAGAATCTAA 360
AGATAATAAA GAAATTGAAT TGCCATAATA ATOAAAAAGA TAryACTTTA CGGCGGTCAG 420
TTIAACCCTA TCCATACTCC ACATATGATA GTAGCTAGCG AAGTATTTCA TGAATTACAG 480
CCAGATGAAT TnATTTrrT ACCTAGTrTT ATGTCTCCAT TGAAAAAGCA CCATGATTTr 540
ATAGACGTTC AGCACAGATT AACAATGATA CAGATGATTA TCGACGAGCT TGGTmrK3A 600
GATArriGTG ACGATGAAAT TAAACGTGGT GGTCAAAGTT ATACCTATGA CACGATCAAG 660
GCATTCAAGG AGCAACACAA AGACAGTCAG TrGTACTTTG rTATTGGGAC GGATCAGTAT 720
AACCAACTAG AGAAATGGTA TCAAATTXSAA TACTTAAAAG AAATGGTTAC TTrTGTAGTT 780
GTAAATCGAG ACAAAAATAG TCAAAATCTT GAAAATGCTA TGATTGCAAT TCAGATACCT 840
AGGO^AGATA TAAGTTCGAC AATGATTCGA CAAAGAGTTA GTGAAGGGAA ATCTATCCAA 900
GrrcrrGTTc ctaaatccgt tgaaaactat attaaggggg aaggattata tgaaca^tga 960
AAAA 964

(2) INFORMATION FOR SEQ ID NO: 252: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1193 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 252:
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TTGTCATGAA ATAAATGGGA TGAATATCAC GACTAGAAGT AATGTTACGA ACAGGAGCGT 120 

ATAAACTAGA GACGCTAAAT TCGACATAGT ATGTnGCTCA ATTATGGCTG ATGATGAATT 180 

TAAAGTATGT GCGTTGGAAC TGTCGGATTT TTGTTCATAA TGTTTTGCAT ATTGCGCCAT 24 0 

GATGAATAGT GTAAAAATAA ATAAAACAAT AAGAGATATA ATGCCCATAA TCAAAAGTAT 3 00 

TTGTTTAGAG CCTTTCATTA TTTCACATCC TTTCTAAAAT ATATTTGTAA CTAAATTTAA 3 60 

AATAGTTATT TTTGTAATTC TAAACCTTTT TCATCGCGAA AACAATTAAA TAGGTCGCGG 42 0 

TATTAATTAT TATATTATTA CCGCTTAATA TGAAAAATAC ATGAaAATTA ATTTTCTAAT 4 80 

ATACTTTTGA AAAATTATTA CAAATTAGCC CCTTCAAAAC GCGAAAACAT AAGGATTCTA 54 0 

GTTTCAAAAG GGCTGATAAG CATAAAATGA AATGTAATAT TTCGATGTAT AAAATTTTTA 600 

ATTAGCTAAA AATCATCGCA TTAATTTTTT GAGCTACATC ATCAAAATTC GOACATTTTA 660 

ACGACACATA TAATTTAATT TTAGGTTCAG TACCAGAAGG ACGTAAAGCG ATAAATCCTT 720 

CGTCAAATAA GACACGAATA ACATTTGATT TAGGAGAATT AATCTGCGAC GTTGTATCTT 780 

TATCCAAATG ATAAACCTCG CTAGTTAAAT AATCTTCAAT TGCTTTCACT TTGAGTCCTT 84 0 

GAATCTCTTG CGGTGGATTT GAACGGAATT TGGTCATTAT TGCATTAATT TTCTTTTTCC 900

CTTCAAATCC TTCTAGCGTA TGCGgAATAA TGTATCCTCA TGTCTACCAA CAGTTTGATA 960 

AATCTGTTCT AATTCATCTT TCAATGTTTT GCCATATAAT TTTAACTCAG AAGCGTATTT 1020 

TATAATGAGT GGCACAATTT GTACGGCATC TTTATCACGT ACAAAAGGCT CTGATAGAAA 1080 

ACCGTAACTC TCTTCAAATG CGAAAATCAT ATTTGATGAT CATCCAGTTG TCTTATTTCC 1140 

TGAGCAATAA ATTTAAGCCC GTCAGCACCT CTTTGGTATT CAACATTATT ATA 1193 
(2) INFORMATION FOR SEQ ID NO: 253:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1098 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 253:



TATCAGGATG ATTATGnCGG GGTTTTATTA AGTCTGAGCG TATATnCCTT TTATTCTCCT 60

AAATGGTTTG GAATTCAAGG AGAAGACGAT GAAATGGTAT CCAAAAACCA AnGAATATAT 120

nGCATTAAGT CTGAGGATGA TAGTGCGGTG GCAATTCGTT CATTAATTTT GCATAAAGAT 180

GAACCTATGT ATTTAAAAAA ACGTACATGT GTACCTACTT TGTTAATTAA TGGGGAACAT 240
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AAAAAAATCT TCGAACATTC AGGACATGCA CCGCATATTG AAGAACCAGA AGCATTTATG 360

AAlTATTATr TAAAATTTTT AAAAAGCGTA TCATAATATG TGATATATAA ACCTAGGGCA 420

TAAAGTCCTT AGGCAATGTG AAAAAGCTGA TTACTATTCA rrATTTGATA GAAATCAGCT 480

TTrrrTGAAA TGTArrrGAT ATATACTGCT CGITATGCGG CTATCTTCCT TATATTAAGT 540

GCCATTAGTG CAAAACCTCT TAACAATTAG GTAAAAAGAG CATAAAAAAA GGAAGTTTAA 600

TAGAATGTAT CATCTATCAA ACTTCACCAA ATTGCGCTAA ACAAAATTAT AGTTCAATTT 660

CGTrGTITGC TTCAGTGATT CGrrTATTTA CTCGACTCAA TAATGATTCG ATTrmTAC 720

GTTGITGTGC AITAACAAGA ATTAATACAG TrCTTTCATC ATGCTCATTA CGTTTTTrAT 780

CGAAGTAATC TTCTTGAGAT AAAATTTTAA CTGCTTTAAC AACTTCTGGT TGTTTGTAGT 840

TTAAATGATT AATAATATCt TTAAGATAGT ATTCrTTCTC TrTGTTTTCG CTGATGTATG 900

TCAATACAGC GAATTCTTCA AAGCTAATTC AnAATTCCTT TTTAATTAAA CTTTTTAArr 960

TGTCAGCATA AGTGACCATT GATAACAACT CAAAGCAATC ATTGATrrrT GTAATIX.CCA 1020

TGTTTAAAAC CTCCCTATTT GATGCATCTT GCTCGATACA TTTGCCCCGA TAATATAnTG 1080

TATCTAATCT TTATGnAT 1098

(2) INFORMATION FOR SEQ ID NO: 254:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2881 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 254:

CCA(idTAAAA TTGTGCAATC ATTTGACGCA TTAATGGACG CCTTGGACAA TGAAGATTAT 60

GAAGGAGAAA AAGTCATTCC ATTCTTAGAT AAACATTTTA AATATCAAGA TGGCCGATCA 120

AGTGAGCGTT TAGTCAGAAA TTTATTrGGT AGCTAAGTTT ATATAGTAGT CAAAGTGGGA 180

GAGGTATAAT GATGAAATTT TCAGTAATAG TTCCAACATr CAATTCAGAA AAGTATATAA 240

CAGAATTACT TAATAGCCTr GCGAAACAAG ArTITCCGAA AACTGAATTT GAAGTGGTTG 300

TAGrrGATGA CTGTTCAACA GATCAAACGT TACAAATAGT TGAAAAGTAT CGCAATAAAT 360

TGAACTTGAA AGTAAGTCAA CTCGAAACAA ATTCTGGTGG TCCAGGTAAA CCTAGAAATG 420

kGGCGrrAAA ACAAGCAGAA GGTGAATTTG TATTATTTGT GGACTCCGAT GACTATATAA 480

ACAAAGAGAC TTTAAAGGAT GCAGCAGCAT TTATTGATGA ACATCACTCA GATGTCTTAT 540

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CACCTGAAGT TACTTTGTTA AATTCAAGAA TTATCTATAC TTTAAGCCCG ACTAAAATCT 66 0 

ATAGAACAGC ATTACTAAAA GATAATGACA TTTATTTTCC AGAAGAATTA AAGAGTGCAG 720 

AAGATCAATT ATTTACAATG AAAGCATATT TAAATGCAAA TCGAATCAGT GTGTTAAGTG 780

ATAAAGCGTA TTATTATGCT ACAAAGCGTG AAGGTGAACA TATGAGTAGT GCGTATGTTT 84 0 

CACCTGAAGA CTTTTATGAA GTCATGAGAT TGATTGCTGT AGAAATATTA AATGCAGATT 90 0 

TAGAAGAAGC CCATAAAAAT CAAATCTTAG CAGAATTTTT AAATCGTCAT TTTAGTTTTT 96 0 

CTCGTACGAA TGGCTTCTCA CTTAAAGTTA AACTAGAAGA TCAACCACAA TGGATTAATG 102 0 

CTCTAGGAGA CnTATACAA GCAGTTCCAG AACGTGTAGA TGCATTGGTG ATGAGTAAAT 1080 

TACGACCATT GTTGCACTAC GCGAGAGCGA AAGATATAGA CAACTATAGA ACTGTGGAAG 1140 

AAAGTTACCG TCAAGGTCAA TACTACCGTT TTGATATTGT AGATGGTAAA TTAAACATTC 12 00 

AATTCAATGA AGGCGAACCA TACTTTAAAG GCATTGATAT CGCTAAGCCA AAAGTGAAAA 126 0 

TGACAGCATT TAAATTTGAT AATCATAAAA TTGTTACAGA GCTAACGTTA AATGAATTTA 1320 

TGATTGGCGA AGGACATTAT GATGTCAGAC TTAAATTACA TTCACGAAAC AAGAAGCACA 138 0 

CAATGTATGT ACCTTTAAGT GTCAATGCGA ATAAACAATA TCGTTTTAAC ATTATGTTAG 1440

AAGATATTAA AGCGTATTTA CCTAAAGAAA AAATTTGGGA TGTTTTCTTA GAAGTCCAAA 1500 

TAGGTACGGA AGTATTTGAA GTGCGTGTTG GTAATCAACG TAATAAATAT GCATATACTG 1560 

CAGAAACAAG TGCATTAATT CATTTGAATA ATGATTTTTA TAGATTAACA CCGTATTTCA 1620

CAAAAGACTT TAATAACATT TCGTTATACT TTACAGCTAT TACATTAACG GATTCAATCT 1680 

CATTGAAGTT AAAAGGTAAA AACAAAATCA TTTTAACTGG TCTGGATCGT GGTTATGTAT 174 0 

TTGAAGAAGG TATGGCTAGT GTCGTACTAA AAGACGACAT GGTGATGGGA ATGTTAAGCC 18 00 

AAACSTCAGA AAACGAAGTG CnAAATCTTA CTTAGCAAAG ATATTAAAAA GCGAGACTTC 186 0 

AAAAATATTG TTAAGTTAAA CACTGCACAT ATCACTTATC CACTAAATAA ATAATAAATG 1920 

CCCTCAAATC ATTGTGAGCC AACATGATTT GAGGGCTTTA TTTTGCTGTT TATGACATGA 1980 

TTATGACATT TCCCTGATTT TCATTTTCAT ATACATTAAA TTGTATACAC TGGAAATGAG 2 040 

GAGGTTATCT ATAATGATAA ATAAAAATGA CATAGTAGCA GATGTAGTAA CTGATTATCC 2100 

GAAAGCAGCG GATATTTTTA GAAGTGTGGG AATAGATTTT TGTTGTGGCG GACAAGTAAG 2160 

TATAGAAGCA GCAGCCTTAG AAAAGAAAAA TGTAGATTTG AACGAATTAT TACAGCGTCT 222 0 

CAACGACGTT GAACAAACGA ATACACCAGG TTCGTTAAAT CCTAAATTTT TAAATGTTTC 228 0 

ATCACTTATT CAATATATTC AATCAGCATA TCATGAACCT CTAAGAGAAG AATTTAAAAA 2 34 0 

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TGAGTTAAAA GAAACATACG ATACATTTAA AAATGGCATG TTAGAGCATA TGCAAAAAGA 24 6 0 

AGACGATGTC GATTTTCCAA AACTCATTAA ATATGAGCAA GGTGAGGTAG TAGACGATAT 2 52 0 

TAATACTGTG ATAGATGATT TAGTTTCAGA CCACATTGCA ACGGGAGAAT TGTTAGTAAA 2580

AATGAGCGAA TTAACATCTA GTTATGAACC TCCGATAGAA GCGTGTGGTA CTTGGCGACT 2 64 0 

TGTTTATCAG AGATTAAAAG CACTTGAAGT GrTAACACAT GAACACGTAC ATTTAGAGA.A 2 70 0 

TCACGTATTA TTTAAAAAAG TATCATAAAT AACGCGATTA GAAACTGTTG GCAAAAATAA 276 0 

GTCCAGCAGT TTTTCGCTAT GTATAAAAGT CATAATAGTG ACATAAACAG CATTATTTGA 2 82 0 

AAAGAAnAAT GGTCAACTTA GCATAAAAAT TGATATGAAn ATTTAATGGT ATAGATAATT 28 80 



A 2881

(2) INFORMATION FOR SEQ ID NO: 255: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1056 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 255: 

ACCGTCGAAT ATCGCTTGTG ATTTACAATT TGTGTATTAA GATGCTCAAC TAATTTGGGT 6 0 

ACATATTCCG AATTTAGATT TGCAAGTACA ACAATTCCAT AATTTTGTTT TGGATTTAGT 120

PJ\PJ^ThPATG ATGAAAAGTT ATCTAGCGTT CCTGAATGAA hCACTAPATG TTCATCATTA 180

TTGGTAAACC AGCCGGAAGC ATATGCATTG GCATTAGGTT CACCAATTGT TGAAGATAAA 240 

TTTTTATGTG ATTGTTGAAC TAATGATTTG TATTTATCAG GTGGATTAAG TTGGAATTTT 300

ATCC^TGTT CCAAATCTTC AGTTGATGTC ATCATATATG CTGATGGTGT ATCCCAAAGG 3 60 

TTAAATTCAG GTTTAGAGAC GACAGGTGTC GAACCTTGTA ATTCATAGCC AATAGCATCA 42 0 

TGTTTTGATT TGTAATTGGT TTGTTTGAAT GATGTATGTG TCATATGCAA AGGCTTGAGC 4 80 

CATGAATTTG TAATATATTT TGTATAGGAT TGCTTCGTAA CGTTTTGGAT AATTAAACCT 54 0 

AATAAATCAT AGTTCATATT TGAGTATTCA AATTCTTCTC CGGGCTTATG ATGTAATTCA 60 0 

TCACCCATAA TTGCATGGGT TACATCATTT AAACGATTAT TTTTGCTTGT CACAGAATCT 66 0 

tCGCTTGTAA TAtCACTAGG TATACCACTT GTTTGAGCCA AAAGTTGCTT AATCOTAATA 720 

GTTTCATTTT GACCATTATA GTTCATTTTA AAATGAGGCA CATGTTTGGA TACGGCATCA 78 0 

TTTAAGTTTA ATCGACCTTC TTGAGCTAAT TTTAAAATTG CAAGACCTGT GAAAGCTTTC 840
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TGATAACCAT AACCTTTATT TAAAAAAACT TTGCCATTTT TTACTAyTAA AATTGATGCT 960 

CCAGGAATGT GTCCCTTTTG TAAATCATGC TCGATAATTG TATCTATTTG TTGTTGCGAA 1020 

TCATTGGTTA ACCGTGTCTT CGTATTGCTA TTTAAT 1056
(2) INFORMATION FOR SEQ ID NO: 256: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1277 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 256: 

ATGCCaCACT TATTGGTGGC AGGATCGACG GGTAGTGGTA AATCTGTTTG TATAAATGGT 60 

ATTATTACAA GTATTTTATT AAATGCTAAG CCGCATGAAG TTAAACTTAT GTTAATCGAT 120 

CCGAAAATGG TTGAACTAAA TGTTTATAAC GGAATTCCAC ACTTATTAAT TCCGGTTGTT 180 

ACAAATCCTC ATAAAGCTGC TCAAGCTTTA GAAAAAATTG TAGCTGAGAT GGAAAGACGT 240 

TATGATTTAT TCCAACATTC ATCAACTAGA AACATTAAAG GTTATAACGA ATTAATCCGT 300

AAGCAAAATC AAGAATTAGA TGAGAAGCAA CCAGAATTAC CTTATATCGT TGTTATTGTA 3 60 

GATGAGCTTG CAGATTTAAT GATGGTAGCT GGTAAAGAAG TTGAAAATGC GATTCAACGT 42 0 

ATTACACAAA TGGCACGTGC AGCAGGTATA CATTTAATTG TAGCGACACA AAGACCTTCT 480

GTGGATGTAA TTACAGGTAT CATTAAAAAT AATATTCCAT CTAGAATAGC TTTTGCTGTG 54 0 

AGTTCTCAAA CAGATTCAAG AACTATTATT GGTACTGGCG GCGCAGAAAA GTkACTTGGT 6 00 

AAAGGTGACA TGTTATACGT TGGAAATGGT GACTCATCAC AAACACGTAT TCAAGGGGCG 6 60 

TTTTTAAGTG ACCAAGAGGT GCAAGATGTT GTAAATTATG TAGTAGAACA ACAACAGGCA 720 

AATTATGTAA AAGAAATGGA ACCAGATGCA CCAGTGGATA AATCGGAAAT GAAAAGTGAA 780

GATGCTTTAT ATGaTGAAGC GTATTTGTTT GTTGTTGaAC AACAAAAGGC aAGTACATCA 84 0 

TTGTTACAAC GCCAATTTaG AATTGGtTAT AATAGAGCAT CTAGGTTGAT GGATGATTTA 900 

GAACGCAATC AGGTAATCGG TCCACAAAAA GGAAGCAAGC CTAGACAAGT TTTAATAGAT 960 

CTTAATAATG ACGAGGTGTA AAAAAATGTC AGAAATGAAT GCGGTATATA ACGTTAAACA 1020 

ATaCATTTtA AATTTgATTA AGCAAAATAA ATTGGAATAT GGTGACCAAC TTCCAAGTAA 10 80 

TTTATCAATT GCCAGAGAAT TAAATGTAAA AACCGACGAT GTTTATGAAG CAATTCAGcA 114 0 

TTGATTACTG AACAAGTCAT TAAAGATAtT TTGAAGAGGG CACAAGTGTT AAGTCACTGC 12 00 

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GrtTTGAATG CGGAACT 1277 
(2) INFORMATION FOR SEQ ID NO: 257:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 3557 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 257: 

TACCGTCGTT TTATGyGTCA AATTTTACAG TAAATTTTGC TTCATCAAAA GAAATAACCT 60 

TTAACAAGTA TAGTAATTTT ACATTTTACA ATGTTACAAA ATATAATTTT TTATAATTAG 120 

TTAAAATCAC TAAAACGCTT TTATACACTA TCAAATCAGC ATTTATAAAA ATATGAACCG 180 

ATATCCTAAA ATGTTAATAA TATTACAAGA TAATAACArA CCACACAAAG CTACTTATTT 24 0 

TTGATAATAT GGAAATCGTA ATATAAAACn AAAACTTAAT TTACTATATA AATTGTCTTA 300 

ATAATTTTTA AAAGTAGTAA AACATAATTT TAAGGAGGAG TCCCTTTGAA AAAATTAGCA 36 0 

TTTGCAATAA CAGCAACATC TGGTGCAGCT GCATTTTTAA CGCATCATGA TGCACAAGCT 420

TCTACACAAC ATACAGTACA ATCTGGTGAA TCATTATGGA GTATTGCTCA AAAATACAAC 4 80 

ACTTCAGTAG AGAGTATTAA ACAAAATAAC CAATTAGATA ACAACTTGGT ATTCCCTGGT 54 0 

CAAGTTATCT CAGTAGGTGG AAGTGATGCA CAAAATACGT CAAACACTTC TCCACAAGCT 600

GGTTCAGCAT CATCTCATAC TGTACAAGCT GGTGAATCAT TAAATATCAT TGCTAGCAGA 66 0 

TATGGTGTTT CAGTTGATCA ATTAATGGCA GCCAATAACT TACGTGGTTA TTTAATTATG 720 

CCTAACCAAA CATTACAAAT TCCTAATGGT GGATCAGGTG GTACAACACC AACAGCTACA 780 

ACACSSTAGCA ATGGCAATGC ATCATCTTTT AATCACCAAA ATTTATACAC TGCTGGTCAA 84 0 

TGTACATGGT ACGTATTTGA CCGTCGTGCT CAAGCTGGTA GTCCAATTAG CACATATTGG 900 

TCAGACGCTA AGTATTGGGC TGGTAACGCA GCTAATGATG GTTACCAAGT AAACAACACA 96 0 

CCATCAGTTG GTTCAATTAT GCAAAGCACA CCTGGTCCAT ATGGTCATGT TGCTTATGTT 102 0 

GAACGTGTCA ATGGTGATGG TAGTATCTTG ATTTCTGAAA TGAATTACAC ATATGGTCCA 1080 

TACAATATGA ACTACCGTAC AATTCCAGCT TCAGAAGTTT CTAGCTATGC ATTCATCCAT 114 0 

TAATTAAATA AATTGTACTG ATATATACTA GCAATTCACA TCATGTGAGA TTGCTAGTTT 120 0 

TTTATTTTTG AAAAAAATTT TCATTTTGGT ACAAAAAATT ATCTCACCCT TCCCTATCAT 1260 

ACATATTTAT ATTTTGTATG AATGGTAGTT AGGTAAAAAT TAACAACCTA CCTATTTGAT 132 0 
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20 



ATTTAATTTG TTATACCAGT ATTTTACGCT TTTTCGTCTA CAThThCPiAA TTTATATTAA 144 0 

ATAAAGCCCA ATACAATTTA GGTTAATTAA ACAAGTTGAT AACTATTTAA TTATTCCTTC 1500 

ATTGAAGAAT ATAAACTATT AAATCATTAT TTTGCTCTTA CATATATTTT AATGACCTAA 1560 

CTGaTTATGT TCCATGGAAT ACATTTATAA TATAGCCTCC TAATTAArAT GCyTTGTCTT 162 0 

GGTCATTCTA CGTAAATTCT ATAAAATATG TTATCTACTT ACATAArArn CTGrACTTCA 168 0 

ATACCACCAT ATGTTTGTGA TACTGAAGTT CAGTTTAGTT TTATTTTCAA TTAGAAAAAT 174 0 

AAGTTAAGTA TATAGAATAG TAAACCTGCT AACAATGCTG AAATAGGTAA TGTAATCACC 1800 

CATGTAATGA TCATTCGTTG CGCAgTGCTC CATTTTACAC CTTTAGCTCG GTTAGAAGCA 1860 

CCAACACCTA AGATTGATGA TGACACAACG TGAGTTGTTG ATAATGGGAA ATGTAGCGAT 192 0 

GATGCAACAA AAATTGTTAA TGCAGATGAT AAATCGGCCG CAGCACCATT TGCTGGACGT 19 80 

ATTTTCATAA TATTACCACC TACAGTTTTG ATAATTTTCC AGCCACCAAT TGCAGTACCA 2 04 0 

AGCCCCATTG cTGTCGCACA GGCAAATTTT ACCCATAACT GTGGTTCAAC ACTGCCATCA 210 0 

TTCTGTACAT TAGCGACAAT CAATGCCAAC GTAATAATAC CCATTGATTT TTGCGCATCA 216 0 

TTCGTACCGT GAGAGAATX3A TTGTAACGCT GCTGTGAAAA TTTGGAAAAA TCTAAAGTTA 2220

CGATTCGCTC TTGTTAAATT TGCATTTTTA AAGATAACTT TAAAAATTOA ATACATCAAG 2280 

AAACCAACAC AAAATGCGAT AATCGGTGAA ACGATTAATA CAATAATAAT TTTTGTGAAA 234 0 

CCTTGGTAAT GTAACACTCC AAATGAGCCT TCAGATGCGA TTGCTGCACC CGCAATTGAA 2400

CCTATAAGTG CATGTGAAGA CGAACTTGGA ATTCCGTAAA ACCAAGTAGC TAAATTCCAA 24 6 0 

ATAATAGCCG CAAGTATTGC AGCTAACACA ACAACTAATC CATTTTCCAA TTTAAATGGA 2 520 

TCGACAATGT CTTTAGTAAT GGTGCCTGCA ACGCCCGTAA ATGTTAAAGC ACCTATAAAG 258 0 

TTCATCACTG CTGCCATTAA AATTGCCGTT TTAGGgTTAA CGCTCTAGTA GATACAGCAG 264 0 

TAGCTACTGC ATTGGcTGTA TCATGGAaTC CcATTGATAA AGTCAAATAT CAGCGAGAAA 27 0 0 

ATAACTACAG CTATAGTGAC GATGATTATA TATGACATAA ATATATACTC CCCTTAGCTA 276 0 

TTTTTCATAA TAATAGTTTC AAAATTATTT GCTACGATTT GACATTTATC AGCGATTTCT 2 82 0 

TCCATGCTTT CATAAATATC TTTTATTTTA ATTAAAGTGA TTGGATCTGT TTCGCTATTG 2 8 80 

AAAATATGTT TAATTGACTG TCTTAAAATA CCATCACAGT TTGTTTCAAA TTCTTTAATA 2 94 0 

TTAATTGAAT GAATACGCAT ATGTGATAAT TTTTTATCGA CTAATAAGCC GACAGCAAGT 3 00 0 

TTCATTTCTG CAACTGCTTT TTGAATGTTA TCAACAAACT CAGCCATATA TTCATCTGTG 306 0 

TATTCGATTG AATACATTTC AAACATrGCT GCCGTTTCTT CAATTGCATC TAAAACATCA 312 0 
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TTTAAATCAG TAATTACTTG ATGTACTAAt tCGcACCATG TGACTCATAA GTTTTAATGT 324 0 

TGTCTGAGTA TGCTTTTAAA TCTAAATGTG TATTGAAATC CATTTTACCG AATTCAATAG 3 3 00 

CAGCACGATC CAGATTGAAA ACCATCTCTT CTAATTGAAC CATAAACTTA TCTTTTTTCT 3360

TACTAAACAT TTAAAATCCT CCATTTAAGC GATTGTCACC AATCACATTC AGTTATAATT 34 2 0 

TGTTTCAAAT TAAGACAAGT GAATTTACAA ACTAATGATA CAAATTTGTT ATTATCAATC 34 8 0 

GTCAGTATAA TTTTAGTGTA CTGATATTAA TTTCAAAAAT GCCTCACAGT AAACAATTTA 354 0 

CTGTATTTGC CCTTATA 3557



(2) INFORMATION FOR SEQ ID NO: 258: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1631 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 258: 
AACTATACAT TTCGAAAAAT TCTTCTAGTG AACCTGCGCC ACCAGGAGCC ATGACAAATG 60 

CATCTGCAAG TTCTGCCATT TTATTTTnAC GTTCATGCAT AGAATCAACT AAAATTAATT 12 0 

CAGTTAAACG TTGGCTTGTG ATTTCATGTT CATCTAACAT TTTAGGCATG ACGCCAATAG 180 

CTTTGCCGCC ATGATCTAAT ACACCATCTT GaATGGCACC CATAATGCCA ATTGACCCTG 240

CACCAAATAC TAATTCATAA CCTTGTTCAG CAAAATATTT ACCTAAATCG TATGCTTTTT 300 
GTACATATGA AGGGTCATGA CCTTTGCTTG CACCACAATA AACTGCGATT CGTTTCATGT 360 

TAATCCAGCT CCTTAATTCG ATGAATGACT TTTAATAGTG ATTGTTCAAA CACTTTTTGA 420
TCTT0CTTTG TAAAAGGTGG GGGACCTTTG TGGCGACCAC CTTGTTTTCT AATTTGTGCA 480 
TTCATATATC GTTTATCTAA TAGTTGTTGA ATATTTTTGG AATTGTATAT CTTCCCATTA 54 0 

TGATGCATGA CAATTAAGAC TTTGTCGACT AATAAACTTG CGAGTCCATA ATCTTGAGTG 600 
ACTACGATAT CATCCTTCGT TGATAATTGA ACAATTTTGT AATCAACTGC ATCTGGTCCA 660 
TCATCAACAT ATAATGTTGA TACATGTGGA GGATATAATT GGTTCGAAAA ATGGCTGAAG 720 

CTCCGAATAA TTGTCACAAA AATGCCTGTC TCAGTTGTTA AATCTATAAT AGAATCAACA 7 80 

ACAGGACAAG CATCTCCATC AATAATAATA TGTGTCACAA TTATGCCTCT GTATTGTTTT 84 0 

CTTTATTTTG TTGAGAGGCG CTTTTGGCAA CATAATCTTT ATATTTTTTA AATGACTTGA 90 0 

TGCGTGCTTT ATCAGCTTCT TGTTGGCGTT TTTGTTCTTC TTTGTGTCGT TTTTCAATAT 96 0 
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CGCCTTTTTT CTCAGTTTTC TCATCTAATT TATTAGGTGT TAAGCCTGCT TTTTCTTCGT 1080 

ATTTTTGTGA TTTTTTCATA TCTTTAATAC GTTGTATTTC ATTCTTTTCG CGGGCTTTTT 114 0 

GCTCTTCTTT ATGACGCTTT TCGATATTTT TTTGAAGTAT TTTATTCATT TTATCAGCGT 1200

CTTTACGATT TTGTTTAGCT AATTTTTCGC CTTTTTTCTC AATATAGGCA GGATCATGTT 1260 

CTCTAGCAAA CTTTTTAAGT TCACGTTTAT TTTCAAAATC TTGTTTTTTA TCGCCGACAT 1320 

ATTCTTTAAC ATCACTCGCT GTGTTACTGA TTGCTGCAGA TGTTTTTGAA GCAACTTTAC 1380 

TTGTAGCATC TGTAACTTTT TGTACGTCCX3 GATGTTGTTT GATACGTTTA CGTTCAACAA 144 0 

TTAACGGTAC CAATACAATT GGTAATACAT TAATCATAAA TTTGATGACT TTTTTCTTAT 1500 

CCATAGATCT TGcCTCCaTA ATTACTTTAT TAAtTTTACa TACCCtATGa TACATCAATA 156 0 

TAAACGATGA TAGTAGTGAA TCACTATTAA GTATTTCAGA TGTTTtTTAA AAgaAGaCCC 162 0 

AATTAnAAAA A 1631


(2) INFORMATION FOR SEQ ID NO: 259:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 6645 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 259:
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CGAAATCATG ATTTAATGCT TTTTCATATA AGCTTTTCCA ATTAATCTTT CGTCCATGAT 60

ATTCTTCAAC TGTTGCTAGA TATTGTGCAA TTTTAGTTAC TTTAAAGGAG TGTGCTGCAA 120

CaTTGTGkTC mAAATATTTA AATTTTCCaG GtAATCTTAT AAGTCTTTCC aTATCTGATA 180

ATCT^TTAAA ATATTGATGT ACACCCATTT CAATTACCTC CTCCATTAAT TAATCATAAA 240

TTATACTTTC TTTTTACATA TCAATCAATT AAATATCATT TaAATATCTT CTTTaTATAA 300

CTCTGATTAA ATGATACCAA AAAATCCtCT CAACCTGTTA CTTAAACAGG CTAAGAGGAT 360

AGTCTTGTCT TGATATATTA CTTAGTGGAT GTAATTATAT TTTCCTGGAT TTAAAATTGT 420

TCTTGAAGAT TTAACATTAA ATCCAGCATA GTTCATTTCA GAAACAGTAA TTGTTCCATT 480

AGGGTTTACA GATTCAACAA CACCAACATG TCCATATGGA CCAGCAGCTG TTTGGAAAAT 540

AGCGCCAACT TCTGGTGTTT TATCTACTTT AAATCCTGCA ACTTTrGCTG CGTAATTCCA 600

GTTATTTGCA TTGCCCCATA AACTTCCTAT ACTTCTACCT AATTGTGCAC GACGATCGAA 660

AGCATAATAT GTGCAGTTTC CATAAGCATA TAAGTTTCCT CTGTTAGCAA CTGATTTATT 720
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TACATTAACT GTCTTAGTTA CTGCTTGCTT AGGTGCTTGC TTAACTACTA CTTTTTTAGA 840

TGCTTGTTGT ACAGGTTGTT TTACTACCTT TTTAGCTTGG CTTGCTTTTC TTACTGGTGA 900 

TTTAACCGCT TTAGTTTGTT TCACTTTATT TTGAGGCACA AGTGAAATCA CGTCACCAGG 960

AAAAATTAAA GGTGTTACAC CAGGATTGTA TQAATATAAT TGATTCAACG TTAAGTGATG 1020 

CTCTAAAGCA ATCTTATATA ATGAATCGCC AGCAACTACT GTATAAGTTG TCGGTGATTG 1080

CGTTTGTGCT TGAACATTTG ATACATAATT ATGTTGAACA GGTGTTTTTA CTTGTGTGCC 114 0 

ATGTTGTTGT GCATGTGCTG CATTATTTAA AGCTAAAAAA GCTAACACTG ACGAAACCGT 1200 

CACTGTAAGA GATTTTTTCA TCTTGCTGTC ATTCCTTTGC TGTTAGTATT TTAAGTATGC 126 0 

AAATACTATA GCACAATACA TTTTGTCAAA AGCTATTGTT ATAACGATGT AATCAAATGG 132 0 

TTAACAATAT AAAAAGAATA CAACCTTTTA TCATAGTGTA AAATGTATTC ATACCATGTA 1380 

ATTGAGAACG TTTTCAATAA TTAATTCAAT ACCTTGAAAA TCGCCATAGG TAATATTACT 144 0 

AAATGCACAC TGCATATGTT GTTTTAACAA ACACAACTTT TAAAAAATAT ATTCTAACTC 1500 

TATCTACCGA ATTGTACTTA AATATTCATA AACAAATCAT ATTCCAAAAT CTAATTTACA 1560 

ATTTATTTAG CTACCTTTAA AAAAACCAAA AACCGACGCC CTTTTAGAGC CTCGGTTTTA 1620

AAATATATAT TAATCGTGCG ACATTGTCTG TCTTAAATAT GATTCGATAA ATGGTCCAAT 168 0 

GTCTCCATCC ATCACTGCAT CAACCTTACC TGTTTCTTCG TTCGTACGAT GATCTTTCAC 174 0 

CATTGAGTAT GGATGGAAAA CATATGATCT AATTTGGCTT CCCCAGCCGA TTTCTTTTTG 1800

TTCGCCACGA ATTTCAGCCA TTTCACGTGC CTGCTCTTCC AATTTTAATT GATATAATTT 1860

AGACTTTAAC ATTTTCATAG CTGCTTCACG GTTTTTAATT TGAGAACGTT CATTTTGGTT 1920 

ATTAACAACT ATACCTGAGG GGTGGTGGGT AATTCGTATT GCCGATTCAG TTTTGTTAAT 198 0 

ATGCTGACCA CCTGCACCAG AAGCTCTGAA TGTATCAACT GTAATATCAT CCGGATTGAT 204 0 

TTCAATCTCT ATTTCATCAT TATTAAAATC TGGAATAACG TCGCATGATG CAAATGATGT 210 0 

ATGACGACGT CCTGATGAAT CAAATGGAGA AATTCGTACT AGTCGGTGTA CACCTTTTTC 2160 

AGCTTTTAAA TAACCATAAG CATTATGCCC TTTGATGAGC AATGTTACAC TTTTAATCCC 2220 

CGCTTCATCC CCAGGTAGAT AATCAACAGT TTCAACTTTA AAGCCTTTCT TCTCACAATA 2 280 


ACGTTGATAC ATTCTAAATA GCATATTAGC CCAATCTTGA GACTCCGTGC CACCTGCACC 2 34 0 

AGGATGTAAC TCTAGAATTG CGTTATTGGC ATCGTGAGGC CCATCTAATA ATAATTGCAA 24 00 

TTCGTATTCA TCCACTTTAG CCTTAAAATT AATGACCTCT TGCTCTAAGT CTTCTTTCAT 246 0 

TTCTTCATCA AATTCTTCTT GTAATAAATC CCAAGTAGCA TCCATGTCAT CTACTTCTGC 2 52 0 
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TTGCGCTTTC GTTTGGTTAT CCCAAAAATT AGGTTCTGCC ATCATTTCTT CATATTCTTG 2 64 0 

AATATTAGTT TCTTTGTTCT CTAAGTCAAA GAGACCCCCT AATTTGTGTT AAATCTTGAT 2700 

TATACTTATC TATATTTCGT TTGATTTCTG ATAATTCCAT AGcATTCGCT CCTATTTATA 276 0 

TTTCAATTCA AGTCATTGAT TTGCATCTTT TATAATGCTA AATTTTAACA TAATTTTGTT 2 820 

AAATAACAAT GTTAAGAAAT ATAAGCACAC TGACAATTAG TTTATGCATT TATTGTTAAA 2 8 80 

AATCAGTACA TTTATCATCG ACATATGCCT AAACCGATTT TTTAAAACTA AGTACATAAC 2 94 0 

AACGTTTAAC AACTTCTTCA CATTTTTTAA AGTATTTAAC GCTTGTAAAA TAAAAAGACT 3 0 00 

CCTCCCATAA CACAAACTAT AGGTGTTTAA TTGGAAGGAG TTATTTTATA TCATTTATTT 3 06 0 

TCCATGGCAA TTTTTGAATT TTTTACCACT ACCACATGGA CAATCATCGT TACGACCAAC 312 0 

TTGATCGCCT TTAACGATTG GTTTCXJGTTT CACTTTTTCT TTACCATCTT CAGCTGAAAC 3180 

GTGCTTCGCT TCACCAAACT CTGTTGTTTT TTCACGTTCA ATATTATCTT CAACTTGTAC 324 0 

TACAGATTTT AAAATGAATT TACAAGTATC TTCTTCAATA TTTTGCATCA TGATATCAAA 33 00 

TAATTCATGA CCTTCATTTT GATAGTCACG TAATGGATTT TGTTGTGCAT AAGAACGTAA 3360 

GTGAATACCT TGACGTAATT GATCCATTGT GTCGATATGA TCAGTCCAAT GGCTATCAAT 3420

AGAACGAaGT AAAATCATAC GCTCAAACTC ATTCATTTGT TCTTCTAAGA TATCTTTTTG 34 80 

ACTTTGATAT GCTGCTTCAA TCTTAGCCCA AACGACTTCG AAAATATCTT CAGCATCTTT 3 54 0 

ACCTTTGATA TCATCCTCTG TAATGTCACC TTCTTGTAAG AAGATGTCAT TAATGTAGTC 3600 

GATGAATGGT TGATATTCAG GCTCGTCATC TGCTGTATTA ATATAGTAAT TGATACTACG 366 0 

TTGTAACGTT GAACGTAGCA TTGCATCTAC AACTTGAGAG CTGTCTTCTT CATCAATAAT 372 0 

ACTATTTCTT TCGTTATAGA TAATTTCACG TTGTTTACGT AATACTTCAT CGTATTCTAA 37 8 0 

GATA&GTTTA CGCGCGTCGA AGTTATTACC TTCTACACGT TTTTGTGCTG ATTCTACAGC 384 0 

TCTTGATACC ATTTTTGATT CAATTGGTGT AGAGTCATCT AAACCTAGTC GGCTCATCAT 3 90 0 

TTTCTGTAAA CGTTCAGAAC CAAAACGAAT CATTAATTCA TCTTGTAATG ATAAATAGAA 3960 

GCGACTATCC CCTTTATCAC CTTGACGTCC AGAACGACCA CGTAACTGGT CATCAATACG 4 020 

ACGAGATTCA TGTCGCTCTG TACCTATTAC TGCTAAACCG CCTAATTCCT CTACGCCTTC 4 0 80 

ACCTAATTTG ATATCTGTAC CACGACCAGC CATGTTAGTG GCAATAGTAA CGGCACCTTT 414 0 

TTGTCCAGCG CCTGCAACAA TTTCAGCTTC ACGTTCATGA TTTTTCGCAT TTAACACATC 42 0 0 

ATGACGGATA CCACGTTTTT TAAGTAAATT TGAAATATAT TCAGAAGTCT CAACTGCAAC 426 0 

AGTACCTAAT AGCACTGGTT GCCCTGCCTT GTGTTTTTCA ACAACATCTT CTACTACTGC 4 3 20 
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TTTATTTGTC 

AGrrrxAGCT 

TGTAATAGAC 
AGCTTGGTGT 
GACAATTAAT 
CGCACGTAAA 
TTCAACTTTG 
CGTTTTTTCA 
TTGTGTATAA 
CTCGTCGATT 
TCTATCTTCA 
AGTACTGTAA 
TAAGTTTAAT 
AACACTTGAT 
TAAGTATGTT 
ATCACCTTTA 
TACACGTTTA 
ATCATTTTGC 
ATTACGAATT 
AGCAAGTTTA 
TCCC^iTTTCG 
ATTTATAGTT 
TGACATACTT 
TACGCAATTA 
TATACAATAG 
TTAGCATAGT 
ATTTGCTTTT 
TATTGTTCAC 
GTTCCATCAG 



GGAATTTGAG 
GTACCTGTCA 
GCCATAGTTT 
AAACCTTCCG 
ACTTCGCCAT 
GCTGTGTTGA 
AACATACGTT 
TCGTATTTAT 
AGTGACGTTG 
AAAATTGAGT 
GAATAATTCA 
GTAATGTCTT 
CCGACAGTCA 
AAGTATTCAT 
GGCATTGTCG 
TGAATTGCAA 
GAGCCTTCTC 
TTTTTGACAT 
TCTTCATCAG 
CCTAACTGTT 
TTCGCTCCTT 
AGAAAATTAT 
CATTCACATT 
TACTTCCTAA 
ACTGACATAC 
CACTATGTTT 
GGAGTATAAA 
TAGTTTGAAT 
TTTCTCTGTC 



TTACTGTCAT 
TACCCGCAAO 
TAGATTCATT 
AGAAACGACG 
CAACAACCAT 
TATGACTAAT 
CAGCTTTATC 
AATCTTCGTC 
ACTTTTCAGC 
CAACCTCATC 
CCATGTTATC 
GTGCGTATGC 
AACCTAAGAA 
TGACTGTAAT 
CTGTTAATGT 
TACCACCCAT 
TAACAAGTGC 
TATCAATGTC 
TTAAAATTGC 
TAATTTCTTT 
TAGCTAAAAA 
ACTTATTTAC 
CTGTTGTCAA 
TAAATTATAT 
TATTATTAAC 



GTTATAAATA 
TTTATTGTAC 
TTGAATTTGA 
GCCTGGCATT 
ATAGTCTACG 
AACATCAACA 
CGCACCTTGT 
CTGTTTTAAC 
TTCACCAGAA 
AATGATTGCA 
TCGTAAGTAA 
TTCACGrrrT 
GTTATATAAC 
AACGTGAACA 
TTTACCTTCA 
AATTTGAACT 
ATATGCTTCT 



TTTCTAAATT 
ATTCTGAAAT 

ACGCCTTCCT 
GTACGTCCTG 



TCACGTTGTA 
TTTTGTACAT 
TCTGTTAAAT 
ATTTTCGCAA 
ATAATTAATG 
AAATGTAATG 
TCAAAACCTA 



iA- i L CTGTCG 
TCAGCCATTT 
CCTCTACCAG 
CCTGTTCTCA 
TTATATGGTG 



AtiUi AAi iCT 



CATTCAACAA 



AAAGCACTTG 
CAAGCCATAT 
TGTGAATACA 



CGTTTTTTCT 
ATTATTGCCA 
ACTGTTTGGC 
TCATTTGTAG 
CAAGTTTATC 
TATAAATATT 
TTAACATTCA 
ATTACATTAT 
TGCAAAAACA 
TTACCGTCTT 
AAGAAGTCAT 



GGTAAAATTT 
G1TTGGAATT 
TCTAAAGCGA 
TCAAGAATTT 
CTACAACAAT 
AATCAATATA 
ACTAATAAAT 
TTACGATTTT 
AATATATACA 
CGAACTATGA 
CAAGTGCTTT 
TACGGCGGTA 
GACCTAATAG 



CTTCTTCTTC 
AGTTTTGGAA 
TCGCTTCAAT 
TAAATTGATC 

CATATAAGTT 
GTACAGCTTT 
AAACATTTGC 
GCGTACGTGC 
GACGCATTAC 
GCTCATTATT 
T" (2TTACTGTT 
CTTCACTTTG 
CTAATGCATT 
TCTCAGCGAT 
TCATATTGAA 
TATCTAAATA 
GTTTCGTTTT 
TTACTTTATC 
TTGATAAAAA 
ATATCTTATC 
AATATATTTA 
ATATTCTCAA 
CGACTCGGAC 
TCCATTAACA 
AATAGTCATA 
AAACTTAATT 
AACGATACTT 
ATTCATTTGT 



4440 

4500 

4560 

4620 

4680 

4740 

4800 

4860 

4920 

4980 

5040 

5100 

5160 

5220 

5280 

5340 

5400 

5460 

5520 

5580 

5640 

5700 

5760 

5820 

5880 

5940 

6000 

6060 

6120 
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ATCTCGTTAT CATCGTAAGC GTCATTATCA ACTTGTGTTT CTTGCATTTC TTGTAATTCG 624 0 

GCAACAAACA CTTCTTGATC TCCTCGATCA CGGCTCTTAC GATTAATACG TGTTTTATAT 63 0 0 

TTTCGAACTT GTCTTTCAAG TTTATTATTA ATTAAATcAA TACCTGCGTA TAAATCATCG 636 0 

TTTCGCTCTT CAGCTCTTAA CGTAACATTT TTCAATGGAA TTGTTACTTC AATTTTAGTA 64 2 0 

GCTGAATTTG AATAAGTTTr AACTTTAACA TGCGCCACTG CATTTGGTAC GTCATTAAAA 64 80 

TAACGTTCCA ACTTACCAAT TTTTTCCkCA ATATAGTTGC GAATAGCATC TGTGATAGTG 654 0 

AGGTTATCTC CATGAATTTC AAATCTAATC ATAGTAAATC TCTCCTTAAA CCTCTTTATn 6600 

GGnAACTCnT TATTATATTT AACATTTTTA CGCCAATCGT GCAAA 6645 
(2) INFORMATION FOR SEQ ID NO: 260: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7430 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 260: 

CAGTTCCAGC ACAtCTATTG GGGATCAACA AACTAGGGAA AATGCTAATT ATCAACGTGA 6 0 

AAACGGTGTT GACGAACAGC AACATACTGA AAATTTAACT AAGAACTTGC ATAATGATAA 12 0 

AACAATATCA GAAGAAAATC ATCGTAAAAC AGATGATTTG AATAAAGATC AACTAAAGGA 18 0 

TGATAAAAAA TCATCGCTTA ATAATAAAAA TATTCAACGT GATACAACAA AAAATAACAA 240 

TGCTAATCCT AGCGATGTAA ATCAAGGGTT AGAACAGGCT ATTAATGATG GTAAACAAAG 3 00 

TAAAGTGGCG TCACAGCAAC AGTCAAAAGA GGCAGATAAT AGTCAAGATT CAAACGCTAA 360 

TAACSATCTA CCTTCACAAA GTCGAATAAA GGAAGCACCA TCATTAAATA AGTTAGATCA 420 

AACAAGTCAA CGAGAAATTG TTAATGAGAC AGAAATAGAG AAAGTACAAC CACAACAAAA 480 

TAATCAAGCG AATGATAAAA TTACTAACTA CAATTTTAAC AATGAACAAG AAGTGAAaCC 54 0 

TCAAAAAGAC GAAAAAACAC TATCAGTTTC AGATTTAAAA AACAATCAAA AATCACCAGT 600 

AGAACCAACA AAGGACAATG ACAAGAAAAA TGGATTAAAT TTATTAAAAA GTAGTGCAGT 66 0 

AGCAACGTTA CCAAACAAAG GGACAAAGGA ACTTACTGCA AAAGCGAAAG ATGATCAAAC 720 

GAATAAAGTT GCCAAACAAG GGCAGTATAA AAATCAGGAT CCTATCGTTT TAGTGCATGG 780 

TTTCAATGGG TTTACAGATG ATATTAATCC TTCAGTGTTA GCTCATTATT GGGGCGGTAA 84 0 
TAAAATGAAC ATTCGCCAAG ATTTAGAAGA AAATGGTTAC AAAGCTTATG AAGCAAGTAT 900
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TCGTGTAGAT TATGGTGCAG CACATGCAGC AAAATATGGA CATGAACGTT ATGGAAAAAC 1020 

ATACGAaGGA ATTTACAAAG ACTGGAAACC AGGACAGAAG GTACACCTAG TTGGACATAG 1080 

TATGGGCGGT CAAACGATAC GTCAACTAGA AGAATTACTG CGTAATGGTA ATCGTGAAGA 1140

AATAGAGTAT CAAAAGAAAC ATGGTGGcGA AATTTCTCCA CTATTCAAAG GTAATCATGA 1200 

CAATATGATT TCATCAATTA CTACTTTAGG AACACCACAT AATGGTACAC hCGCATCAGA 1260

TTTAGCTGGT AATGAAGCTT TAGTGAGACA AATCGTATTT GATATCGGTA AAATGTTTGG 13 20 

TAATAAAAAT TCAAGAGTAG ACTTCGGGTT GGCTCAATGG GGTCTAAAAC AGAAGCCAAA 13 80 

TGAATCATAT ATTGATTATG TCAAACGCGT TAAACAATCT AATTTATGGA AATCAAAAGA 14 4 0 


TAATGGATTT TACGATCTGA CGCGTGAGGG TGCAnCAGAT TTAAATCGTA AAACGTCGTT 1500 

GAACCCTAAC ATTGTGTATA AAACATACAC TGGTGAAGCA ACGCACAAAG CATTAAATAG 156 0 

CGATAGACAA AAAGCAGACT TAAATATGTT TTTCCCATTT GTGATTACTG GTAACTTAAT 162 0 


CGGTAAAGCT ACTGAAAAAG AATGGCGAGA AAACGATGGT TTAGTATCCG TTATTTCTTC 1680 

TCAACATCCA TTTAATCAAG CTTATACAAA AGCGACAGAT AAAATTCAAA AAGGCATTTG 174 0 

GCAAGTGACG CCTACAAAAC ATGATTGGGA TCATGTTGAC TTTGTAGGAC AAGACAGTTC 18 0 0 


TGATACAGTG CGCACAAGAG AAGAATTACA AaATTTTTGG CATCATTTAG CAGACGATTT 1860 

AGTGAAAACT GAAAAGCTGA CTGATACTAA GCAAGCATAA TTTATAAAGT AAAGGGAGGA 1920 

ATTAATAATG ACTGCAGACT TCTTTCAATT AATCGGATCA TTATTTAGAA TTCTAAAAGA 1980

ATTATTCAAG TAAAACATTG GCGAGGCCCC AACATAAAGA ATTTCGAAA.A GAAATTCTAC 2 04 0 

AAACAATGCA AGTTGGCGGG GCCCCAACAA AGAAGCTGGC GGAAAGTCAG CTTACAATAA 2100 

TGTGCAAGTT GGCGGGGCCC CAACATAGAA GCTGGCGGAA AGTCAGGTTA CAATAATGTG 2160

CAAGTTGGGG TGGGACGACG AAATAAATTT TGCGAAAATA TCATTTCTGT CCCACTCCCA 222 0 

TTGGCATTTA CGAAGTTTAA ATGTGCAATT AGAATATATG TATAACAATA TTAAACACGC 2280 

GGTAAAACGA AGTCAGTCAA TTCAAACTGA TTTCGCCCAC CGCGTGTTTT TAACATAGCT 2340

TAATAATTAA TAAGCATTAA TGTTCAATTT ATATGGTTGT TTTCCAATAA TAAACCTAAA 24 0 0 

GATATAGAAT TCACGCAATA TCATGCCGAC ACCTATACAT AATCCTAAAA TGAATAGTAG 24 6 0 

rGATATCGCT AGAAAGACCA TTGTATTATC CTCAAATATA TTTGTATATG CAAACAATGA 25 2 0 

GTCTAGAATG ATTGGATGTA ATAAATAAAT AAAGAATGAG AAAGCACTAA TCATTTGAAT 258 0 

CGTATTAAAT AACATTGTTT TAAAATGCGT GCAAATACCC AAGATAACAA TAAACATAAT 264 0 


ACTATTATAT GGTGTTAATG AATATGAAAA GCTGGTAACG TTCCAATAGT CTCCaTTTGT 27 00 



EP0 786 519 A2 



TCTAAGAAAT TTAATACACG TTCGTAGTTA TAACCCATAT ATGCACCTAA GAAGAAATAA 2820 

AAAATCCATC CGAATATTAT AGTATTTTCA CTTAATGGAT AATAGTGTAG CACGGTATCG 288 0 

TGAAACGCTG TGTTGTTCGT AAAGTAATAT AAAAATGATT GCTGTAAAAT AAAAGATAAC 2940

AATAATAATA TTTTACTGTT GAATAGGTTA TAGTTAATTT TAAAAATGAT ATAACTCAAA 3000 

ATAAAGAATT GCATGATAAC AACGATAAAA TAGCCATACC ATTGACCTAA TAGGACATTT 3 060 


TCAATGAATT GTTTATTGAA ACTTGAATCT GTTAATAATG ATTCACTATA ACTGTAAAAC 312 0 

AATCCCATTA ATATGTAAGG AATAAGTATA TATTTTACGC GTGTAGTTAA GTATCTATAG 3180 

GTGACTTTTT GGTAATTCAA GGTTGTCAGT AACTGTGACA AGATAATAAA GCAAGGTGTA 3240 


CCAAAAATCA CAATATTACG AATGTAAAAT TGTAACACTA AGGATCCACC CTCCATATTT 33 00 

TCATGTTTTA AAGTAATTTG TGTAAGTAAA TGTGTGATAA TAATAATTGC ACATATAATA 33 60 

GCACGTAAAT ATACGAGTTC AAGTCTAATC TTTTTCATGG AATCCGTCCC ATCTCTTAAT 34 20 


TAAATGCTCA AAAGCATCAT CACTAATTAA TATTCTAGGG ATGTAATAAT CATTGGAGTT 34 8 0 

CGGAGTGACT GCTTTTTCCT CTAATGAAAA ACCGTATTTT AACCCAGCTT TTTTGATTAC 3 54 0 

CGGTAATTTA TCGTCATTCA TCAAGCCATA AGGATAGGCT ATAGTTTTCT GCGACTTTTT 3600

AAAGTTTTTA GTTAGATATT TTTCACTTTT GTTTAAATCT TTTATGATTG TAGCTTCAGA 3 6 60 

AGCTTTCATT AATTTTGACT TATTATTTTT AGATAAGTTA TGCAAATCGT GGGTATGTGT 37 20 

TTCAAATTCC CATAACCCAG TTTTATACAT TTCTTTTAGT TCTTTTTTAC TAATCATATC 3780

GAGGTTGTGA AAGTTTTCTT CCCCAACATG ACCTGTGATA ATAAACCCAG TTGCCGGTAT 3 84 0 

TTTATATTTT TTTAAGATTG GATAAGCATT TTCATAAATA GTTTCATCCA TATCATCAAA 3 900 

GTTAATCCAT ACACTTCGTT TTGGaAACTT ACCTTTTTTC TTGtAATATA AAAATTCTTT 3960

CAAqGTTAAA AATTTAGCAT CATGTGATTT TAGCCATTTT ATTTGAGATT CAAATTGTGA 4 020 

TTGACTAACA CTATAATTTT TAATTTCTTT ACTACTAGAA AAGAAGTAAA TAAAATTATT 4 08 0 

CAGAAAATTC GCTTTTCTTA CACGGTGATA ATTTAATGCC AGAGCACTAT TTTCTTTATA 4140 

TTTCAGTTTT TTAGGTGAAT CGTCATCTGC ATTTGCAATA TGATGACCAT CCAGTGTGCT 42 00 

TACAGGCAAT ATGATCAAGA TACTCAACAC TAAAATTATA AATTTTCTAT ACTTCACGAT 4 2 60 


TCTCTTCCTC TCTGCCATTT TTGAATCAAT ATGCTAATTG TAAAAAATAC AAAAATGATA 4 320 

ATCGCGAAAA TGCCCATAGT TTCAAATATA TCTAAAATTT CAGTATTTTC AATGTTTAAA 4 3 80 

GCAACACGTA TTGTATTGAT ACTTTCGTCA TGAATTTCAA ATATAGTACC AATATAAACG 444 0 


AGTAGAACAA CTAAACAATA TATCCAAAAG ACACAAGATA TAGCGATAAG TGCTGTTTCT 4 500 

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TTGCGTAACC ACCrrrCTTA COmTAATG cmTGGAAA TGCGACAAGA ACTACTGCTC 4620 

CGTTAATAAT CCAGTATACT GTCGGATACC AACTTACAAA TATGAGTCCA GCCATArrCT 4680 

rrrrcrcGTA gcgactatca ataaagagtg cgactctaaa ttgaataacg tttataaaag 4740

XCATAGTAAA TCATGATAGT AGAAATATTG AAAAACTATA TGTCATAAAT GTATAGTCTA 4800 

AGAAGTITGC TCTTATGAAC AAATAGCCTA AATATAGAAG CACIATATAT ACCCATAAA.-. 4860 

TCGAGATGAT TK^CTCAAAC ATCAAAATAT ATAAAGGAAA CCmTCGTT TTCATrGTGC 4920 

TAAAAAAGTC TCGTAGTAAT ACTTCGTGTC CCCCTTGAGC CCATCTCACG CGTrGCTTCC 4980 

AAAGACCTCC CAATGriTCT GGAACCAACA TCCAACACAT GGCAAGCGGT TCATACTTAA 5040 

TACGATATCC ACGTAAATGC AATTTCCAAG AAACTGCAAT ATCTTCGGTA ATCATATCAG 5100 

TATCCCAGTA GCCAACGTCG ACAACKJCAC rmTTTAAA TAGAGTGAAG ACACCCGAAA 5160 

TAGTATTGAC TGCGCCAGO. AGTGTCTGAC TtCGCTTAAT ACAGCCAATT AAACTTGCAT 5220 
ATTCTATCGT TTGAATTTTA CCTAAAATAG AACTCTTATl- ■ivu>^H. .v.x„ o^..™— 

TAACTCCACC AAGTrrTGGA TCATGTTrGA AATTCTCAAT CATATAATAT GOTGCATCTT 5340 

GATCAACGAT AGTATCTGCA TCCAAGCACA TTACATAATC ATATGAAGCC TGTTTAATGC 5400

CTTGATTGAG TGCGTTGGCT TTACCTCTGT TTTCnGTAA ATCGACGAAA ATAAAGTCAT 5460 

TATrrrcrrr gattttatag atgagttctg ctgtattatc tgaacttcca tcattaatga 5520 

0 TAATAATTTC riTCTTCTCG TATrXGAGTG CAAGAACATT AGACAACGTA TCTTCAAyCG 5580 

.„^y~^ i-T-^TMTTra TrCACATTTA 5640 

TTTCACTTTC GTTATAACAG GCAAGTAAAA AWi^xv,..- 

TGTCAGGCTT CTTGTTCAAT GAATATCTAA TTrCTCTGGT AAAATAGAAA TAAATTGAAC 5700 

CGACAATCCA GTAAATAGAC ATAAATACAG GATAAAAAAG CAAAAAGTTA AAAAATTGCA 5760 

;,rr&TrrAC CTACcrrrcG ttagttaggt tgtaagccat atggtaattg ATAGTAnrr 5820 

;^ri^CAAT AGATTGnX^T TATAATTAAA CGGAAATATT TGTAATTGCA ACTTAATrrr 5880 

- CCTGTAACAT AGIX^TGATTA ArTTTCAGTA GGGGGTTATA AAAATTGAAG GATAAGATTA 5940 

TTGATAACGC AATAACCTTA TTTTCAGAGA AGGGGTATt^A CGGTACAAO. CTTGATGATA 6000 

TAGCTAAAAG TGTAAATATA AAGAAAGCGA GTTTATArrA CCAmTGAC TCGAAAAAAA 6060 

GTATTTACGA ACAAAG^TT AAATGrTGTT TTGATTACCT TAATAATATT ATTATGATGA 6120 

ATCAAAATAA ATCGAACTAT TCAATTGAl.. cmATATCA ATTCTTATTT GAGTTTATTT 6180

TCGACATCGA AGAAAGGTAT ATTAGAATGT ACOTTCAATT ATCTAATACG CCTGAGGAAT 6240

.XXCTGGAAA TArTTACGGA CAAATACAAG ArTTAAATCA ATCATTAAGT AAAGAGATAG 6300
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TGCTGTTTCT TGAAAGTTGG TATTTGAAAG CATCCTTTTC GCAAAAATTT GGAGCAGTGG 64 2 0 

AAGAAAGTAA AAGTCAATTC AAAGATGAAG TGTATTCGCT ACTAAATATA TTTTTGAAGA 64 8 0 

AATAATTTTT GTTACTAGTT TGTAATAATT AACTTACTTT TGTAAAAAAA GACATGAGAT 6540

TATTTTTTTA AATCTATATA AAGTTGACAA TACAAATCGA TATTGAGAAT ATTAAGATGT 6 6 00 

ATATGAATTT TATAAATTAA ATGCAATACA TTAATATAAA TATCAATTGT TGCAAAATAC 6660 


GATTTGTTCA ATGATTTGAT AATATTATTC TTTATATTTG TGAATGGTTA AGTTTGTCTT 6720 

TGAACATATT ATAAAAGTGT AATGTTCCCC TGAAAAGAAT AAGTTGTCAT CTAATTACAG 67 80 

GAAATCCGCA TAAATTAGAT GAAATGGAAA GTAATAAGTA ATAATTTATT GATAAGCGCC 6840

TATGTGATGG TAAATCATGA CATAGGCGCT TTTTTTTATA AGTTAAAAAT GTAAATAAAA 6900 

ATTATATAAA TTACCCACAT CTTTTTAAAA GGTGTGGGCT TTATTATCAT TAACCCAACT 696 0 

CACAGTGACG GGTTACGCAA GGTATTGAAT TACCGAGTAC GGGCACGCTC GGTGTTGTAA 702 0 

AGAGCAAATA ATCAAGTAAT GATGATGCTT CTACTCGATT ATAAGAAAGC CATGATAGAG 7 08 0 

TACGATGGTA TCTAGTTTTA TTATTAATAG GTTTGGATAT TTAAAGTTGG ACAATATTAT 714 0 


ATCTTGTGCA AAAATATAAA TAAGTTATAC ATAATGGTAG AGAATCATGA TATAATTTTA 720 0 

AACGATAAAA TATTTATATA AATAATTAGA GaAAATGTAG TTGTGTATGT yTTGTGGtCG 726 0 

TTAAACTAGA TATAATTGTC CGATTTATAA AACATACATA ATGAATACaA TGATTGATTA 732 0 


TGTGGAGGAA ACCATGAmAG AniAAGTTTgA TTTAGTAAAA CTATTAAATA TTCTAAAGAA 73 80 

GAATATTAAA TTATTGCTTA TTTTACCGGC AATATGTCTT GTAGTAAGTG 7430

(2) INFORMATION FOR SEQ ID NO: 261:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 4082 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 261: 


ATTGTTACTC ATTATTTTTT CATATTCACA CAAATGATCT TGTTTATATT TAGCTAATTG 60 
ATTTTTATCT AGCATTTTAT CCTCCTGCTG AGTTTGTAAC CTTTAATAAT TTATTTTCTA 12 0 

TAAAAACTTA GTATTCCAGT TGCTTATTAT ATCATTGATG AAAGGCTGAA ATAAAACATA 18 0 


AACTGTTCCG ACCATTAACG CTGTAGCTAA AGATAAGTCT ACAAGTCCAC CTGTTTTAAA 24 0 

TTGAATCGGT GTCTTCACAT TAAACGGTAA gGA)aiAAAAT AATTTCACGC CTTTTGGTGT 3 00 
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AGGCGTCTGA ATAATTTGCA ATAAAAATGC TATGATTGCG ATAAATAATA TTGAATGCGT 420 

AAAGGTTCGA TGTCCAAAGA TCAATCTCAC AAAAAAACTA ATTACCTTAA ACCTTCTGCC 4 80 

AATCTTACTT TGAGTGTGAC ATATATCGGG TAATAAGCTA GCTAGAGTTG CTAGAATGAT 54 0 

AACCGTAACC GACGAAAAAA TATCCGTTTG AAAATATTGT GTTGTTAGCG CTCCAACGAG 600 

CATGCCGCAT GAAGCATGTG TTTTACCTGT CATATTTGTT CTCCTTTAAT ACTCACAT'rT 660 

TACCACATCC CTAACAAAAA CACGAACATA TTTTCGGGTT AAAATTCATT AGTATGACAC 720 

AATTTAAAAA AGTATCACAT AACTCTTGAA AACGATTACA AAATCGTTTA TGATGTATTT 780 

ACAAAATATT TAAAGGATGT GTTTGAATAA TGGCAATGAC AGTAAAAAAG GATAATAATG 84 0 

AAGTGCGTAT TCAATGGAGA GTTGCTGATA TCAAAATTCC TACAAGTGAA ATTAAAAATA 900 

TTACACAAGA CCAAGATATT CATGCAGTTC CTAAATTAGA CAGCAAAGAT GTATCTAGAA 960 

TCGGCTCAAC GTTTGGTAAA ACGAATCGCG TTATTATCGA TACTGAAGAC CACGAATACA 1020

TTATTTATAC TCAAAATGAT CAAAAGGTTT ACAATGAATT AACTAAATAA ATTGTATAAA 1080 

AaAATCATTC ATGGTGAGGG CTTCATGAAT GATTTTTTTA ATTGATTCAA CACCCAGCAT 114 0 

AAACAAATAC AAAAGGACAA CTGTTCCCAT AATTTTAACA GTTGTCCTTT TTCACATATA 12 00 

TTTATAACAA AAGATGTGCC ATCAAAGAAA TAATTGGTAG TGTAATGATT GTTCTAATCA 1260 

AGAAAATCAT AAACAATTTG CCGATGCTTA CAGGAATCTT CGAACCAAGT ATGACGCCAC 13 20 

CTACTTCAGA CAAGTATATT AACTGCGATA TACTAAGTGC CCCAATAACA AAACGAGTTA 13 80 

TATCATTTTG TACACCTTCA ATTAATATAG AAGGTAAAAA CATATCGGCA AAACCGATAA 14 40 

TAATCGTTTG AGAAGCCTGT GCCGCTTCAG GTATTTGCAT TAACTCTAAA AATGGAACAA 1500 

AAGGTTTACC CAATATGACA AAAAAGGGCG TGTAGTTCGC AATAATGGTA GCAATAGTAC 1560 

CAATACTCAT TACTACAGGC AAAATAACAA ACCACATATC AATGACTGTT TTTAATCCTG 162 0 

ACTTAAAAAA GTCAATAACG CCCGGTGCTT TAATACCTAC TTCTGTTGCA GTATCAAAGC 168 0 

CATGTCTCAA TGCCGTCTTT CCTTCTGGCA ATGCCTCAGT ACGCGCACTT TCAGGTACCT 174 0 

CCTTAGCATA CTCATCAGGA ATTTTATTTA AAGGCCAAAT TCTTGGCATA ATGACTGCTG 1800 

CAACGAGGCA GGATACTATC ACTGATAAAT AGAAAGCAAA AAATTGATTT TGCATGTGCA 1860 

CTGTTTCAGC AACTACAATT GCAAAGGTGA TAGAAACTAC ACTAAATGTC GTTGAAATAA 1920 

CTGTTGCCTC ACGACGAGAA TAATATCCTT CACCATATTG TCTACTTGTA ATTAAGACAC 198 0 

CAACAGTTCC GTCTCCAATA AATGATGCTA AATTATCTAC CGTCGAACGT CCTGGCAATG 204 0 
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AACTTGAGAA CAATAATCCA CCCGTTTCAT CTGAGTAAAT AACCTTTGAA CCAATTCGTA 2 22 0 

AAAATGTCAT CCATGCAAAA ACAACTGCTA ATATTCGTAA AATTAACCAA CCAATTCTAA 2 280 

CGTTAAAAGC ATTGTTCATT AGCCCGTCAG GTTTCAATTT ATCTTTTAAA ATAGTTGAAC 2 34 0 

AAATCAGAGT TATGATACCC GATAAAGTAA TTATCGTCAC AATTAAAAAT GGCATTACGC 24 00 

CACCTAATAC ATCTTTAAGC ACGCCTGCTA AAAATGCCAC GGGCAACGTT GTTTGCTTCT 24 6 0 

GTCCATCTTG TTCGACTGGA ATTGGTACTA AAAATAATAA GATACCAATT AAAGACATCG 2 520 

TAATAAACTT AAGTCTCCCA ATAACTATCT CTTTCCTTGA AAAGCTATCC ATAAAATCAA 258 0 

TCCATTTCTC TATGTATTCG TTTTAAGTAT ATACAGAATT CTATTCAGTT AACAAACATA 264 0 

TTCCTTATCA TTCTATCTTT CAAAATGTTT ATGTATGCAA AATAATGAAT AATTACAGTT 2 70 0 

ATTAAATATA CGCTATTTCT TGTAATTTTT CAAGATGAAT TCAAAAAAGG TTAAGTACAA 2760 

TTACTGATTT CGTACTTAAC CTTTTTTAAA CTCTAATCAT ATGTTAGTTA TTTCATTCTT 2820

CGTAATAATA TTAAGAAGTA TGGTGCACCG ATAATTGCAA TGATAACCCC AACAGGAATA 2 880 

TCCAGTGGCG GATGAATGCC ACGGGCTAAA CCATCTCCAA ATGTTAACAA TATAGCACCA 2 94 0 

ATTAACCCCG ACATGATAAT AACGTGTAAT GTTTTATTTC CTATTAATTG TCTCGCAATA 3 0 00 

TGAGGTGCAA TTAATCCTAA AAAGCTAATA CCACCGACAA CTGAAATTGC GGATCCTGCT 3 060 

AATATTACTG CTAAAATTAA CAATAGCATT TTAATAGTTT TAACTTTTAA ACCGAGTGCG 312 0 

GTTGCAACAG CATCACCTAG ATTCAATACA TCTAATTGAT AACTCAATAA AATGATGATT 3180 

GGTATCGTTA TTAAAAACCA AGGTAATATA GTATAAATAT TCGACATATC ATGTCCATAT 324 0 

AGACTACCTG TCAACCAAAC AAGCGCTTTG TTTGCTTCCA GTGGATTTCT GATTAATAAG 33 00 

AACTGCACAA TCGCCGTACA TATTGCGCCT ATTGCTAAAC CAATTAAGGC AAGCTTTGAA 3360 

CCTIJAACAT CATATTTTGA AATTAAAAAT GATAAAAATA AACTTACTGC AAAGGCACCT 34 20 

AAGAATGAAC CTATAGGTAA TACAPiACPiAT GGTGCTGTTG GAAAGGTCAT AATAATAATC 3480

ACAGCAGCTA AACTGGCACC TTTAGAAATA CCTATAACAT CAGGTGAGGC TAACGGGTTT 3 54 0 

CTTATTACAG CTTGTATAAT TGCACCTGAA ATAGCCAAGC TACTACCGAT AATAATACCA 3 600 

AGTAATGTTC TAGGTATACG ATACTCATTT AAAATAAAAT CATCTTGTGT AAAGATTCCC 3 660 

TTAATAGCAT CAATCGGATG AATCATGACA GACCCTACAC ATAAACTTAT GAATATACTC 3720 

ACAATTAAAA GGATTGTGAT TAAACTATAA CGACGTATAA TTTTCGTTGT CATCATATTC 3 78 0 

TTTTCACCCC TTTAATCGTT ATAAATAAGA AGTAAAGTGC ACCTACGAAT GATGTAACAA 3 84 0 

TCCCTACTGG TGATTCATAA GGATATGTAA TTAAACGACT TAATACATCT GATAGTAGTA 3 900 
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^.GCGTTTGAC TATATGCGGT ACGATTAAGC CAACAAATCC AAriGGTCCT GCCACTGACA 
CCGACATACC TGTAAGAATA ATGACTAATA GTCCAATX5AT AATTCTAACT TTATTrATAT 4080

TT 4082

(2) INFORMATION FOR SEQ ID NO: 262: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1145 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 262:

XATTAGAAGG TCGTTCGOAT GAACAATTAA AAAATTTAGT TAGCGAAGTA ACTGACGCCG 
r..^C AACGGGGGCA AATAGACAAG CAATTCACGT TGTTATAGAA GAAATGAAAC 
C^^CCATTA TGGTGTGGCT GGCGTAAGAA AGTCT^TCA ATAATTCTTC ATAAGATGCA 
TGCCAATTAA TTCTTTGAAA ACGAACAAGG CGACTICTAT CTGaGTATGA TAGAAATCGC 
CTTCTTTATT TTTAATCTTC ATCTAAAAAO TCTTTAATAG CTTGTTTATT TGTGTTTTTA 
TXAATCTGTA ATGCACTACC ATCAGTATTT GTATTGACAT CyTCGTATGA GTrCTTGATr 
OGCaCAGTCA ATGACTTAAC ATCTTTrTCA CCTCGGATAC CaAAACTCAA ACCTGTTTGG 
..rrAOGAAT GTTTGTATTC ACATAGCCTC TTAAAATACC TGCAACTTTT 
0=T»Trr» CAACTGTTtrr ««TT«CC AnTCTITrT TCAATOTTTO CATCCXTOT 
TGCTGACGTC OCAO^CGTCC GAAOTCACCI TCA«=OTCGT OACOCAATCT TGCATAACC. 
AGTAATTOT TACCATTCAA CCTATGCTTA CCCTTTITCA ««TACACC AATATTTTTC 
OAoiATCrr TTTCOACATT AATTOOTACA CCHTCAOOCA TTAATTCAIC AATCATTrTC 
TCAAATCCAG TAAAATCAAC TACTGCATAA TATTCAOGAI TAATTCCTAA ATTrrrATCA 

AoToirrrrc taagtagctc tggaccacct aaaocotatg ctgaattaat Trrax^TiTT 

CCATATCCTG GAATATCTGC ATAAATAICA CGCATOACAG ACATCATITT CATCTTnTA 
rrOATAAACT CATAT.OAAC AACCATOATA OAATCTGTIC TTGATTOTCC ACCTTGTOCT 
TTATCrOCAC CGAGTACAAG AATAOAAATT hTACCATCAT rTTTTACrGG TCCATTAAAT 
TOATGTACrr TAACATCTTT CGCA-n=TTTC rTGGCATATT CIACACCOCI AITGTAACTA 
TGTACAATAT ATACAACTAA ™:CGATAAG TAAAATTACA ACAATCAOAA GAATOATAOG 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7075 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 263:

TATGGCTCAT CATTAATGCA CGTATCGGGT AGCGTTTACC AITGATAAGT GCTTCATGTT 60

TAGCACGAGT TCTTAAAATT CCATCGCCAT AACCGATATC AACTACAGCT AATTTTGTAT 120

TGmTTAGT CACTTCAAAG GCAAAGCTAT AACCGCAATA ATCACCAGCT TGTACTTCGC 180

GCACTTGAAT AACATGTGCT TTTAAAGTTA ATGACTGAAC TATATCATGT TGATTCAGTG 240

AACTATATGG TCTTGAACCG TATAACGCAA TACCTACACG CGCATGTGTA TGGTGGGGTA 300

GTAATAyyyg TCCTTCCCGA TAAAAACTCG CACTATTTTG AGCATGGATT AGGTCGAACT 360

GATAACCTTC AGATAAAAGT GCTTCAACAA TTTCCATCCA TTGTGAACGT TCAACATTAT 420

AAtCTGACAC ATCGAATTCA TCAGCATATC CAAAATGGGk CCATAAACCA CTAATAATCA 480

TTTTTGCATT TTGATTATGA TGGTGATCTT TCAATACTTC TTTAATTTCG TTTAAATCTT 540

TAAATCCAGA CCGATGTAAT AAATTTTCAA ATTCTAAGTG AACATGAATA CCAGCTAAAT 600

CATTTTTATG GTTATAGTAA TATGTCAACG ACGGCAAAGT CATGTGTATT TGATGTTCAC 660

GGACTAAATC AAACTCGTAA ACTGCATTCA TTAAAAAGAT TGTTGCATCT GGAGCAAGTT 720

GTCTAATTTG AATTGCTTCT CGTAGTGATG TTGTGCTAAA TGTATCTATA CCTGCATGGA 780

TAAACTGAGT TACAGCAAAT TCTAGGTCAT AGTGATATGC ATTaTTTTTA ACAACTGCCA 840

TTAATGGCTG ATTGTTTTTG ACTGTGATTG CATTTTGTAA AAATATTTTC TTATTTACAG 900

ACCffTGTTGC TGTCAATGTA TTACACCTCT TTGTAATTAT TTAATAAATT TTCGTAAAAA 960

TTAACCACGT TTATTAACAC TTTTTCATCA AAATTTAAAT GTGATGTGTG CAAACCAGTT 1020

ACAAAACCTT TATCTTCATT TCGTGTTCCT ATAAAAACAA AGTAAGCTGG AGCTAGTTGT 1080

TGACCATAAA AACTAAAATC TTCCCCAAAT AAGAATGGCG TTGGTTTGTC ATAGACATTT 1140

AAATCAGCTT TTATTAAGGC GTCCTCTATT TGAGTACGTA ATTTCGGACT ATTGATTGTA 1200

GGGGGATAAC CTTCTGCAAA TTTAACTTCA CAATCTACAT TAAACAGAAG CTTGACACTT 1260

TCTGCTATCT TGTGCATTTG ATTTTTAACG ATTGTTAAAT CATCAATATC ATATGTACGA 1320

ATAGTACCTT CTAAATAGCC ATTACTTGGT ACAGTGTTAA TCGCTTCACC AGCTTTAAAA 1380

TGACCAATAT GAACAATATT TCGTTTCAAA CCGTTAAGGT GAAATTGTTG AATTTGTGAT 1440

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1560 
1620 
1680 
1740 
1800 



ACATGACrrO ACAGGCCTOT TAAGAAAAAG CGATACTCTG TTGCGCTGGC CGTAATTTCT 
TCATCTCTTA TCACTGCAAT GCCTTCATCA GCAAATGGGT TAACATGAAT ACCAAATACC 
GCTTCAATTG GATACTTATC AAAGGCACCO GCTTTTATTA ATCGATTTOC ACCGCCACCA 
GTrrCTTCTG CAGGTTGGAA AATGAAAACX5 ACATTTTGCG GTAATTGACC TGCATCTTGC 
ATGTCTTTGC AACGTTGTAC AAAAAGCATT AATGCAGTTG TATGACCATC ATGTCCACAA 

GCATGCATCA CATGATCAGA rTGACTGCGA TAAGGCACAT CATTTTCCrC TAAAATAGGT 1860

AACGCATCAA TATCAGCTCT ATACGCTATC GTATGTGAGC CATTACCTTC TAAGTATGCA 1920 

ATGACGCCAG TrTCCAATGG GCAATCGTAT TTAATATTTA AACTATCTAA AAACGCTTTA 1980

ATATAAGCAG TTC^CAAA TTCATGTAAG CTTAATTCAG GATGTrGAl^ TAAATGACGG 2040 

CGATGTrrCG TAACAAATTC TAATTCATTC ATAATTATCA ATCCTTTGTG TTAAATTACT 2100 

ATATAA.^.TAG TV^TAACGtAT rrCGAAATTT GTGATCATAA GnTATTCAA TGCTAAACAA 2160

TAAGGTTGAG ACATAATCGT ATCTCAACCT l.;AAATTArr ATACGTTCAC GTCACTAGTC 2220 

ATTCAGTTrr CTTAATGCTG CTACAATCTC TTmTAGTA TCTTGTACTT CAGAAGCTTG 2280

CTTAATCACT TTTCCAGGTG TACCAGCAAC AACTGCACCA GCTGGTACAT CTTGTGTCAC 2340

AATCGCGCCA GCTX^cAACAA TAGCACCTTT ACCAACACGT ACACCTTCTA AAATAACTGC 2400 

ATTTGCACCG ATTAATACAT CATCCTCGAT TATAACCGGT GAAGCACTAG GGGGriCAAT 2460 

..^r-rr.r r^r^rrcTAC ATGTACATTT TTACCAGTTG TAGCACGACC 2520 

CACACL'iij^w J. /\rtJ.rtv- J.^^"-"- 

ACCGAGAGTA GCATTCATAT CAATCATTGT ACCTTCGCCA ACGACTGCGC CAATATTAAT 2580 

TGT^CGCCC ATCATAACGA CAGCACCATC TTCAATAATG GCTTGTTCTC TAATAAACGC 2640 

ACCTGGrrCA ArrCGTGCAT TCGTATTTGT TAAGTCTTTT AATGGAATAG CAGAATTCCG 2700

ACGAfcCATT TCAATTTCTA TATCTTCGAA rrOACTACCA TATGCTTCGT AAAAAGGTTT 2760

CCAATCATCC GCTTCACAAA AGATTACrTT AGATTGTTCT GAACCAAATA CTTTAAAACT 2820

rrCTGGATAT GTGATGCCTT CAAAATTACC ATTTAAATAT ACmTAnX. GTGTAGACTT 2880

TTTAGCATCA CTTATATATT GAATAATrTC TTCAGCTGrT AAATGTTGTA CCATAAAATA 2940

ATCGATCTCC TTTAATATGT TTATA-AGTrG TCAAACGTAT AAAAGCCGTT TGGTTTAriA 3000

ACTAAGCGTT CTGCTGCTTG TATTOCACCA TTCGCAAAAA TATCTTTTGA TTGTGCACGA 3060

TGCGTGATTT GAATCGTTTC ATCAGTGCCA GCAAATAGAA CTTCATGrrC ACCGACAATC 3120

GTACCTCCAC GAATAGAATG TATACCAATA TC^IGGCT GGCGTTmC ATTTAArrCA 3180
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10 






TCGAAATCAT CAAGTAGGGG AACAGCAGCT GCTAAAATTT TAGTCAATGC ATGAACGCCA 3360 

TAACTCATGT TCGCGCTGAA AAACACAGGC ATATTTTGAC TCAATTCATC TAACTTATTA 3420 

AGTAGTTTTT CTTTCTCGnC CAGTgTTGCC ACAACTAATG GCAAATGAAA ATCTTCATCT 34 8 0 

AATAAAGGGA AAAGCAGATT TGGATTTGAA AAATCTATTG CAACATCGGC ACCTTTAACA 3 54 0 

TCTGCAATAT GTTGATATTG TTGATATGGC GTTGTTGCTT TCGGTGTATT TTCAATGACC 3600 

CCAACGATTT CATGTCCTTT TTCTTCTGCT AATCTAGCAA CGCGTTGATT CATTGCGCCA 3660 

TAGCCAATTA GTAATATTTT CACTCATTTT CACCCGCTTT AAATGTGTCA TATGTTTCAC 372 0 

GAAGCACTTT AGTATCTGTA TCTTCTAGGC TAACCAATGG TAGACGTAAT TCATAATTTC 3780

CAAATCCTAA ATAACTTGTT AGAGCTTTAA TAGGAATTGG GTTAATATCA ACTGATAAAG 3 840 

CTGATAACAG TGTGCCGATT GGTTTAAATT GATCTTGAAT ATCTAATCCA CTTTGTTGAG 3 900 

CATCGTATAA CGCTTGAAAT TCTTTAGGAA TGACATTGGC AATAACAGAG ATAACCCCTT 396 0 

GACCGCCACG TTGATAGTAT TCGACGACGT TGTCATCATT GCCACTATAT AATGCAAATG 4 020 

AATTTGTATC AATGCGCTTT TTCACTTCTT CTAAATACTC AAAATCATTC GTAGCATCTT 4 080 

TTAAAGCAAC TATATAAGGA TGTTGACTTA ATATTTCTAC AGTTTCTGGT TCAATTGTCA 414 0 

TGTTCGTTCT TGAAGGAACA TTGTACAGCA CGACTGGTAA TTTCACAGCA TCTGCAATCG 4 20 0 

CTTCAAAGTG TTTGACTAAA CCACGTTGGT TCGTTTTGTT GTAGTAGGGC GTAATTAACA 4260 

TAATTGCATC AGCCCCTAAG GCTTTAGCTT GGATTGAAGC TTGGATTGAC TTTTCAGTAT 4 320 

CATTAGTGCC AGTTCCTGCT ATGACAGGAA CACGTTTATC TACAAGATCA ATAACTGTTT 4 3 80 

TTAGAATGCG TTCTTTTTCA TCTGTTGTTA AAGTAGGGCT CTCAGCAGTA GTTCCATTAA 444 0 

CGATGATTGC TTGGGCATTA TTTTCTAGTA AAAAATTAAC GTGTGTTTTC AAAGCTTCAA 4 500 

TATKU^CTTT GTTATTTGTA AAAGGGGTTG TAAGTGCaAC SCCAACACCC TCAAATAAAT 4560 

GTGTCATTTT AATTCGCTCC TTTTAAACX3C ATAACTTGTT CCAATACTTG TACAGCATTT 4620

AATGCAGCAC CTTTTAATAA ATTGTCTGAT GTACACCATA CATGGAAAGT ATTTTCTAAT 4680 

GAATCATCTC TACGTATACG GCCAACAAAC ACTTCATCTT TATTAGTAGA ATTGATTGCC 474 0 

ATTGGATATT CATTGTTCTC TGGATTGTCT ACTAAAACAA CGCGGTCATC TTGATCAAAT 4 800 

AACGCTTTAA TATCTTCTGC TGTTGTTTCT TTGTCAAGCG TTACATCAAT TTCAACACTA 4860 

TGACTATCTT GAACAGGCAC ACGTGCGCAT GTTGCTGTTA CTTTTAAGTC TGGCGCATTT 4 92 0 

AAAATTTTTC TCGTCTCATC AATCATTTTT TGTTCTTCTT TTGTATATCC GTTTTCTAAA 4 980 

AACACATCAA TATGCGGTAA CACATTATTA TAAATTGGAT GTGGATATGC TTCTGGTGCT 504 0 
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15 



TGATATGTTG TATATGCCAC TCGTTTTAAA CCATAAGCAT CTTGCAATAC TTTTAGAGGT 5160

ACAACAGATT GAATCGTAGA GCAGTTTGGA TTGGCAATGA TACCTCTTGT AAATGTAGGT 5220

TCATTGACTT CCGGAACGAT TAAATCAATA TCTTCTGCCA TACGCCATTG ACTTGAATTG 5280

TCTATAACGA TTGCACCAGC TTTTTCAAAA AGTGGGGCAA AGTGTTCGCT TGTACCGCCA 5340

CCAGCACTCA TTAATACATA ATCGAAATGT TCACTTGCAC GAGCATCAGT TAATTCTTGA 5400

ACTGTATATG ITTTTCCTTG AAATTCAACT TCTTGCCCTG CAGAACGTGC TGATGAAAAT 5460

AATACTAATT CATCGAAAGG AATATTTTTA CGATTTAATG TCTCCAACAT TTTTGTACCT 5520

ACTAATCCTG TTGCACCCAC AACTGCTAAC TTTGTCATAA CTTGTCACTC CATTTTATAA 5580

TAATTTCCaA TTTTTAGAAT ATTTTAACAA TCATTTTACC ATTAAATGTT AAATGCGTCA 5640

TATAGTTTTT CTACCGCTTG TTGCCCATTA AAATCATCAA TGACGTATGA AATACTTATT 5700

TCAGATGTTG TTGTTTGGTA GAAAGGTATA TTATTTTCAA TTAATGTCAA AAATGCTTTT 5760

GATGCCACAC CTGACATATC ACGCATGCCT GAGCCAATTA ATGAAATTTT GACATAATGC 5820

TCATTGATTT TATAAGCTAA TGCTTCATAT TGATTCTTTA ATGTTTCAAG AATCATAGAA 5880

ATTTGATGAA AATCACTATC TTTAATCGTG AAGGATAGTT GTAGCCCATC CAAGTTGACG 5940

ATTTGTGAAA TCATATCAAC ATTTACAGCA CCTTCTTCAA GTTCCGTAAA TAGTTGGGTA 6000

AGTAGCTGAT TGTCAGGTAG GGGATAACTA ATTGTTACAT GCATCATATG TTTATCCAAA 6060

GCCACACCAG TAACTGCTTT TTTCTCTAAT ATTTCTTCAT TTGACATAAT CCATGTTCCT 6120

TTCACGTTCG ATAAAGTTTT TCCTAAATAT AAAGGGATAT TATAGTTTTT AGCTAATTCA 6180

ACACTTCTTG TTTCAAGTAC ACCAGCACCT AAAGCGCTCA TTTCCATCAT TTCTTCATAT 6240

GAGACGATGT CTAGTCGTTT AGCCTTTGGT AAAAGTCTTG GGTCAGTGGC ATACACACCA 6300

TCAACGTCGG TATAAATTTC ACAAGGTATT TGATTACTAA CAGCAAGTGC CACAGCGGTC 6360

GTATCAGAAC CACCTCTGCC TAAAGTTGTT AATTCCTGAT GTTCATTGAT GCCTTGAAAT 6420

CCAGCAACTA CTAAAATAtC GTTTTCTTGA AAGGCTTGTT CAAATGTTTG AGGATTAATT 6480

TGAGCAATTT TACTTTTTAA ATGATGGCCA ATGGTTTTAA TACCCGCTTG ATAGCCAGTC 6540

ATTGCTTTGG CATTCATACC GATA.TCATTT AATACCATTG ATAAATAAGA TACAGTTTGT 6600

TGCTCTCCGG TTGTCAATAA TAATGCCAGT TCTTGTTGTT TTGGTGCTTT AGTCAAGGTT 6660

GATACATTCG TCATTAATTG ATCTGTTGTG TTACCCATAG CACTTACAAC GACAATTAAa 6720

TTGTTCATCT TGATTGACTC GCTCCTTTAA CATTTCAGCG ATCCTTTTTA TTTTTGTAAA 6780

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TAATCTATAT ACAAGTGATG CACTCCATTA TTTTTAAATA ATGACAAACT CTCAGCTCTT 6 96 0 

AACCAAAAAG TCCAACAAAT TATAACTGCT ATTATAATTG CTTCGGCATC GCACCCTTTC 702 0 

AAATTTAGCT GTTAGCAGAC AGTAATCTAa ACTTTACTCA TGATTGATGC GCCTC 7075

(2) INFORMATION FOR SEQ ID NO: 264: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5171 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 264: 

AGACGTACTT TGTGATTmCG AAGyrCGTAC TGmAGCACTT GTCGACGTTG ATGTACTTGT 6 0 

TGAACCTGAT TGACTAGTAC TTTGTGATAA TGACTTACTA TCAGAATCAG ATGTACTTTG 120 

TGAATCACTT AATGATTCTG ATGTACTACC TGACTGAGAC GTGCTCATTG AACTACTTAC 180 

GGACATTGAT TTACTGTCTG ATGCAGATAA TGACCCACTT GTACTGATAG AGTCACTTAC 24 0 


TATCTCTGAA GTACTCATCG AGTCTGATGT ACTTGTTGAG ACACTTTGTG ATGCTGCTAT 30 0 

GCTTAGTGAT CCAGAAACAG AACCACTTGT GCTCGTCGAA TCGCTCAATG ATTCTGATGT 36 0 

ACTCATCGAT TTTGAATCAC TTGTACTTAA TGATATTGAT GTACTTTGTG AATCTGATTT 42 0 


GCTTGTTGAC GCACTTTGAG AGTTGGCTAT GCTATTTGAA ATACTGATAG AGTCCGAGGT 480 

GCTAGCTGAC TCGCTCAATG ATGTTGATGT ACTAATTGCA TTCGATGTAC TGTCACTTAA 54 0 

TGATGCTGAT GTACTAGACG ACCCTGATAT ACTCGTTGAT AAGCTTTGTG ACTTAGACAA 600

GCTTCCTGAT GTACTCATAC TTAATGAGTC ACTGAGTGAT GTTGATGTAC GCAATGAATC 660 

AGATSTACTT GTTGATAGAC TTTCGGATTT TTCAGTACTG CTAGAGTTTG AAATAGAATC 720 

GCTTAATGAT GTTGATTTAC TAGCTGAATC CGACATGCTT GATGATACAC TTTGTGAATT 780 

CACTAAACTT GTGCTTGTTG AGCTTGATAC ACTATTACTT TCAGATGTGC TTAATGACTT 84 0 

AGATGCACTC ACAGAATCAG ATAGGCTTAC ACTTGTCGAT TTCGAGGTAC TAGCTGATGT 9 00 


AGATACCACA ATCGATCCTG ATGTACTCGT TGATGCACTT TGTGAGTCAG CTTTACTTGT 96 0 

TGACACACTT TGAGATTGTT GTGTACTTCC TGATGTTGAT ACGGAATCAC TCATGCTATT 10 2 0 

TCTTGTTACT TCATATTTAA AAGTTGTCGT CGTTTTGTTA CCGCTCGCAT CTGTAGAAAC 10 8 0 


GATTGATATA GTACTTGTAC CAATGTTTGT TGGTGTACCA CTAATAGTAT TATTTGTACT 114 0 

ATCAAATGTT AGTCCGGATG GCAATCCAGT CACTGTATTC GTCACCGCAT TTCCACTGTT 12 0 0 
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ATTGGTGTCA CTGTTGGTGC TGTCGTATCC ACAACATTTA TTGTAAAAGT TGTCGTCGAT 13 20 

TTGTTATTTG CTTGGTCAGT AGACACAACT GTCACTGTTG ATTGACCAAT TTTTGTTGGT 1380


GTCCCAATGA TTGAATTCGT TGCACTATCG TAACTTAATC CGCTTGGTAA TCCTGTAACT 144 0 

GTATTTGTCA CAGTCCCAGT ACCATTATCC GTTGTAGTCA ATACAATAGG ATTCATTGTT 150 0 

TTACCCACTT CTATGGTTTG ATTGCCTACA GTTACAGTTG GTGCTTTTAC ATCAGTAAAA 1560 


TAATATGTCA CTGATTGTCC AGCATTCGTC ATTTTTACAG TTTTATTTGT ATCATTATAA 162 0 

GTTGACGCAT ATGAACTATC GACGGACGTG TAGTTATATC CTTTAGCAGT CAATGCAGAT 16 80 

TGCTGaTTAT CGATTGTCAC GACTTGATCA ACATTTCCTG AATATGTTTT TGGTGGAATA 1740

ATATCTTTAC CTGTTGTTAC ATCAACGTAT CTCACTTGTG TAnCAGCAGA CTCTGTATAT 1800 

TCGAATGTTC CAAATTGTAC TTGTTGTAAA TTTGTCGCGC CACCTGTTGA GGCTGTCATT 1860 

GATAATGAAA AnTTGGTCGT ACCACTTTTC GCAATCCAAT CTGAAATATT ACGTGTCCAT 1920 

GTTTGACCTG CATATTTGAC AGTCATAACC TTTGTATCAC CATTATAGTT AATATCAAAA 1980 

TCTTGGaACG TGTTATTTGT AGGTTGaACA TTTAACTTCG CAGCATTATC AGCTGTTGAA 2 040 


CTTGATGTAT ACGTTGTCGC AACACCATAA CTATCTGTTG TTACAAATGC ACCAAACGCA 2100 

CCTCCACCAG CTACATTAGA TGGGTCAGCA TTCGCCTTTG CAGCTGAATT TGGTTTAGAT 2160 

GTATTGTGAT ACGTATCCAA TTTGAAGCCA AATGCGTTAC TTAAGCCACC AATACCTACT 2220 


GCGGCACCGT TTAACCCTGT TTCACCTAAT ACACCTGGTG AAAAGGCAAA ACCGATACCA 2280 

TCTCCACCAT TTCCATGCCC TTCATATTTG TTACCTAAAT TTACTTTTCC AGAAAAATGA 234 0 

AAACTCTTAT TAGAGTCAAT ACGTGTTCCT AATGTAATAG CACCTTTTTG GCTGTATGCA 2400

TCCTGTGTTA ACGTCACAAT ACCGGTACTT TGATCATAGG TAGCATTACC TGACGTTGTC 24 60 

ATATATTGTT TTAAGTTATC TTTATTAACT GTAATTGTAT TAGCAGTTAC TGCGGTTGTC 2520 

GTCgCTGctG ACGCAAATGT TGACATAGCT AAGCGACTGA AAGTTCGAAG TTTTACTGGT 2580 

GCGGTGCTAG TTGACGTTGT GCTAGTTTTG TTTAAGTTGA CCGAAGATGG CGTTGTGCTT 2 64 0 

TGTGAAGTGT TATTTGATGC AGTACTTTGA TTTGTTGATG TATTAATTGG TTGTTCTGTA 2700 

CTTGAAGTTG AAGCTACAGA TTTAGTATCA GAACTTGATG TAGTATTCTT TGAGGATGTT 2 76 0 

GATTCTGATG TAGATGTCAA TTTCTCTTGT TGATTGCTTG TACTATTAGT TGTCGAAGTG 2820 

ACCTTTTCAG ACTTTTCACT TGAGACTGTG TCACTATTTG ATGTTTGTAC CGAACTACTA 2880 


TTTtTCGTTA CACTTGTGGA ATCGGCTGTT GATGTTGATG CTTCGATTGT CGTTGAGTTT 2 94 0 

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25 



GTTTTCAGTC CGTATCCCGT CATTTTTTTA CTAATGCTTT GATTATCTTG ACTCACTAAA 312 0 

CTATGACTAA TAAATGGTAG CCCCATAATT TTGAACATTT CTATTTCTTT AATTCCGGAT 318 0 

TTTACCCAAT TTTTTCCAGA TTTATAAAGT CTTACTCTTG TTTTTTCGTT TGCTAAGCTG 324 0 

TCATGAAATG CTTTCTGTCT TTTACTCATG TAATAACTCC TTGTATTATC TTTACATTCA 3 300 

TTAGATTATA ATATATGCCA CTATTCAATT TAATACAACT CTTTTTTGAT ACAAAAATAC 33 6 0 

TCATTTTGTT AAAATTTGTA AAAATTCaTT TTTATTCGTC TAAATGTAAT CGTTTTCATA 342 0 

TTTTTAAAAT TACTTTTTCT CGTTTATGCG TATAATCTTT TTTTATATAA ATTTGGCTAA 34 8 0 

TTGGCTTTAT GTTTAATCAT TATAATTGTT TCGTTTTTAA AATAATTATT GTATTAATAT 3540

ATCTATACCA TCCACCTTTT ATTTATAAAT AGTTAATTTA CAACTAAACG ATAAATATTA 3 600 

TATGCAAAAT ACATCTTTAA TATTAAAGTA ATACCAATAT TTTTTCAATA AACCTAGTGT 3 66 0 

AATATATGTG TAATTCTAAA AGATTCTTCT TTAAAAATAT AAATACCACG ACATATTGCT 3 72 0 

TTAACATTTT CATTTATAAA GCX3AAAAAAT GCATCGCTAC TAAGTTGAAT GTTTAGTAAG 3780

GATGCATTGA ATTCACTAAA ATGATTAAAT TACTTATATC TTTTCATCTG ATTGATTATC 3 84 0 

GAAATTTCTT CCTTCTAAAC CTGCTAACTC TTCTTTAGAA GCTGCAGGTG CTTTCATTTC 3 90 0 

AAATATCTCA TTCACTACTG TGTAATCGTA ATATCCTAAT CTGGCAATAG GTTTAATCGA 3 960 

CTTAATGTCC AATTTACCAT TATCAAGAAT AACCTTATCG TCAATATGAA CTTGGGCAAC 4 020 

TCTTCCTATA ACAATATCTA CGGTAGATAC TGGATCTCCA GTTGGAATAC GAATCGTTTG 4080 

AACGTACTCA CATTCAAAAT GAACTGGCGA TTCTTTTACA CGATATCCTG GAGCTTCTAT 414 0 

ACATTTTTCC TTTGTTACAC CTGCAAAATT AAATTCATCC TCTTCTGGTG GCAATGCTTT 4 2 00 

CGATGATAAA TTAACTGCTT CTCTTAAATC ATACGTTGCC ATATTCCACA CAAACCAACC 42 60 

TGTCTCTTCA GCATTTTTCA CTGTATCTTT ACGTTCGTGA TCACCAAGAA CGGATTGATT 432 0 

TGCTGCGAAC ATAACCATAG GCGGATCCCA AGTTAAGTTT TGATACTGAC TATAAGGCGC 4380

TAAATTATCT TTCCCATCTT TCGATACAGT AGAGATCCAC CCTATTGGAC GTGGTACTGT 44 4 0 

ACTACTTTTA AATGGGTCGT GCGGTAAACC ATGACTTCTT ACACCTTGTT TTGGCGAATA 4500

ATTCATACTA TCTTCACCCC TTATAAGTAA TTACATTTAA GGTTACGCCC TCTTTACATA 4560 

AGCGTCTAAT ATAAATAAAC AATTTATTTA TAAGTAGAAA CTATATATGA CGTGGTTGCT 4 62 0 

TATAATTTGC GTTCTTGATT CGAAAAATTC AGATAAGGAT TTATACAATT AATATTTATG 4 6 80 

ATATCTTTTG TAAATTTAAT TAATTATAGT TACTTCAATC ATGATTAGTT TATAATAATA 4 74 0 

AAGTGAAATT GAAAAAGACA GCTATTATGC GATGAGCGAA AAACTTCAAG TAAAACAAGA 4 8 00 
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TAAGTTCAAA AAAGAATTCA AACCTCTTAT GCACTTAAAA GGTGATGCAT TCAATCAACA 4 920 

GTTACAATCT TTGATTAACA AATATCCACA AATACAAAAA AATATGAAAT CAGAGTTCAT 4 980 


TGCTTATTAT GATAAAGAAA AAAATAGAGA AACAGTAAAA AACTATGCTT GGAACCTTCA 504 0 

AAAATCTATA AATGACATTA TGCAATCATA TCCTAGCACA AAATTTGTAC AGTTTTATAA 5100 

AAGATGATGT TTCCCCGTCA ATGGTAGATG GAAATGGCCG TTTAAAATCG GGATACTAAT 516 0 


GTATTTCCAT C 5171 
(2) INFORMATION FOR SEQ ID NO: 265: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3589 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 265: 

CTACACACTA AACCTATTTC AGTTATGGGT GGTACAGTGA TTCTCTTTTC ATTTTTAATA 60 


GGTATTTGGA TTGGTCATCC TATTGAAACA GAAATCAAAC CACTTATTAT TGGTGCGATT 120 

ATTATGTACG TACTTGGGCT TGTAGATGAT ATCTACGATT TGAAACCGTA TATAAAATTG 180 

GCTGGTCAAA TTGCCGCTGC CTTAGTAGTT GCTTTTTATG GTGTGACTAT TGATTTTATT 24 0 


TCGTTGCCAA TGGGTACAAC GATTCATTTT GGATTTCTTA GTATTCCAAT TACTGTGATT 3 00 

TGGATTGTTG CTATTACAAA TGCAATTAAC TTAATTGATG GACTCGATGG TTTGGCGTCG 3 60 

GGTGTTTCkG CAATCGGACT CATTACAATA GGGTTCATTG CAATTTTACA AGCTAATATT 420

TTCATAACGA TGATTTGTTG TGTTTTATTA GGCTCTTTAA TTGGGTTTTT ATTTTACAAT 4 80 

TTCCATCCTG CCAAAATATT TTTAGGTGAT AGTGGGGCTT TAATGATTGG ATTTATCATC 54 0 

GGATTCCTTT CTTTACTCGG ATTCAAAAAT ATTACAATTA TTGCATTGTT CTTCCCAATT 6 00 

GTTATCTTAG CAGTTCCATT CATTGATACT TTGTTCGCAA TGATTCGACG TGTGAAAAAA 660 

GGGCAGCATA TAATGCAAGC TGATAAATCG CATTTGCATC ATAAACTATT AGCTTTAGGC 72 0 

TACACACATA GACAAACAGT ATTATTAATC TATTCAATCT CTATTTTATT TAGTCTTTCG 780 

AGCATTATTT TGTATGTATC GCCACCATTA GGTGTTGTAT TAATGTTTGT ATTAATCATA 84 0 

TTTAGTATTG AATTAATTGT TGAATTTACA GGATTAATAG ATAACAACTA CCGACCAATA 9 00 


TTAAATTTAA TTAGTCGTAA GTCATCTCAT AAAGAGGAAT AGGGAATGAA AGCATAGCTG 96 0 
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ATTTACCGTC TTATGATAGT GCTTTTTATT 
TTTATTCAGT TGGTATATCG AAAGGTAACT 1140

GCTTTGGAGT TTCTTCAGTC AAATCGAAAT TTCCTGCAGT CATTTGATTT AAAAAGTTAA 1200

TAAACGCTTC ATAGTCACTT TTAACGACAT CGATATAGTA GCTTACCTTA TCAGTGTAAG 1260

TTTGGTTTCT TAACATAAAA TGAGTTGAAG CTAATTCATA TTCAAATTTA CCAGTTTGAT 1320

CATAATTCAG TGTTACTATA CATGGTACTG CTTCTCGTAG TTCGACACGC CCGATATCAT 1380

AAATGACGTC TCTAACAGCA CCGCTATAGG CGCGAATTAA ACCGCCACCA CCTAATTTAA 1440

TACCACCAAA ATATCTTGTT ACTACGACAC ACGCATTATG aACATCGrGC TTTTTTaATA 1500

TGTCTAACAT TGGGaCACCG GcAGTTCCtG TCGGTTCACC ATCATCATTC GCnTTTTGAA 1560

TATTCATTtC AGGTCCAATA GTATATGCAG AACAATTATG AGTGGCATCT TTATGTTCTT 1620

TTTTTATTGC AGCAATAAAT GCTTTaGCTT CATCTTCATT TTGAACAGGT TTGATATGAG 1680

CAATGAATCT TGATTTACTA ATCACATTTT CAATAATGTG TTCTTTTTTA ACAGTAATGA 1740

TATTTTGTGT CATAATAACT CCTTAATTCA TAAGCTTAAG ATTATTTAAT CTTCATTATA 1800

CACTGAAAAT GACATGACTA TAAATCGTTT GATTGCCATT TTCTTTTTAA CTGAAATATT 1860

GTATCATTGC TATGAGTATA TTTTAGGAGG ACGACTATGA AAATTGCTGT GATGACCGAT 1920

TCTACAAGTT ATCTGTCGCA GGACTTAATC GATAAATATA ATATTcAAAT AGCGCCATTA 1980

AGTGTGACTT TTGAAGATGG CAAGATTATA CCAGAAGAAA AAGTTCGTAC TAAAAAGCGT 2040

GCCATTCAAA CATTAGAAAA GAAAGTATTA GATATTGTAA AAGACTTTGA AGAAGTAACT 2100

TTATTTGTCA TAAATGGAGA TCATTTCGAA GATGGTCAAG CGTTATACAA AAAGTTACAA 2160

GATGATTGTC CTTCAGCTTA TCAAGTAGCA TACTCTGAGT TTGGTCCAGT TGTTGCAGCA 2220

CATTTAGGTT CTGGTGCATT AGGTTTAGGC TATGTTGGCA GAAAAATAAG ATTAACATAA 2280

TTATAAAATT TTAATAAAAG AGTCTATATT GTAATTGGAA ATTATCTCTC GTATACATGG 2340

CTTTAAATGT TCATCATTTG AAAGCCAAAA TGCTAAAGAT ATAAGAAAAT CATTATAATA 2400

TTAGGCTCTT TTTTACGTTG AAATGAGGTT TTAAGCATTA AACATTACGG GAAATTAATT 2460

CATCCTCATA CTTCACTTAC TAATGAAAAA ATTAAAAAAG AAGTAACAGG TGTCATCAAA 2520

CAAAATTCAA ACTATTATTG TGTTCAATGT GAAAGTACAA ATCCAAAGCA TTTTTATCAG 2580

TATGATTCCT CAGTACATTC CAAGAAAATT GTATATTGCA GAAATTGTAT ATCACTGGGT 2640

CGAATGGATA ATGTAACAAG ATATAAAATA ACAGAGAGTT CGCAAAGTTC ATCACAAGCA 2700

TATTATCATC TCTCATTTGA ATTGTCGGAA CAGCAGTCTT ATGCCTCAGA ACATATTGTT 2760

CGAGCCATTA GAAAGAGACA AACGATTTTG TTATATGCCG TAACAGGTGC AGGTAAGACA 2820
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TTAAAGACGC 
TTGAAGGGCA 
ATACTATTTT 
AAGCATTGAA 
CGAAACAACT 
ATAAAAAATC 
AGAAAATGTT 
TTTTTAACAA 
AATTAACATA 
ATGGACATTT 
ATTTGGATGT 
CTGGACGTG 
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TCACCACGTG TAGATGTTGT TGTAGAAATT AGTAAACGTA 
GAAGATATAG ACATACTACA CCAGCAATCA AGACAACAAT 
TGCACAGTGC ATCAACTTTA CCGATTCAAA CAGCACTTTG 
GTCGATGCCT rrCCTTTATC AATGGATAAA AATTTACAAC 
AAAGTTCAAC ATGCAACAAT TTATATGACA GCAACACCAC 
ATTCCCCACG AAAATATAAT TAAATTGCCA GCTCGCTTTC 
CCTAAATATC GTrATTrCAA ACTTAATAAT AAGAAGATTC 
TTACAAGATC AAATTAATAA TCAACGTTAT ACACTGGTGT 
ATGATTAAAA CATTTTCGGT TTATAAGCAG AAAATTACTA 
GAGGATGrrr TTCGCnTGA AAAAGTTGAA CAATTAAGGA 
TTTACTACGA CAATATTAGA ACGTGGATTT ACAATGGCAA 
GATGCACATC AATATACTCA AGAGGCTTTA ATACAAATTG 
(2) INFORMATION FOR SEQ ID NO: 266: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1017 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 266: 
mCCAAGAA GGcrrgAAAA AtGTTwCaAG TGGTGCGAmC CCAGTIGGrr 
TATCGACAAA GCAOrTAAAG rTGCTCTTGA AGCGTrACAT GAAAATTCTC 
AAAlW^T GAAATTGCGC AAGTAGGTGC GATTTCAGCA GCAGATGAAG 
TTATATTTCT GAAGCTATCG AAAAAGTAGG TAACGATGGT GTCATTACAA 
AAATGGACTA AACACTGAAC TAGAAGTGGT TGAAGGTATG CAAmGATC 
ATCACCGTAT ATGGrTACTG ATTCAGATAA AATGGTTGCT GAATTAGAAC 
rriAGTAACA GATAAGAAAA TCTCGTCITT CCAAGATATC nACCTTrAT 
GGTTC^TCT AATCGTCCAA TCTTAATTCT AGCTCATGAA GTTGAAGGCG 
AAATATCGTG CTAAACCGTA TGCGTGGCAC ATTTACAGCT GTTGCAGTAA 
rrTTGGTGAT CGTAGAAAAG CGATGCTTGA AGATTTAGCT ATTTTAACTG 
,,^,.^r..r^r.^ TAGATTTAAA AGATGCATCA ATTGATATGT 



ATTTCTTAAT 
TTTTGTTGTA 
TATTGATGAA 
GTCATCTTCT 

ACTTCCAGTT 
ATACCGAATT 
TATAGAAACA 
CGTCCATAGC 
CGATGTCATT 
TGTTGTTATC 



TACGACAAGG 
AAAAAGTTGA 
AAATTGGACG 
TTGAAGAATC 
GTGGTTATCA 
GCCCATACAT 
TAGAACAAGT 
ATGCATTAAC 
AAGCACCTGG 
GTGCGCAAGT 
TAGGTACTGC 



2940 

3000 

3060 

3120 

3180

3240 

3300 

3360 

3420 

3480 

3540 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
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CAGCATTGAT GCACGTGTTA GCCAATTGAA ATCTCAAATT GAAGAAACTG AATCTGACTT 78 0 

TGATCGTGAA AAATTACAAG AGCGCTTAGC TAAATTAGCA GGTGGTGTTG CAGTTATCAA 84 0 

AgTAgGTGCA GCAAGTGAAA CAGAGCTTAA AGAACX5TAAA TTACGTATTG AAGATGCATT 900 

AAATTCTACA CGTGCAGCAG TTGAAGAAGG TATTGTTGCA GGTGGTGGTA CTGCATTAGT 96 0 

AAATGTTTAC CAAAAAGTAA GTGAAATTGA AGCTGAAGGT GACATTGAAA CAGGTGT 1017 




(2) INFORMATION FOR SEQ ID NO: 267: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1409 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 267:

TTAATCCAGC GTTAACTGTA TTTGCATTTA TTATGATTAT TTCGATTCTT TTAGCGTATG 60 

TATTTAAATG GCTTGGATTA GTGGATGATG TGTTATTAAT GGTCATTATC ATTTCAACTA 120 

TTTCCTTAGG CGTAGTTGTT CCAACTTTAA AAGAAATGAA TATTATGAGA ACAACTATAG 180

GGCAATTTAT CCTATTAGTA GCAGTACTTG CGGACTTAGT AACTATGATT TTATTAACGG 24 0 

TCTATGGCGC AATCAATGGT CAAGGCGGCA GTACAATATG GTTAATAGGT ATATTAGTTG 3 00 


TTTTCACAGC AATTTCATAT ATTTTAGGTG TTCAATTTAA AAGAATGTCA TTTTTACAAA 3 60 

AATTGATGGA TGGTACGACG CAAATCGGTA TTCGTGCGGT ATTTGCATTA ATAATATTAT 4 20 

TAGTAGCCCT AGCAGAGGGA GTTGGCGCAG AAAATATATT AGGTGCATTC TTAGCAGGTG 4 80 


TCGTTGTTTC ATTATTAAAT CCAGATGAAG AAATGGTTGA AAAGTTAGAC TCATTTGGTT 54 0 

ATGGGTTCTT TATTCCTATT TTCTTTATAA TGGnTGGTGT AGATTTAAAC ATACCTTCAT 600 

TAATTAAAGA ACCGAAATTA CTAATTATCA TACCGATTTT AATCGTnGCA TTTATCATTT 660

CAAAATTAAT TCCAGTCATG TTTATTCGAC GTTGGTTTGA TATGAAAACA ACGATTGCAT 72 0 

CAGCATTTTT ATTAACATCA ACATTATCGC TCGTGATAGC TGCAGCCAAA ATTTCAGAAA 780 

GATTAAATGC TATTTCAGCT GAAACGTCAG GTATATTAAT TTTAAGCGCA GTCATTACAT 84 0 

GTGTATTCGT TCCGATTATT TTCAAAAAAC TGTTTCCAGT TCCAGATGAG TTTAACCGTA 90 0 

AAATTGAAGT TAGTTTAATT GGTAAAAATC AATTAACGAT TCCTATAGCG CAAAATTTAA 96 0 


CATCTCAGTT ATATGACGTG ACATTATATT ATCGCAAAGA CTTGAGTGAT CGTCGTCAAT 1020 

TGTCAGATGA TATCACGATG ATAGAAATTG CTGATTATGA ACAAGATGTT TTAGAACGAC 10 8 0 
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AAGlTGCrAA ATTAGCCAAA GCACATCAAG TTGAGCGTGT CATTTGCAGA CTTGAAAGCA 1200 

CAACGGACGA TACAGAGITA GTrcATTCAG GTATTGAAAT TTTCAGTAGC TACTTAAGTA 1260 

ATAAAATCTT ATTAAAAGGT TTAATrGAAA CACCTAACAT GTTGAAnTA TTAAGTAATG 1320 

riGAAACGTC ACTATATGAA AITCAAATGT TAAATTATAA ATATGAAAAT ATTCAATTAC 1380 
GTAATTTCCC ATTCGGAGGA GACATCATC 1409
(2) INFORMATION FOR SEQ ID NO: 268:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4702 base pairs 
(B) TYPE: nucleic acid
(C) STRANDEDNESS : double
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 268: 
AAAGAGGGTT TrTTACTACT ATAATCCATC CTTTAATGGA ATTCCATTTG TCCCrrrGGC 
CCGTTCATAA CCATAGGAAA ATATATGCnG nAATCAT^A TAGTAAATAC ATCAATAATA 
S CGAATAATGT CGGTGAAATC ACGTCCGTAT TACCATTATT AGCTAAAACA TriTCCAAGT 

TTTCrrrrGA ACCGGATACT CATAAATGCT rrrAATGCnT GGTrnGTCT gtcgccatct 
TTAGAATITC TAATACTTTT TTAAArrCTT TAGCGAACAA CTCATTATCT TTATCGTrrT 
- TAGCCATTTG ACGATTCAAA TCGTTAGCTC TTACGCCTAA TGCTCGACCT GTCGCATGTA 

GCrTATTCAC ATTATTGTCG ACTTGATTTA ATIX^CCACT AACAGAATCT GCAATTGAn 
rrGATTCTTG TGTATCTGAT AGCAAnGCG TACTCrTTrC AGAAATTCTA CTAAmClT 
" TATCTAAATT TGAAGACATC GTATTAAATT CATCATTrTT GCCTTTATCA ArmrGGrr 

CTTG^GGCTC TTCAGCAAAA GTCTTmAA CGmTCTAA CTGATCAATC AGCTTGGAAA 
TATCTTCTTJC ATTTTTTCTT gTAnCTTTT TGTTATTTAA AATGTCATCA ATCAGTriGT 
CTGAGTrrrC TTCCATTGAA TCAATrrOaT GTAACACAGC TACriTATCG TCTITGAAAC 720 
rrrCCATGTC ATTGATAACT TGGTCAACCA TCATATCAAT TAAACGrriG TTGTCGAATG 
GTTTATCTTC CCTGCCITTT GTATCTGTGT ACAriTTATA ATGCGCATCA AACCTAGATA 
ATGCACTCAA TTGCTGGCTT AATGCATCTT TCGATAAGCG ACCATCaAGG TTATGATTCA 
ATGTTACATC CACAACACTC GTTCCmCT TATCATTIGG nCATCTTGA CGATriX^Cn 
- GTCCAAATAA CAATIGTAAA TGCATTGTTT TATCTTTTAA GAAATCTTTC TCAGCATCCT 

.r-^AA-^rrA TTGArrrCAA mTATATTC CTTGTTAGAT GTATCGACTT 
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180 
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300 
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420 

480 

540 

600 

660 
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TAAAATGAGG AtCTGTTGCA ACAGTTAATT GATTAATATC ATTACTTTTA ATCGTTTCAG 120 0 

TACGTTGCAC TTTGACACCA TCATTAATCA AACTACTTGT GTCTTGCGCA ACTTGGTTAT 12 6 0 

CGTAATCTGT TAAATTAATG TGTTCTGCTA ACGGTTnTT CAAATTATAT TCATTTTTAT 132 0 

AACGTTTTGC TTCTTTGACA ATTGCTTCGT ATTTATTAGC TTCATCCTCA TTTAAACCTG 13 8 0 

CAGCTATAAA GTCTTGTTTA GACATGTTAT AGATAAATGT TGTATCTGTA TCAGGTTCTT 14 4 0 

TGACAATATC ATCATGAAGT TGTTTCTCTA AGTTTTCAGC GAATTGAGCA TTGTTCATTT 150 0 

TAATGCTATT TAGCGCATCT TGTAAGTCTT TGTTATTTGC AAGCTCATCT TGCAGTGATT 156 0 

CTGTTAATTG CTTACGATAn TCTTCAATCA TACCTTTTGA AAATGGTGAC TCTTGTGATT 162 0 

GAATGATTTC TCTTAATTTA TCTAAGTTTT CTTTAACAGT TTGTTTATAT TCTTCTTTAC 16 8 0 

CTGTATCTTG CATACTTGAT TGTTGATCAA TTTGGCTGTC CATCTGTTTT AATGCATTGA 174 0 

TATAGTTATC AAGTTCCACG CTATCTTTTT GCGATTTATA ATCTTGTAAC ATTTTATCCA 1800 

TCGCTGTATT GTGCTCGTCA AATAATGAAT TTTGTTTTTC AATTAAAGTC GAAACATTAT 1860 

AATCTGTGTT CACTCTGAAC GTATCTGAAT TCGCACTCAA TAATGATTTA TTGTATGTTT 192 0 

GGAACCATTT TGTAATGTCT TTGTTTGCAG AAATTGAATT TACAAGCGTA TCTGTAAATA 1980

ATTCCGGGAA GTCGTTAATT GGATTTAATA AGTAATTCGA GAATTTACTA TTCACACCAT 2 04 0 

GTTCACGCGT CATAATAGCG CCAACATTTT TTTGTGCATT ATGTAAATTA TCAATGATGC 2100 

TTGTTAAATA AATTTCGACT AAGTTTTTGT TAAAGTCGTT AAGTACATTA CTTACAACTT 2160 

TTTCTGTGTT TTTAGCTACT TCTTCTTTTT GTCCTACAGC TGTTTTATAC TGTAGCGATA 22 20 

TTTTCGATGG TGTTTTAGCG TCTAATTGCA TTGCCAATTT TGAAAAGTTT TCTGGGATAA 22 8 0 

CAATCATGAC TTGGTATCCA CCATTTTTCA AACCAGACTC AGCAACGTTT CTTGTTACTG 234 0 

TTTCAAATTT ATAGTTTTTC TCATTTGCTA ACCTTTTAAT AAATGCTTGA CCCAGCTCAA 24 00 

CTTTTTTACC GTTATATGTC GTTGGTTGAT CCTCGTTAAC AATTGCGATA TGTATTTTAT 2460 

TATTTTTATT ACTTACACTT TGGGATCCTT TTTCTGATTG ATCTCCATAT TTTGTTTGAA 252 0 

CAAAAAATAT CATACTAACT ATGGCAATTA TAATAATTAA AGTGACAATT AATGCATAAA 2580 

TCCAATTTTT CTTTTTCATG CTTATTTCCT TTCAGTTGTT TTCTTAAAAA AATGATAAGC 2640

AAAGCCACAT TAGAAAATGT GACTTTGCCA ATTTCAGAAT GCTTATTGCA AACCGAAATT 2700 

ATTAGAAAGT TGTTGGTCTT GTTCTTGAAC GGCATCAGCA GTGCtATTcA ATTGTTGTTT 2 760 

AATTTCTTCT AATAATTGTG CAAATTTTTC TACTTTAGGA CTAAGTTGTT GGAATTGCTC 2 82 0 

TTCGAAACGG CTGAAAGCTT GACCTTCCCA GTTCGCTGCA ATTTCACCTt GTGCACGkGT 2880 
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TC^TTTCC TCTGGACTCA TCTTAKTCAT TOCCATAACT AGAAACCTCC TGAATATTrr 
AAGrrTATCa AAACmTTA GGGACACTAT TTTTTGAAAA AGTGCTCCrr ACTCAAATAA 
TATATAAATT ATTAGTATAT GTATATAGTt TTTrAAGTAT TTTTAGCTrr ITTAAAATAA 
ATATATTGAA TATAACCATA TATTrTTAAT TAACCATTCA TTrrrGTAAT ATAAATGTGT 
ATACTAAAAT TAAATTAAAT ACATAAAGGA rTAAATCGTT KTTKTOM^ AAACA,^.— 
ACTGACGATG ACAACTCTTA CTTTATTTAG TATGTCGCCT AACTCGGCTC AAGCATATAC 
GAATGATAGC AAAACATTAG AAGAAGCAAA GAAAGCACAC CCAAACGCAC AGTTCAAAGT 
GAATAAAGAC ACCGGCGCGT ATACTTATAC ATATGACAAA AACAACACGC CAAACAACAA 
TCATCAAAAC CAGTCACGTA CAAACGACAA TCATCAACAC GCAAATCAAC GTGATCTTAA 
CAACAATCAG TACCATrCTT CATTAAGTGG TCAGTATACG CACATTAATC ACGCAATTGA 
rrCACACACA CCGCCTCAAA CGTCACCAAG CAATCCTTTG ACACCAGCAA TACCGAATGT 
CGAAGACAAT GACGATGAAT TAAATAACGC TTmCAAAA GATAACAAAG GUCT-TAr.AC 
AGGCATCGAT TTAGACGAAT TGTATGACGA ATTACAAATC GCCGAATTrA ATGACAAAGC 
;^GACCGCT GACGGTAAAC CTITAGCATT AGGTAACGGT AAAATCATTC ATCAGCCTCT 
TATCACAAGT AAGAACAACT TATATACTGC TGGACAATGT ACATGGTATG TCTITGATAA 
ACGTGCCAAA GATGGACACA CGATTAGTAC ATTTrGGGGA GATGCTAAAA ACTGGGCAGG 
CCAAGCTTCA AGCAATGGCT TCAAAGTAGA TAGACACCCA ACACGAGGAT CAATrTTACA 
AACAGTAAAT GGTCCATTTG GTCATGTAGC CTACGTTGrA AAAGTTAATA riGATGGAAG 
TATTCTAATT TCAGAAATGA ACTGGATTGG TGAATATATC GTTTCATCAA GAACCATCTC 
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TGCrrCAGAA GrrrCATCAT ATAATTACAT CCAnAAATT AATCATGACA TCAATAAAAA 
GCGACCAGTT CGCAGrTTAC AATTCGTAAC ACTGCAAAAT TGGTCGCTTT ATTTrGTAl^S 
rrATTCGATT ATAAAATTAC AAAGAAATGT TCTCTACATT CCCCATTAAT CAAAATCGTT 
TACGAAAGTA TAATTGTAGC TATAATAATC CAAGTCGTAA CAACTAGTGG CACTTATCGTC 
TTOAATAAGA ATATACCGTA TTTTTTCTTG CGATATATAT CCAGTACTAG CCAAATTAAA 
ATGATTATAA CACCAACAAA AATAAATACA GGATTCATCG ATATAGCATC TGCCTGTAAC 
TCAGGTTGCA TTCTTAATTT AG...ATAATT AAC^TCACTA CTGAAATAAT GAAAAAGTAG 
ATACCTCTTA TCTTTGATGT CTGTAAATCT AATTCCTGCT CTTCAAIX^C CTCTTTAGAT 
TCACCCAATT CTTTrGCAAT CAAATAATTT ACTACCTTAG GTTTCACCCA TAAACACTTA 
„ .^.-..AAAA- ATATGATCCA GTATCCATAA ACATTAAAAA GTTGCTTAAA 



3540 
3600 

3660

3720 

3780 
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3900 

3960 
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(2) INFORMATION FOR SEQ ID NO: 269: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 2004 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 269: 

ACAAAAAATT CACCCTCATT AATATTGAAA CTAATGTTAT CGACAGCAAC ATGTTTGCCA 60 

TAACGCTTAG TTACATTTTT AAACTTAATC ACTTTGCCAC CTCTTTTTTT CTCATAGCAT 120

AAAACCGAGA TTATATGTAT GTATTCCCTA TTTAACCACG TTTATTACAA TTTTCAAATT 180 

TAAATGATTT ATCCTTGAAC TTTTTTAATA AAATAATGAA TAAwAGGwAA TCwCCAGTTA 24 0 

AGAAATAGTG TTATTTTACC TTGAATTCAA AAAAACACCC AGTAAAACAA GGAATGCTTA 300

CTAGGTGTCT TCACTATACT TTGGCTTTAT AATTTTGAAT CGTTtCTAAA AATGCTGGAC 360 

AATAATGTTT TAATTTGTAA CTACCTACGC CATCmATATT AATCATATCT TGTTTCGAAG 4 20 

CAGGCTTACG TTTAGCAAAT TCCTCCAACG TGTAATCAGA AAATATACTT ACAGGTGCTA 4 80 

TCGTTAATTT GTCACTTAAC TTTTTACGAA CTTCTACCAA CTGACTGAAT AATACTCGGT 540 

CAACCCCTTC AACCGTATTT ATAAATACTT TTTCAGTCGC TTTTTGCTTA AATGGTGTTG 600 

TGAATACTTC TACTTCATTA CTGAGTAATT TTTTAATTGA AGTATCACAC ATTAATATTT 6 60 

CGTCATTTTC ATTTAAGAAC CCTTTGAATC TTAATTCATC TATTAAGTGA CTTAATTCTG 720 

ATGTTGTGTA ACCTTTCATT AAACCATGGG TTGAAATTTG GTCATAACCT TTATACTTAA 780 

TATAATCTGk TGACTCTCCT CTTAACACTT GAATGATAAC ACTATAACTC TCTTGTTGTT 84 0 

TCATACGAGC GATGCAACTA ATAATCATCT TAGCTTCTTG TGTCATATTA TATGATTTAT 90 0 

CTTGTTGAAC ACAATTACTA CATTGTTCAC ATTCTTCTAA TTTTTCATTC GGTTCAAAAT 96 0 

AATGGACAAT TGTTGCTTCT AGACATTTTT TTGTTTTTGT ATATTGAATC ATTTTAGTTA 102 0 

ACTTTTCGCC CATTTTATCT TTATAGTCAT CATCAGCTTG AGAGACTGTT ATAAAATACT 10 8 0 

CGTGTAAATT GATAtCGCGT TCGCTAAATA ACAAAATACA TTCACTTTTC AACCCGTCAC 114 0 

GACCTGCACG ACCCGCTTCT TGATAATAAG ATTCTAAATC TCCAGGCATA TTATAATGAA 120 0 

TAACAAAGCG TACATTGGAT TTATCAATAC CCATACCAAA AGCATTTGTA GCAACGACTA 126 0 


CTTTAACACG ATCAAATAAG AAATCATTCT GCGCTTCTTC TCTTTCTTTA TTGCTCAAAC 132 0 

CTGCATGATA TATAACACTT TCAATTTTCT GACTTTCTAA GGCTTCTTGA AGCTCTTCAA 13 80 
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CCAATATAAA TTTTTGACGT TGATAAGTAG GATTTACTTT AAAAATTAAG TTTCTACGCT 1500 

TAGTACTCGT TTTAATTTGA TCAGTTTGAG CGATATTTAA CTTTTCTCTA ATATCTTGCT 156 0 

GTACTTCAAC CGTGGCAGTT GCTGTCAACG CTATTATTGT AAAATCTTGA GGTAACGTAA 162 0 

ATACTTTTGA AATAACATTT TGGTAACTCG GCCTGAAATC ATGACCCCAT TTAGAAATAC 16 8 0 

AATGCGCTTC ATCAAACGCG ACTAAGTGAA TCTTTATACG CTGAAGCATA TTTAAAAAAT 174 0 

ATCGGTTTTC AAATCGTTCT GGTGCAACAT ACAAAAATTG AATTTCTCCA TTTGATAATG 1800 

CTTTTTCAAT ACGTTGTTGC TCTTTTTGAG TCAAACTACT ATTTAAAAAA GCAGCTTGAA 1860 

TTCCCATCGC TTTTAATTGA TCCACTTGAT CTTTCATTAA TGATATTAGT GGACTTATTA 1920 

CAATTGTTGT ACCACCTAAC mATAAACCTG GTACTTGrTA GCmTATAGAC yTACCTCCAC 1980 

CAGTtGGkAA GrCACCAAGC ACAT 2004 
(2) INFORMATION FOR SEQ ID NO: 270:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2244 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 270: 

AAAGATTGCT TGCCTTGAGG GTTTATATAT CTGACTCAAT TGCCACATTT TTATCAAGAG 6 0 

TAGTTGATAA TACTCATCAT AATTATAGCT AATATTATAT TTTTTTAAAA GATAGTGTAT 120 

GATTTTCTGG TGTTTGTTGT ATACGTCATT AAATTTCAAG TAGTCATTCT CCAAGTTATA 180 

CGTATAACAA ATATTTCCGG ATAAAGTTAG AATAAAATAT TTAGAAAAAT CATTCATTTG 24 0 

CGTAATCGCT AAATTAAGTG TTAAATATAA GACATAAGTA ATTAATTTAA TGTGATATGA 30 0 

TGTATTATTA CTTTGCTAAA TAGTAGATAG AACAAAATTT GTAATCGGGA GGTAACAATG 36 0 

GATTACGCAC ATTTAAATTT AGAACATTTT TTTGCACGAA ACGACGATTT AGATGTTATA 42 0 

AGAGATCGCG CTGATTTCGT GATGATAAAT AACTTCACTA ATGAAATGAT GTATCGTGAT 4 80 

GGTCAAATTG AAGGCACGAT TGATTTAAAT CAGTACTATT ATAAAAATAG ATCAAATGCA 54 0 

GCAAGTTTTA TTATGATGGA TTATAAAAAA GAAACTAAGT AAACGAACAA AAGAATTTTT 600 

TGTTTTTTAA TACGTGAATA ATAAGATTAT TGATATAAAG GTTTTCAAAG GTTATACAAA 66 0 

AAGATAAAAC ATTTATGATT CGTAGATCAA CGTAAAGTAA TGTTGATAAA TGGTTTAAAA 72 0 

. _ — ^.^^^.^ ^^^^^^^ ATATGTAACA ATGCATAGAT AAAATTGTTA 780 
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kCCThAGUGG 


TGTGGATATG AATAAACACA AGAAAGGTTC TATTTTTGGA ATAATAGGAC 900

TTGTTGTCAT ATTTGCTGTT GTCyCaTTTT TATTTTTCTC AATGATATCC GATCAGATAT 960

TTTTCAAACA TGTTAAATCC GACATTAAGA TTGAAAAGTT AAATGTTACA TTAAACGATG 1020

CAGCAAAGAA ACAAATAAAT AATTATACGA GTCAACAGGT ATCAAATAAA AAGAATGATG 1080

CATGGAGAGA TGCATCTGCA ACTGAAATTA AAAGTGCAAT GGATAGCGGT ACTTTTATCG 1140

ATAATGAAAA GCAAAAATAT CAATTTTTAG ATTTATCAAA GTATCAAGGG ATTGATAAAA 1200

ATAGAATTAA ACGTATGTTA GTAGATAGAC CAACGTTATT GAAACATACG GATGATTTCT 1260

TAAAAGCTGC TAAAGATAAG CACGTTAACG AAGTTTATTT AATTTCACAT GCATTATTAG 1320

AAACTGGCGC AGTTAAAAGT GAATTAGCTA ATGGAGTCGA AATTGATGGC AAAAAGTACT 1380

ACAATTTCTA TGGAGTAGGA GCCCTTGATA AAGACCCAAT TAAAACAGGT GCAGAATATG 1440

CTAAAAAGCA TGGTTGGGAT ACACCTGAAA AAGCTATTTC AGGCGGTGCT GATTTCATTC 1500

ATAAGCACTT CTTATCAAGC ACAGATCAAA ATACATTGTA TAGTATGAGA TGGAATCCAA 1560

AAAATCCAGG AGAACATCAA TATGCTACAG ATATTAAGTG GGCAGAAAGT AATGCAACAA 1620

TTATCGCTGA CTTTTATAAG AACATGAAGA CTGAAGGAAA ATACTTCAAA TACTTTGTGT 1680

ATAAAGATGA CAGTAAACAT TTGAATAAGT AATTTGATAA GCTACGAGTT GTTTTTATGA 1740

CTCGGACATA CTAAAAAGAC GCTTTCTATC TTGTTTTGAT AGAAAGCGTC TTTTTGCATT 1800

AGAGAAAACA CATTGATkGA TAAtCCCaCC aATGCAAgTG GGGcAGGACa TCGATAAAGA 1860

ATTACTTTTT CTTTAGAAAT TAGTATTTCT TATGCATGAG TTTTACTCAT GTATTCCTAT 1920

TTTTAAGTAC ACATTAGTTA TAGCTAATGA TAAAGAACCA CTACATAATA AATCATTAGT 1980

GTTTTTTTAT CATTTCTGTC CCaCTCTCAT CGTGATTTGA AATTTTCAAT TGCGATTTTA 2040

ATTTCATCTC TTACACGTTG GAACTCTGAC CAAGGCTTGC CTGCAGGATC ATCAAATCCC 2100

CAATGTTCTT TCTTAACATT TGTTGGTAAA GAAGGGCAAT TTACGTCTGC ATCACTACAT 2160

AATGTAACAA CTAAATTTGA ATTTnTAATA ATATTATTAT CGGATTAAAA TCTGATGGAT 2220

GATTTGATAT ATCAATGCCT ACTT 2244
(2) INFORMATION FOR SEQ ID NO: 271:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1371 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
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ATAAGCAATT TAATTTTGAG TCTACAATGG AGGAATTATC ATCTTTATCA GAGACTTGCC 60 

AACTTGAAGT GTTGGGTCAA ATTACTCAAA ACAGAGATCG TGTAGATCGC AAATATTATG 120 

TTGGTAAAGG TAAAATTGAA GAAATTCAAG CATTTATTGA GTTCAAAGAT ATTGATGTAG 180 

TCATCACAAA TGATGAATTA ACGACTGCAC AATCCAAATC ACTAAATGAA GCTTTAGGTG 24 0 

TAAAAATTAT TGATAGAACT CAGTTGATTC TTGAAATATT TGCATTAAGA GCAAGAAGTA 3 00 

AAGAAGGTAA ATTGCAAGTA GAGCTAGCAC AACTTGATTA TTTATTACCT AGATTGCAAG 360 

GCCATGGTAA AAGCCTTTCT CGTTTAGGTG GCGGTATTGG AACTAGAGGC CCTGGTGAAA 420 

CGAAGTTAGA GATGGATCGC AGACATATTC GAACTCGTAT GAATGAAATT AAACATCAAT 48 0 

TGCGGACGGT AGAAGAACAT CGCGAAAGAT ATCGAAATAA AAGAAATCAA AATCAGGTGT 54 0 

TTCAAGTAGC TTTAGTTGGT TATACAAATG CTGGTAAATC ATCATGGTTT AATOTTTTAG 600 

CAAATGAAGA GACGTATGAA AAAGATCAAT TATTTGCAAC GTTAGATCCT AAAACACGAC 660 

AAATTCAAAT AAATGATGGA TTTAATTTAA TTATTTCAGA TACTGTTGGT TTTATACAGA 720 

AACTACCTAC GACGTTAATT GCAGCTTTTA AATCAACTTT AGAAGAGGCT AAAGGTGCAG 7 80 

ATTTATTAGT ACATGTCGTA GATAGTAGCC ATCCTGAATA CCGTACGCAG TATGACACAG 84 0 

TTAATGATTT AATCAAACAA TTAGATATGA GTCATATTTC TCAAATAGTT ATTTTTAATA 900 

AAAAGGACTT ATGTGATCAT GCATCAAATC GTCCAGCAAG TGATTTGCCT AATGTTnTG 96 0 

TTTCTTCTAA AAATGATGGT GATAAATTAC TTGTTAAGAC GTTATTTATT GATGAAATCA 1020 

AAAGGCAATT AACTTATTAT GATGAGACAA TTGCGACGAA TAATGCAGAT CGATTATATT 108 0 

TTCTAAAACA ACATACATTA GTGACTGAAC TTAAATATGA TGAAATTGAA AATGTTTATC 114 0 

GTATAAAAGG ATTTAAAAAA TAATAAAAGG ACGAAATTCA AATGAAAGAT ATAAGTAAGA 12 00 

TAGl^GCTGA CGTCGAATCA ACGTTAGCAC CATATTTTAA AGAAATTGAA GAAACAGCAT 12 60 

ATATTAATCA AGAAAAAGTA TTAAATGCAT TTCATCATGT CAAAGCAACC GAAAGTGATC 13 20 

TACAAGGATC AACAGGATAC GGGTATGATG ACTTTGGACG TGATCATTTA G 13 71 
(2) INFORMATION FOR SEQ ID NO: 272: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6035 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
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CGTAAAcCTA TGCGTTTTAA TATTCTGAAG TTACTTAGTT CATCCTCAGT TTCATCCATT 120 

TGTTTAATAT AAATAATACA TCCAGCTGCT ACTAAAAATG CTAATCCTAA AAATGATGTA 180 

ACAAATATTA GAATACCGTT AGTAGCATcG ACCTCTTTTT TCATTTCATC ATACGTGATG 24 0 

ACTTTGTCTC CAAACTGTTT TGCAATTGCT TGAGCTTTTT CCTTTTGTGA TGTTTGTTTA 3 00 

ATATCATATC CATAAAAAGT ATGAACGTTA TTTTGTGTTT TCAACTGCTG ATACTTTTCA 3 60 

GGACTTACTT CGATGACAGG TGAGTTGAAG CTTAGATTTA AAGGATAAAC CTTACCTTTG 4 20 

TCTTCTTGTG TAcACGGAAA GTTTCATTCT TAGTACCTTT TACTACTAAA TCTTTGTTTA 4 80 

AATGGATATT AATAATGTTA GGCAGCGATT TTGTATTTGT AATGATGGCA TTGTTGCCTG 54 0 

TtAACTTGTT ATTTGCACTT AAAATAGAAT TCX3TGCX3ACC TGAATCACTA CCATTTTCCA 600 

AAGTAATAAC CTGATCATTA ACATTATCTA CAGTAATAGT TTCGTAAGCA TTTTTAGAAA 660 

ATGTAATTTG TTGTTGGCTT AGTTTAGTTT CAAATTGTTT AGCATCTTGA GTAGCGACCA 720 

CGTTAAATTC ATTTGGTGCC ATAGATGTAA GGGTTTGATC TGTATTTGAT TTAGATAATG 7 80 

CCGCAAAACA CAATACAGTT ACTGTAACTG CAGAAATAAT TGCAATGATA GTTAAAGACA 84 0 

TGGCATTTTT CTTCATTCTG TACATAATAG ACGATGTGAA TACAACATCG GTAATAGATA 900

CGCGTCCATT TTTTGATTTT TTCAATGTTT TAAAAATAAG TGACACGGAA CTTCTGAAGA 960 

ATAAATAGGC GCCTACAACC GTTAAAAATA AAATGATAAA CGGTGATGTC ATAGCCATAG 102 0 

TTAGTGCTTT GAACGTACCA AACATTTCTG TCGCCATATA ATAGCCTAGT GCAATCATAG 1080 

CAATACCTAA TACGCCTGAA ATAACCTCTG CAGTCGTTAC TTTAGCAGTT GTGGCATCAG 114 0 

TTTTAATTGA ATCTTTCATC ATTGATAAGA TACTACGTCT TTTTAGAAAT AAAGCACTTT 1200 

GAAATAAAAT CAGTACATAC GCAATAATTA GCATGAAAAT AGTTAAAACA AGGGCCATAG 126 0 

GTTCGAAATG TATCGATAAG TTAATCGATA ACGACATCAA TTTAGATACT ATGGAAAGCA 1320 

ATAaTTGTGC ACCTGCAATG CCACATAATA CACCGACAAC ACCTGTGATT AAAAATACGA 13 80 

TCATTtGTTC AAGTGCTAAC ATTTnCAAAA TGTTTTGTCG TGTTAAACCA ATCaACTGAA 144 0 

ATAGCGCAAA TTCACGTGTA CGGCGTTTTA CGrmTAAATG ATTGGCATAC ATTAAAAAGA 1500 

TGACAATAAT GATAAATAAA AATATTGATC CGACTAAAGC ACCTTTCTTA ATGATGGCCA 1560

TCGAGTCGTC ATTATTTACA CCTTTAGTAA ACTGTAAGGT TGTAAAACTG AAATATAAGA 16 2 0 

CGATGCTAAA AAATAATGAA AATAAATACA TTGCATAATG TTTTAAGTTT TGTCGTAAGT 1680 

TTTTGAAAAC GATATGGTTA AATGTCATTT GAGACACCAC CTAATACTGA TTGAAGATGT 174 0 

ACAATGTCTT CATAAAAGGC CTGTTTAGAA CGTCCTTCCT GATAAAGTTG TGTATGAATT 1800 
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ACCATGACAA TAGTTCTATC AAACGATTTA TTCATTTCTT CCAAACGTTG 

TTTGCACrrr tcgagtcgag tccgcctgtt ggctcatctg caaatatgat 

TGAACAAATG CTCTCGCTGC TGCAGTTCTT TGrTGTTGAC CACCAGATAA 

TATrrArrrc ctaggtcata aatacctaat gctgtcotga tcgctttata 
ATTGrrGCCT TCGACArrrr ttgaacagat aaaggtaaca TAATCrrrrc 
aatgtcggca gaatactgta ATCTTGGAAG ATGAAACCTA ATGATTCrrr 
GCAAGTGCTT TrTGATTAAG rTTATTAAGC TCTTGTCCGT TAGCAATCcg 
;^rrTGGTC AATTGAACTT AGTACATTTA ATAAGGTTGT CTTACCrGAT 
CCATAATCGC AACGAAnCG CCTTmX^TA T^CAAAGTT AATATCTTTA 
ATGTGTGCTT TTTACCGTAT GTITTTGAAA CATCTgCACT GATAATATCG 

AcrccrrtTG TArrrAATrr cattttaaat aatgtttgga gtagtagcct 

TTACAATTCA ATGAATGAAC CTTACAGAGT TGAAArcTAT CGCTACTTTAG 
AGTGAGGATA CAGATTCATC GTACATATTA GACAAAAGCA ATGGTGCTTT 
ATGTTTGTGT AAATTGAGAA AAGGGAATTT AATTATTGTA TAATAAATTT 
TTAAAAGAGG GTTrTATTTG AAAGGAATTG ATTGTTAT^; AAAAAGGAAA 
AAATCGTCTA GTTTAATAAT GGGTGTATTA TTATTAATCT TGGCAGTCGT 
TTTCCAATTG AAAATTTTTA TGCTATTACC TCGrTGATTG GACTGTTTGT 
GGTGTGArrC AAATCGTTTA CCGTAGAAAA GCAAAAGCTT TAGTAGGTGO 
TGGArrCTGT TTATGGGGAT TGTAGATAlT CTATTTCGTC TATTAGrTAT 
GGCGCAAGTT CAGCATrCTT TATTTATATG T^CrrrTT GGrTTATTrr 
TCrC^ATTAT rrACGrTTTC GGGTAGTGGT AGCTTAAAAC TAATTTCAGT 
TTATTAGGTA TTGrTTTCGG TGTCATTTTA TTATriAATC CATTAATGGG 
ATTTCGACGA TGATTGCTAT TGCATTTGTA TTCGTAGGTG TCATrTATGT 
CTTGCrrAAG TAAAATGAAG CGCrTCAAAA GAAGGGTGT.. ACATGAAGTT 
CCTTTTTGrr GTGTTTATGA AGCATAAAAA AGGGGCGCTA CCTACAATAA 

rrnT-raTTaTT TTTCAATACG ATTAATAGTT 
CCCATATTTT TATATTTTAC TATTATTGl i iii^^ 

^^nTvrvA^B TPTTTATAGT TATCTTTAGT GACATCAAAT 
CAAAATATTT TTCTAAAAAA TGTl iAiA^Ji 

TACCATTCCT TGTTAAAGTT AAATGATTrT CAGACAirGT AGCACGGCCA 
TAATAAAT^C TGTACAAATA TTGAATCrGG ATGCGTrTGA 



TAATAGGTCA 
TTGTGGTTTG 
TTCGCTAGGG 
ATTTTCTTCC 
•TwrT* aji r QGTT 
GCGGAATTTG 
cTACCGCTAG 
CCAGAAGGCC 
AGTGCTTGAA 
TCATAGTCTC 
TTATCTAAAC 



iAGai X a i- 
CTAAGTGATG 
TTTGTAAAAA 
TCAAGGTATT 
TATTTTTACA 
ATTAATTAAC 
TAACCAAAAT 
TTTTAATGTT 
TAGTTCTATC 
GATTTTTAAT 
TATCGTCTTT 
TGTAGATGCA 
TGTGTCATAT 
GTAAGATACG 
ACATTTAGTC 
TTTTCTGAGC 
AATGATTGTG 
TTATATTCGA 



1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 
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AATGATCATT TTCGAATTTT TGAACATAGA AAATATCCTT GTCTTCGTTG TTAAAAATAG 3 72 0 

CGCGGAATGT ACCACTGATA TCAGTAATTG GTTGTGTATG CTCAGATGAA GTAATAGGAA 3 78 0 

TGGCATGTAG AGGTAAGTCT CCAAAGCCAA CATCAGTTAC ATAGAATACA TCATTTATAG 384 0 

AAACAACAAG TGAAGCATGT GAACCGTTCA GACTACGACC GCCACCGGGw GTGTGAATAG 3 9 00 

TAGCTGACAT TAATTCAGGA TTAAATCCTT TTTGTTGTAA ATAGGCTTTG AAAAATGTAT 3 960 

TTAATTCATA ACAAAAACCA CCACGTTTAT CATGAACAAT TTTATTAAAA AGTGCATCGA 4 02 0 

TATTTATAGA TATCGGCTTA CTATTTTGAA CATCAATATT TTCAAAAGGT ACAGTTAACA 4 0 80 

TAAAACGTGT TGCATAATAA TTTAATGCTT CAATACTCGG TCGATTATAA CGAGATGAAT 4140 

CAATTTGTAA ATAATTCTCT AACTTCGCAA TATTCATAAG CATAGCGCCT CCTGTATTAA 4 200 

AGATTATAAT TAAATTTTAA ACAGAAATAC TGAAATTTTA AATTCGAAAG CATTGAATTT 4260 

TGGATAAATA CATTTTAAAT AGAAAAATAC GCTCTCAAAA TGAAGTCATC TCTAAAAGAA 432 0 

ACGATTTAAA GATGACTACT GAGAGCGTAg CATAATGGAA GAAGTGTGCA GGGTGTCTAA 4 3 80 

AAATGCAACA ATACAAAGGT AGTTGCAAGA CAAGTTGCCT TATCTAGACC ATTTGTGTTC 444 0 

TATGCGACCA AACTTCCAAA TTAAACTTGA AATAAGCCAA GTAATTAAAA ATAATGCAAC 4500

TAAAATATAG CCTAAATAAT CAAATTCGAT CGAACCAATG AATGCCCAAA ACGCACCATG 456 0 

TAAATCTAAC TTATCAGCAA GAATTTGTAG CAATTCAATC ATCCCAATCA CTAATGCTGC 4 62 0 

CATGACTGAT ATCGCAGTAA TCGTTATATT GTAATAGATT TTGCGAATAG GATTGAAGAA 4 68 0 

TGCCCAATTA TAGGCATACT TCATTACAAC ACCATCTAAT GTATCCAATA AACTCATACC 4 74 0 

TGATGCGAAT AAAATTGGTA AAGATAAGAT TCCGATAAAT GAAATGGCTT GTTGTGATGC 4 80 0 

GCCTGAAGAA AGAGCGAGTA ACGCAATTTC ACTAGCTGTA TCAAAACCAA GTCCAAATAA 4 86 0 

AAAGCCAAGT GGCAATACGT GCCAACTACG CGTGATTAAT TTGAAATAAG GTCCTACAAA 4 920 

TCGAGAAACC AATCCTCTAG ATTCAAGTAA TGCATCGACT TCAGCTTCTT CAATGTGTTC 4 980 

ACGACGTAAT TTAGCGAACA AGTTAATTAA AGAGATTAAA ATAATTAGAT TCAACACACC 504 0 

GATAAGCACT AAAAAGAAAC CTGAAACTAG TGTACCAATC GTTCCACCAA TATCTTGGAA 510 0 

ATGCGGTAAT TCATCTTTAG CCCATTTTAC AGATACCCCT AAAAAAACAG CCATTAAAAA 5160 

TACGACAGAT GAATGTCCAA TTGAAAAATA GAAACCCACA CCAGATGGAT CTTTGCGTTG 522 0 

CTGTAATAAT TTGCGaACCG TATTATCTAT TGCAGCAATG TGATCTGCAT CAAATGCATG 52 80 

ACGCAAACCT AATGTATATG CAAGAATCCC CaTACCAAAT AAGATATGAT GGTCTTTTCC 5340

AGCAATCCAT AAAAAACTAA ACCCAATAAC GTGTAACAAA ATGACAATAG CTATGTATGG 54 00 




EP 0 786 519 A2 



ATATTTAATC ATACTGTATG TTCAATGGGC ACTCTAGTAA TAAGTGTTCA TATAACAAAA 552 0 

ATGTTATGCC AAATTATTTG TTATATAAAA ATATACATGT AACCACAAAA GATTTTTTGC 5 580 

GATATATATA ATTTGATAAA TTAACCAACA ACAATGTAAG ATGTCACTTT GCTTAACTTG 5640

GCATCCTTTT TATGATTTTC AAATTCAAAA AAATGAGCAA AATGAATCTC TTTAcCAGTT 57 00 

TTTAATATTT CaATACCATG CATGGAACCT AAGCACCCAT GTG7GATGCT GGAATCaATA 576 0 

TTGAGACTAG CAACCTGATT GTAATGATTA GATAGTTCTT GAATTAATAT TTGAGGTCCG 5820 

TATATGTCAA AGCGGCCAGG GACAGACCAA ATAAATTCTG TTGTAACCAG TGAACGTAAT 5880 

AATTCAATAT CTAATGCTGC TGTAACAACT ATAAAATCTA TCATTTGTTG ACGTTTAGGC 594 0 

GCATGATTGC ATGACACATC TCCTGTTAAC TTAAAAGGTA ATGATGACTG AACTTCCGTT 6000 

TTAAAATGTA GTTGGTGCTG AAATAAAGCT TGTTC 6035 
(2) INFORMATION FOR SEQ ID NO: 273:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1039 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 273:

TTTTGAACAG CCATATTTAT TCACCCTCAA CATCATTATA ATGGTATTAG TCGCATTACC 60

TTCACTTGTT TTAGCTATAT ATGATTATAT GAGTTTTAGA ATTTCTTCTG CTATTTTACA 120 

ATTTCTAGGG GCTATCTCTT GGTTCTTTTT ATCATTGATA TTATCGCTCA CACAATTTAC 180 

hCCTTTTACA TTAGCGTCAT TTATAACTTC AATTATTTTG TTCACAAGCA CAATTATCAC 240 

ATTAGCCATT GGTGGTAAGT CTGTTGAAAA GAATGATTCC CCTTAAATTC CAAATGAAAA 3 00 

AAAGGTTCTG AAGGCCGCTA TAAAACACAG TTTTTCAGAA CCTCTATACT TCTATTCAAT 3 60 

GATATATGGT TTGCAATTTT CTACCTTTAA ATCCACAGCT TCTGCCCTTG AAACTTTGTT 420 

AAAATAAACC ATCAAACAAC GAATGACAAC TTGATGTGCA ACAATGACAA TATCATCTTT 4 80 

TTGTGTATCT TCATTGACAA CATGATTCAT AAAATGTTCT ACGCGTTGAT ATACATCTTC 54 0 

ATAACTTTCT CCTTCAGGCG CTTTTTGTGA AAAACTATGA CGAAAGTCTT TAAAGTTTGG 600 

ATCATTGAAA TATTTTTCAT ATTTCGGATT CGCACTGATT TCATCTTTAT ATTCACCCTC 66 0 

AAATACGCCA AGTGAACGTT CTCTTAATAG AGGGGTAGTC GTTGATGCAA TGTCATATGG 72 0 

^AAAA-ATaT TCAAACGTTT GCTGTGTTCT TAATAAGTCT GAAACATATA CATGTTTAAT 780 
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CGGCACATCT AATTGTCCAC AAAAATAAGA TCGAAAATGT TTATTATCAT AATTCGATTT 900 

TGATTCGCCA TGTCTAACTA AATAAATCGT CATAATATTA CTCCTTACCT TATGTATTTC 960 

ATATCTACCA TAACACTTTG ACTACTAATT CGATATCAAT CTTAATATTC TATTCTAAAA 1020 

AAAGAATTAA TTCATATnT 1039



(2) INFORMATION FOR SEQ ID NO: 274: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1496 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 274: 
GAGAGAATnT GCAATTAGTT ATTCAATTAG TTGATTTAAG ACATGATCCA ACACAAGATG 6 0 

ATATCTTAAT GTACAATTAT TTGAAACATT TTGATATTCC TACTTTAGTT ATATGCACTA 12 0 

ArGAaGACAA AATTCCaAAA GGTAAGGTyC AAAAGCATAT TAAAAATATT AAGACACAAT 180 
TAGATATGGA CCCAGACGAT ACAATTGTAA GTTATTCATC AATTCAAAAT AATAmaCAAC 240

AACAAATATG GAATTTAATT GAACCGTATA TTTCATAGTT TTTGTACGTC AAAACTTATA 3 00 

CAAAAATTTT AAAAATAATG TAAGCACGAA ACTTTTAATT AGTACACAAT TGATAACATT 3 60 

rrTCAACGTT CATCATTTTG TCAAAAACTC AAAAGTAAAT TAGAAAGATT ATAATTTATT 4 20 

TAAGCATCGT ACTTAATTGG ATTTTAAATT ATGTTATAAT ATTTGTATTG TTAGTATATA 4 80 

TGGGGGCTTT TCAAATGCAT TTTATTGCAA TTAGTATAAA TCATCGCACA GCTGATGTgC 54 0 

ACTAAGAGAG CAAGTTACTT TTAGAGATGA TGCCTTACGA ATTGCCCATG AAGATTTATA 60 0 

TGAAACTAAA TCTAyTTTAG AAAATGgTCA TATTaTCAAC ATGTAATCGA ACTGAAGTAT 660 
ATGCTGTTGT TGATCAAATT CACACAGGTC GTTACTATAT TCAACGATTT CTAGCTCGTG 720 

CATTTGGATT TGAAGTAGAT GATATTAAAG CAATGTCAGA AGTAAAAGTG GGGGACGAAG 78 0 

CaGTAGAACA TTTATTGCGT GTCACTTCTG GTTTAGATTC AATCGTACTT GGAGAAACTC 84 0 

AAATTTTAGG TCAAATAAGA GATGCATTTT TCTTAGCGCA AAGCACAGGT ACGACAGGrA 900 

CAATTTTTAA TCATCTATTT AAACAGGCAA TTACTTTTGC AAAAAGAGCA CATAATGAAA 96 0 

CAGATATAGC TGATAATGCT GTAAGTGTGT CTTATGCTGC GGTCGAGTTG GCGAAAAAAG 1020 
TATTTGGCAA ATTGAAAAGT AAGCAAGCTA TCATTATTGG TGCAGGGGAA ATGAGTGAAT 1080

TATCACTATT AAATCTTCTT GGTTCTGGAA TTACTGaTAT TACAGTAGTA AATAGAACAA 114 0 
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TACCAAATTT ACTTGAAAGT GCAGATATrG TGATTAGTrC AACGAGTGCA CAATCTTATA 
TCATTACAAA ..^TGATA GAAAGAATTG CAGAAAATAG AAAGCAAGAT TCACTAGTAT 
TGATTGATAT TX^CAGTrCCT CGAGATATTG AACC^TAT TAG1.5CCATC ACAAACATCT 
TTAATTATGA a<.TTGATGAC ITAAAAGGTT TAGTTGATGC AAACTTACGT GAGCGACAAT 
TAGCGGCTGC AACAATTTCG GAACAAATTC CTACAGAAAT ACATGCACAC AATGAQ 
(2) INFORMATION FOR SEQ ID NO: 275: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4826 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 275:
CTTGATTTTT TCCCtTTAGT ATTTTCCaTt TGanTGTCGC AGCTrWAAA TCCTGCTTTG 
CTTCTCTAGT GAACTTCATA ATTAAAGCAG CTACAACGAA" TGATACAAGT GCAGCAAGGA 
AGACACCGAG TAACATGTGC AAGAATTCAC CTCTAGGTGC AITTAAACAG TAAACTATAA 
ATGAACCTGG TGACGCGGGA CTT^AAATC CAAATCCTGT TGCTTGATAA GTTGCAACAC 
CAGTCATTCC ACCTAAAATA ACAGCGATAA ATAATAAAGG ACGCATTAAT ACATAT^GGA 
;^TAAArrrC ATGAATACCA CCTAAGAAGT GGATAATTCC AGCACCATAT GACGTTGCTT 
TTGCAGTGCC TrTTCCAAAA ATCATATAAG OUVGTAAGAT ACCTAAACCT GGTCCAGGGT 
TAGATTCAAT TGl^TATAAA ATTGATTGAC CAGCTTTTGC AGCTTGATCT GCACCAAGCG 
GTGTGAATAC ACCATGGTTA ATCGCATTGT TTAAAAATAC AATm^CA GGCTCTACTA 
;^TACrrAC AAGTGGAAGT AGGTGTGCAT GTACTAATGC TTCAACTGCC ACTGATAAAA 
TATGCATAAT AAATTTCATA AGTGGTGCTA AAATTTTAAA TCCTGCAATC GTCATGATAA 
ATCCTAAAAT ACCAGCAGAA AAGTrATTAA ATAACATTTC AAAACCrn.C GGCGTTCTAG 
OrrOAATCAA TTGCTCGGTC rrcrrCATTA ACCAACCAAC AAGTGGACCC ATAATCATTG 
CACCAAGTAA CATTGGTGTA TCAGGTAATG CAACGATGAC CCCCATAGrP GCI.rrGCTO 
CGATGATACC ACCACGTAAA TCATAAATTA AACGACCACC ACTAAATGCG ATCAATAATX:- 
GGATTAAATA AGTAATCArr GGTCCTGCTA AAGTAGCTAA ATCTTrGTTA GGTAACCATC 
CATTATCTAT AAAAATGGCC GCGATAAAAC CCCAAGCGAT GAAAGCGCCA ATGTTrGGCA 
^,,„,^^,eT ACTTAAGAAT GATCCAAATG CTTGAACACG ACGACCAATT CCTrTTTTCT 
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GAGAGGTTAC TTGTTACTCA ATATAAACAA AAATCAACTT TGTCAAAATA AATGTGACAA 12 0 0 

AATTAAATAA AGTGTCATCA ATGTGACAGT ATAGATATTT TGAAAAAGTA AAACAAAAAA 12 60 

ATTGTTTTAG GATTTTTAAA ATTTTATTGT GAAAATATTT GCAAAACAAA ACAACACCGT 1320

gTACAATAAT GATTAATGGA AAGGGGGAAA GTTCGGCAGT ACAGTTAAAG CGCCTGTGCA 13 8 0 

AATAAATATT TGTATTTGAA GATTAAAGGT TAATATATGA GTGGCCTTTA TAGAGTGCAA 144 0 

TATATGTATT TGTAGACGAG GAGGATAGTG ATCGAATAGA TCGGCGGATG CTATCCCGGA 15 00 

TGTGGCTCAT TCGTTAGCTT ATTAAGTAAA ACATTAGGGT GACTTAATGG ACPJ^GTThA 156 0 

TAAGATCGCC AGAAATTGAA TATAAAAAAT ATTAATATGG AAAGTACAGT GTGAGCAATT 1620 

TGTATAGTTG TAAAAATAAC TATGCTTAAT TTGTTATGGA TGAATGCGAT GATAGCATGT 16 80 

TCCTATTTAT ATTATGAAAG CAGATTGTCA ATCTAAATTA TCGGCAATAA ATCATAATTT 174 0 

ACGCGTACTA TTCCAATATG GAGGAAAATG TCGTTATGTG TGGAATTGTT GGTTATATTG 1800

GCTATGaTAA TGCCAAAGAA TTATTATTAA AAGGTTTAGA AAAATTAGAA TACAGAGGTT 18 6 0 

ATGACTCTGC AGGTATCGCA GTAGTAAATG ATGATAATAC AACTGTATTT AAAGAAAAAG 192 0 

GTCGTATTGC AGAATTACGT AAAGTTGCTG ATAGTAGCGA TTTTGATGGA CCTGTTGGAA 1980

TCGGTCACAC ACGTTGGGCA ACACACGGTG TACCGAATCA TGAAAACTCT CATCCACATC 2 04 0 

AATCATCAAA TGGCCGTTTT ACTCTAGTTC ATAACGGTGT TATTGAAAAC TATGAAGAGT 210 0 

TAAAAGGTGA ATACTTACAA GGTGTATCAT TCATTTCAGA AACAGATACA GAAGTTATCG 216 0 

TTCAATTAGT TGAATACTTT TCAAATCAAG GACTTTCAAC TGAAGAAGCA TTTACAAAAG 22 2 0 

TTGTGTCATT ATTACATGGT TCATATGCAT TAGGTTTATT AGATGCTGAA GACAAAGACA 22 8 0 

CAATCTATGT TGCTAAAAAT AAATCACCAT TATTATTAGG TGTTGGTGAA GGTTTCAATG 234 0 

TTATCGCATC AGACGCACTT GCAATGTTAC AAGTGACAAG CGAATATAAA GAAATCCATG 24 0 0 

ACCATGAAAT CGTTATTGTT AAAAAAGATG AAGTTATTAT TAAAGATGCA GATGGAAACG 24 60 

TTGTAGAACG TGATTCATAT ATTGCTGAAA TTGATGCATC AGATGCTGAA AAAGGTGTTT 252 0 

ATGCACACTA CATGTTAAAA GAAATTCATG AACAACCAGC AGTAATGCGT CGTATTATTC 25 80 

AAGAATATCA AGATGCAGAA GGTAACTTGA AAATTGATCA AGACATCATC AATGATGTTA 2640

AAGAAGCAGA CCGCATTTAC GTTATTGCAG CAGGTACAAG CTACCATGCA GGTTTAGTAG 2 700 

GTAAAGAATT TTTAGAAAAA TGGGCTGGCG TACCAACTGA AGTACACGTT GCATCAGAGT 27 6 0 

TTGTCTACAA CATGCCATTA TTATCTGAAA AACCATTGTT CGTTTATATT TCTCAATCAG 2820

GTGAAACTGC AGATAGCCGC GCCGTATTAG TTGAAACTAA TAAATTAGGT CATAAATCAT 288 0 
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TACACGCGGG TCCTGAAATC GCAGTTGCAT CTACAAAAGC ATATACTGCA CAAATTGCAG 3000 

TATTATCAAT CTTGTCTCAA ATCGTTGCAA AAGAGCaTGG TCGTGAAGCA GATATTGATT 3 06 0 

TATTGAGAGA ATTAGCAAAA GTAACAACAG CAATAGAAGC AATTGTTGAC GATGCACCAA 312 0 

TTATGGAACA AATTGCTACA GATTTCTTAG AAACAACACG CAATGCATTC TTTATCGGAC 3180 

GTACTATTGA CTATAACGTA AGTTTAGAAG GTGCGTTAAA ACTTAAAGAA ATTTCTTACA 3 24 0 

TTCaAGCAGA AGGTTTTGCT GGTGGAGAAC TTAAACATGG TACAATTGCC TTAATCGAAG 33 00 

AAGGTACACC AGTTGTAGGT TTAGCAACAC AAGAQAAAGT TAATTTATCA ATTCGTGGTA 3 3 60 

ACGTTAAAGA GGTAGTAGCA CGTGGTGCAC ATCCATGTAT TATTrCTATG GAGGGTCTTG 3420 

AAAAAGAAGG CGACACTTAT GTCATTCCTC ATGTACATGA ATTGTTAACG CCATTAGTAT 34 80 

CAGTGGTTGC ATTACAATTA ATTTCATACT ATGCAGCATT ACACAGAGAT TTAQATGTTG 354 0 

ATAAACCACG TAACCTTGCT AAATCAGTTA CTGTGGAATA ATTCACTTTT TTAGAATCAA 36 00 

TCATGTATTA AAATTAAAGT ATATGGCACC CTTTTAGATl AATCGACTAG AAGGGTGCTT 366 0 

TTTTAGGTCG ACTTaGCTTT TACTTCATCT TAATTTGGCA GAAATGCGTa AAAATGAAGT 372 0 

GTTTTATTTA TTTAAATAGT CTGACAATTA AGGGTGTTAT GTTAATATGA TTTTATGAGA 37 8 0 

AGTATGGAGT AGCAATAAAG GGGTGACCTC GCATGTTAAT TCAATTAGAT CAAATTGGGC 3 84 0 

GAATGAAGCA AGGAAAAACA ATTTTAAAAA AGATTTCTTG GCAAATTGCT AAAGGTGATA 3 900 

AATGGATATT ATATGGGTTG AATGGTGCTG GCAAGACAAC ACTTCTAAAT ATTTTAAATG 3 960 

CGTATGAGCC TGCAACATCT GGAACTGTTA ACCTTTTCGG TAAAATGCCA GGCAAGGTAG 4 020 

GGTATTCTGC AGAGACTGTA CGACAACATA TAGGTTTTGT ATCTCATAGT TTACTGGAAA 4 0 80 

AGTTTCAAGA GGGTGAAAGA GTAATCGATG TGGTGATAAG CGGTGCCTTT AAATCAATTG 414 0 

GTGTTTATCA AGATATTGAT GATGAGATAC GTAATGAAGC ACATCAATTA CTTAAATTAG 4200 

TTGGAATGTC TGCTAAAGCG CAACAATATA TTGGTTATTT ATCTACCGGT GAAAAACAAC 42 60 

GAGTGATGAT TGCACGAGCT TTAATGGGGC AACCCCAGGT TTTAATTTTA GATGAGCCAG 432 0 

CAGCTGGTTT AGACTTTATT GCACGAGAAT CGTTGTTAAG TATACTTGAC TCATTGTCAG 43 8 0 

ATTCATATCC AACGCTTGCG ATGATTTATG TGACGCACTT TATTGAAGAA ATAACTGCTA 44 4 0 

ACTTTTCCAA AATTTTACTG CTAAAAGATG GCCAAAGTAT TCAACAAGGC GCTGTAGAAG 4 5 00 

ACATATTAAC TTCTGAAAAC ATGTCACGAT TTTTCCAGAA AAATGTAGCA GTTCAAAGAT 4 56 0 

GGAATAATCG ATTTTCTATG GCAATGTTAG AGTAAATATT TTGCAAATAA TAAGTAATAA 462 0 

TGACAAAATT TAATTAAGAT AAAATGGACA GTGGAGGGCA ATATGGATAA CGTAAAAGCA 46 80 
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AAAGATGTCA TTAATCAATT GAGAGAGAAA GGATATAAAG TATTTTTGGC AACAGGACGT 4 800 

TCGCATTCTG AAAATACATC AACTTG 4 82 6 

(2) INFORMATION FOR SEQ ID NO: 276: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4846 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 276: 

GAATAAAAAG TAAAATTACT CGCCTTTGTT ACCTTTTACT TTATCAATaA AATCAGTTGC 6 0 

TTTTTCTTTT GCATTTTCAA CGAATTCTTT CGCTTTACCA GAAGCTTTAT CTTCTTTACC 12 0 

TTCGTTTTCT AAATTTTTAT TATCAGTAAC ATTACCTACT GTTTCTTTAA CATTACCTTT 180 

TGCTTGTTCA AATTTACTTT CGTCTGCCAT AATAAATGCC TCCTCGGAAT AATTAAATGT 24 0 

TATATATAAT ACTTACCCAC TGAAAAATTA TCTAAACATT TTACTTAAAT AATTTTTGAT 3 00 

ATTGATTTGA CGTCATTTTA TAACTAGCGA AATAGATTCA TCATTAACTT GAGGGAGTGG 360

GACTGAAATA ATAAAGAATC ACTAATGATT TATGATGTAT TAGTCACTAG CCATGTGAAA 420 

TTAAAAATAA GAATAAATGA GTAGCACGCA TGCATATAGG ATTTTACTTT ATCCGTAATA 4 80 

GCATCTCATT CCTAAATATC ATATAAATAC CTGTTTAAAT TAAAAAGCCC AGCAACATCA 54 0 

CGTTACTGAG CCATTAATAT GATTTATTTA GCAGGAATAA CTGCACCATT GTATTTTTCA 600 

TTAATGAAGT CTTGAATATC TTTAGATTGT AATACTTCAA TTAATGCTTT GATTTTCTTA 660 

TCATCTTGAT GTCCTTCTTT AACAGCAATT AAGTTTGCAT AAGGATTATC TTTCGCACTT 720 

TCTACAGCAA TAGAATCTTT TTTAGGATTT AGTTTTTGTT CGATTGCAAA GTTCGAATTA 780 

ATGATAACAG CATCAGCGTC TTCATTTTGA TAAATTTTAG GTAAGAATTC TGCTGATTGT 84 0 

TTATTATTAA ACTTAATATC TTTTTTATTC TCTGTAATAT CACTAAACTT AGCATCTTCA 900 

ATTTTTACGC CTTTTTTGAT TTTAATTAAA CCTGCATCAA CGAAGAATTT TAAGAAACGT 960 

CCTTGTTCAG CTGGATTATT AGACACATAG ACTGTTGCAC CTTTTGGTAA TTCTTTTAAA 1020 

CTTTTATACT TTTTAGAGTA TACAGCCATA GGTTCTAAGT GAACATCACC GGCACTTACG 1080 

ATTTTGTAAC CTTTATCCTT TTTCTCTGTG TTTAAATATG GTGTATGTTG GAAATAGTTT 114 0 

GCGTCAATTT CACCTTTGTC TAGTAATTTA TTAGGTGTAG TGTAATCGTT AATTGTTTTA 1200 

ATATCTAGTT CATAACCTTT TTTCTCTAAT AATGGTTTTG CTTTTTCTAA AATTTCAGCA 1260 
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TTACCGCAAG CTGCTAATAC AACTGCAAAT GTTAATACTA AAATAAGACC AAATAATTTT 1380 

TTCATAAAAT GAAACCCCCA ATTTATCGTT TATCAAGTTT ATTTGTAAGC CAATCCCCAA 144 0 

TGAATTGGAT TATAAATACA ATAATTAAAA TAAAAACTGT TGATACTAAA ATGACATCAT 150 0 

TTTGATTTCG AGTGAAACCT GTTAAGTATG CTAAATTTCC TAAACCACCG GCACCAATTA 156 0 

CACCTGCAAC TGCTGTTGAA CCAACTAAAG CGATTGCTGT AACTGTAATG CCAGACACTA 162 0 

GCGCTGGCAT AGCTTCAGGT AAAAGGACTT TACGAATTAC TGTCCAAGTA TTAGCGCCCA 168 0 

TTGACCAAGC CGCTTCGATG ACACCTTTAT CAATTTCTTT AAAAGCAATT TCTACGAGCC 174 0 

TTGCATAAAA CGGTGctGCG CCAATGATCA AGGCTGGTAA CGCACCTGTC GGACCACTTA 1800 

TCGTTCCAAG TATCAAACTT GTAAATGGAA TTAATAATAA AATTAAAATA ATAAATGGTA 1860 

TCGCTCTAAA TAAGTTAACA ATGAAAGAAA CGATAGAATA AAATAACCTT GCACCGATAG 1920 

ACTTACCTTT AGCAGACAAG AATAATAACA CACCTAAAAT AAGACCAAGT ATAAATGCAA 198 0 

ATATAGTTGA GACGACTGTC ATGTATAGTG TTTCGACTAT TGCACJTCCAA ACT7CTGGCC 204 0 

ACTGAATATT AGGCATTGTA ATCATTTCAT TTATAATTTC ACTAAATGAT TTACCCATGT 2100 

CTTAACACCT CCATTTTAAC TTGTCGCTCA ATTAACTCTT TTTCGAATTT TCCGAAATCT 216 0 

ACACTTGAAA TATATGGAAT ATGCAGAACT AAAAAGCCGA CTGTTCCATT TTTTGTATTT 2220 

TTAATATTTG CTTCTAAAAT ATTAATTTTA ATATCATAGG CAGTTGATAG ACTCGATACA 22 8 0 

ATAGGCTCGG TTGTTGTTGA ACCAGCGAAA ACTAATCTAA CGATATATGC ATCTTTTTCT 2340

AATGGCTCTA ATTCTGTTAA AGATGTTTCG AAATCATCAT TTAAATCGTC TTTCACAAAT 24 00 

CGTTTTGTCA CAGTGTGTTG CGGATTTTCA AAAACCTGTG TCACCGGTCC TTGTTCTATC 246 0 

ACTTTACCAC TTTCCATAAC TGCAACTTCA TCACAAATAC GACGAATGAC ATGCATTTCA 2520 

TGCGTAATTA GTACAATTGT TAAATTTTGT TGTTCTCTAA TTTTTAGTAG TAGATCTAAA 258 0 

ATTTCATCTG TTGTTTGCGG ATCAAGTGCA CTTGTTGCCT CATCACAAAG CAAGACCGTT 264 0 

GGATCATTaG TAACGCTCGT GCAATCCCAA CACGTTGCTT TTGTCCACCT GATAACTCTG 27 00 

ATGGATAAGC CTTTTCTCTA CCTTTTAAAC CGACGAGTTC GACAAGTTCT AATGCTTTTT 276 0 

GCTTAGCTCT CCTTCTAGGG ACACCTGCAA TTTCAAGCGG AAACATAATA TTTTTTAACA 282 0 

CAGTCCTTGA CCATAACAAA TTAAAATGTT GGAAGATCAT ACTTACTTTT TGTCTTTTTG 2880

CTCTTAATCC ATTTTTGGAC AATTGACCTA TATGGTCTCC ATCTATAATA ACTTCACCTG 2 94 0 

ATGTAGGCGC TTCTAAATGA TTAAACATTC GAATCAAAGT ACTTTTTCCT GCTCCAGAAA 3 00 0 

AACCAA'TGAC GCCATAAATC GATCCTGCTC GAATCGATAA ATTAACGTGA TCTACAGCAA 3 06 0 
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TTCCTCCCTG TGTTGCTTAA TAAAATAAAA 
AATGCTTTCT CAATATCGAT AGAAAAATTG 3180

AGAAAGCAAT AGTAGTATTG TTTCTCTCAT CTTCAAAAGT TAAAACTTTA TGTGAATTGG 3240

CACCATTTCT ATATAAGACG GTTGCCGGGC TTCGTAGGGC ACATCCCTCC ACCACTCTCG 3300

ATAAGAGTTT ACGCATCATT TAATTTGTAT TAATCCTAAC ACCTTAGTAA AATTTCGTCA 3360

ATAACTATTT TAAATTTTCT AACAAATCAG TCACCGATTT AAATGCATAA ATTCGTTTTA 3420

CTTCTTTATC TTTATTCATC AACAATAAAA TCGGCGTAGA CATGATTTGC ATATCTTTAC 3480

AAAACTGAGG ATAAAAGTTT AAATCTATTT TCAATAATGG TAACTGCAAT ATTTCATTAG 3540

CAATGTCTAA CATTCTTTCT GaAACCTTAC AAGTACCACA CGTTGGTGTA TAACCAAAGA 3600

TTAAATGTTT GTCTTCCTCA TAAAATGTAG TTACATCTTT GATGTCTAAT GAATTATTCA 3660

TTTACTAAAA CTAACCTTTC ATTATTTATA TTCGGTAAAA GAGGTGTTTC TTTCTTACAA 3720

GTAAAGCCAT GTTTTGAAAG TACATGCGCC AAATATTGTT TGGGGCAATT CGCAACTTGA 3780

CAGTAAGTTT TATCAATAAA TATATGTTCA CTTTCACTCA AATAACGTTT AAACCAATTT 3840

CTAATTCGAT CTCCTTCGTC ATCAGAATCG GCTAATACAA AAACTTGTTT ATCATACAGT 3900

GATTCTATCA TATCATCAAG CTTATCTATA CTCATTGTTC CATGAGTACA AATAATATTG 3960

ACTGGTTCTG CAATAACCTG TTGCACCCTT TTTTTATCAG ATTTTCCTTC AACAATTATC 4020

ACTTTATTTA CAATAGCCAT CATCATCACC CTTTAAAATC AATAAACATC TGTCACTGTA 4080

TCATTTCACA AAATTGGTAT GAATAAAACA TAAATCACAA AAAATTTAAA CTAGCTTAAT 4140

ATAATAATTA CAAACTCAAT GTTTGACTAG CTGGAACATT TAACATAAGC AGACAAAGGC 4200

TAAGTCAAAA ATCAACATCC TAAAATCTAC AATGTTATAT TAACAATAGT TAACCAAAAG 4260

AAAATACACC TATAACAAAC TTTTCAATTA TAGCGGGGCC CCAACACAGA AGCTGATGGT 4320

AAGTGAGCTT ACAATAATGT GCAAGTTGGC GGGGCCCCAA CATAAAGAAA TACTTTTTCT 4380

TTAGAAATTA GTATTTCTTA TGCATGAGTT TTACTCATGT ATTCCTATTT TTAAATACAC 4440

ATTAGCTGTG GCTTATGAAA ACAGGCTGGG ACATAAATCA ATGTTCTATG CTCTACGAAg 4500

TTATATTGGC AGTAGTTGAC TGAACGAAAA TGCGCTTGTA ACAAGCTTTT TTCAATTCTA 4560

GTCAGGGGCC CCAACACAGA GAATTTCGAA AAGAAATTCT ACAGGCAATG CAAGTTGGGG 4620

ATGGGCCCCA ACAAAGAGAA ATTGGATTCC CAATTTCTAC AGACAATGCA AGTTGGGGTG 4680

GGACGACGAA ATAAATTTTG CGAAAATATC ATTTATGTCC CACTCCCTAG ATTGATCTAT 4740

AGATACTACA CTTATTAAAG TAATATATTT TTATGATTCT CTTAGCTGCA ATCCCATGAA 4800

TACATGTAAT CATCAAACTT CATAGCCTCA AGGTCAGTAG ATTTCA 4846
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1843 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 277: 

AACAAAGACA CAATCGAACA TGAACCATCA GTAAAAGCTG AAGATATATC AAAAAAGGAG 60 

GATACACCAA AAGAAGTAGC TGATGTTGCT GAAGTTCAGC CGAAATCGTC AGTCACTCAT 120 

AACGCAGAGA CACCTAAGGT TAGAAAAGCT CGTTCTGTTG ATGAAGGCTC TTTTGATATT 180

ACAAGAGATT CTAAAAATGT AGTTGAATCT ACCCCAATTA CAATTCAAGG TAAAGAACAT 24 0 

TTTGAAGGTT ACGGAAGTGT TGATATACAA AAAAAACCAA CAGATTTAGG GGTATCAGAG 3 00 

GTAACCAGGT TTAATGTTGG TAATGAAAGT AATGGTTTGA TAGGAGCTTT ACAATTAAAA 36 0 

AATAAAATAG ATTTTAGTAA GGATTTCAAT TTTAAAGTTA GAGTGGCAAA TAACCATCAA 420 

TCAAATACCA CAGGTGCTGA TGGTTGGGGG TTCTTATTTA GTAAAGGAAA TGCAGAAGAA 48 0 

TATTTAACTA ATGGTGGAAT CCTTGGGGAT AAAGGTCTGG TAAATTCAGG CGGATTTAAA 54 0 

ATTGATACTG GATACATTTA TACAAGTTCC ATGGACAAAA CTGAAAAGCA AGCTGGACAA 600 

GGTTATAGAG GATACGGAGC TTTTGTOAAA AATGACAGTT CTGGTAATTC ACAAATGGTT 66 0 

GGAGAAAATA TTGATAAATC AAAAACTAAT TTTTTAAACT ATGCGGACAA TTCAACTAAT 72 0 

ACATCAGATG GAAAGTTTCA TGGGCAACGT TTAAATGATG TCATCTTAAC TTATGTTGCT 780 

TCAACTGGTA AAATGAGAGC AGAATATGCT GGTAAAACTT GGGAGACTTC AATAACAGAT 840

TTAGGTTTAT CTAAAAATCA GGCATATAAT TTCTTAATTA CATCTAGTCA AAGATGGGGC 90 0 

CTTP^TCAAG GGATAAATGC AAATGGCTGG ATGAGAACTG ACTTGAAAGG TTCAGAGTTT 960 

ACTTTTACAC CAGAAGCGCC AAAAACAATA ACAGAATTAG AAAAAAAAGT TGAAGAGATT 1020 

CCATTCAAGA AAGAACGTAA ATTTAATCCG GATTTAGCAC CAGGGACAGA AAAAGTAACA 1080 

AGAGAAGGAC AAAAAGGTGA GAAGACAATA ACGACrCCAA CACTAAAAAA TCCATTAACT 114 0 

GGAGwAATTA TTAGTAAAGG TGAAyCgAAA GAAGAAATCA CAAAAGATCC GATTAATGAA 12 00 

TTAACAGAAT ACGGACCAGA AACGATAACA CCAGGTCATC GAGACGAATT TGATCCGAAG 12 60 

TTACCAACAG GAGAGAAAGA GGAAGTTCCA GGTAAACCAG GAATTAAGAA TCCAGAAACA 1320

GGAGAyGTAG TTAGACCACC GGTCGATAGC GTAACAAAAT ATGGACCTGT AAAAGGAGAC 13 80 
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CCAACACTAA AAAATCCATT AACTGGAGAA ATTATTAGTA AAGGTGAATC GAAAGAAGAA 1560 

ATCACAAAAG ATCCGATTAA TGAATTAACA GAATACGGAC CAGAAACGAT AACACCAGGT 162 0 

CATCGAGACG AATTTGATCC GAAGTTACCA ACAGGAGAGA AAGAGGAAGT TCCAGGTAAA 1680

CCAGGAATTa AGAATCCAGA AACAGGAGAT GTAGTTAGAC CACCGGTCGA TaGCGTAACA 174 0 

AAATATGGgA CCTGTaAAAG GAGACTCgAT TgTaGGAAAA AGarGAATTc CaTTCaGGAA 18 00 

AGAcGTaATT TaTCCTGTTT AGCACCCGGG GCAGAAAAGT TAG 184 3 
(2) INFORMATION FOR SEQ ID NO: 278: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8536 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 278: 

TAAACAGCGC GTGTACTTGT GATTCCCCCT TCTTCTATTT TACCCACCCG GGAAATAATA 6 0 

25 C 'TTTTCGCGA TTCCTTACTT GaACAAGCAA TATTTTATCa GCTGTTTCTT CAACTAAACA 12 0 

GACACATTTA ATCATCTTTG ACACCCCAAC TTTGTGAAAT CAATTTTTCA AATTATACTG 18 0 

TACAATTATG TTATCATATA TGAGTAGTTA TAGCGCAAAA CGTTAGCAAT TCAGCGCACC 24 0 

CAACTTTTCA TATAAACAGA AGATACTAGG GGGAATTATT ATTATGGCAA AACgTTCCAA 3 00 

ATCACAACGT TTATCAAGTT TACTAAATGT CGCAGGTTTC ATAGTCGACG GCTACAATGG 36 0 

CTATAAATAT CATGCTAAAA ATAAAAAATT AGTATATCTT TCATTAGGTT TAAGCACTGT 420 

AGGAACCGTG TTAGACTTTT ACATTTCAAT TAAGTCACCA CGTAAGTTCA AAAAAGCAGT 480 

GGCP^TTGTT ACTTTAATAA CAAACGGTGC TAGATTATTT ACAAGCATTC GCAAAGTAAA 54 0 

ACATGAATAC TAATTCAGAA AAGGATTGGT CGAACATAGA ACATGAAGTT CATTCGACCA 600

ATCCTTTTTC ATATACAAAA ATTCCTATTA CAACAATCAC GACTTTGATA GCCGCGCCAT 660 

AAAATTTAGG ATTCAATCCA ACTTTTCAGC TTGTGaAATG TAATAGGaAT TCATTATATA 72 0 

TTTATATACG TAAGACTTTA GTGAATATAT CTATAATTAT TTACTTGGTA AGCTGGTACC 780

GTTCTGTAAG TTAAAATACC TGGTGCAGCT GAATAGTTCA TTTCTGAAAC TAAAATACTA 84 0 

CCATCGTTAT TTACACGTTC TACAAACATA ACATGACCAT AGTAACCTAC ATCTGTTTGA 900 

GCGATAGAAC CTACAGTAGG TCTATTGTCG ATAGTGTAAC CATCTGCTGC CGCTGCGTTA 96 0 

TCCCAGTTAT TAGCATTCCA CCAATAAGTA CTAATACCTT TACCAATTTC AGCACGACGA 102 0 
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GTATTGTAAC CTCTATTTGT CGTTGTTGCA GATCCTGAGT TCGTAGATGC aTTACCAGTT 114 0 

ACTTTCAATT TTTGACCCGG ATATATAAAG AAATTATTTA AGCCATTTAA GCTCATAATT 1200 

TTTTGATAAG TTGTACCATA TTTTGATGCA ATTAATGACA ATGAGTCACC TGCTTGTACT 1260 

GTATAGTATG ATCCGCCACC TGAGTTCGTT GATGGACGGC TACTATTGCT CGCAGCGTTA 1320 

CTTGAGCTAG CAGTACCTGA TACTTTTAAT TTTTGACCTG GATAAATAAA GAAATTATTT 138 0 

AAACCATTAA GTCGCATAAT GTTTTGGTAA GTTGTACCAT ATTTTGATGC GATTAATGAT 144 0 

AATGAGTCGC CTGCTTGTAC TGTGTAGTAT GATCCGCCAC CTGAGTTCGT TGATGGACGG 1500 

CTACTATTAC TCGTAGAATT ACTTGAGCCA GATACTTTTA GTACTTGGTT TGGGAAAATT 1560 

AGATTAGATG TTAAATTGTT TAATGACTTT AATTTAGCAA TCGAAATCCC ATACTTATTT 1620 

GAAATTGCCC ACACTGATTC ACCCGGTTTT ACTGTGTGAG TTGTAGCCGC ATTTGCTTGA 1680 

GTTGCCGCAA CAGCGCTAAT CGCGCTTGTC CCAATAATAG CTGCAATTAC TTTTTTTTGC 174 0 

ACTTTAAAAT CCTCCTCTTG CTTAACTTTC CTAACATTCT TTTATCCGAA TTTATGAATA 1800

CTACATCATT ATACGATTTT ATTATGTATA ATAGGTTGAT GTTTGATGAC ATTATGaTTA 186 0 

CAAAAAAATC ATATACTGTA TCATCAAATT TTATAATTAT CCCTTAAAAT TATTACAACT 1920

TATTAGATTT TACAATATCT AAATTATTAC AATTTCATAA TATTTCACTA TAAAATGATT 1980 

ACAATCCCTT TCTCTATTGG AAATAATTTT ATTCTCCAAC AATAACGCCC TACAAACATA 204 0 

AGCATGAACT TTGCTTGTAG GACGCAATAT AATTTATTTT GCTAATGACG TTTCTATTGC 2100 

CTTAATCTCA TCTTTAGATA AATTAACAGG TTTCTCTCCA TCTTTGACAT CTTCCTGCAA 216 0 

CGCTTTTTGA GCTTCTTTTG AATGATACAA TTCTACGATT TTAGCATATG TTTTGTTATC 222 0 

CAAGTCTTTG TCATTAACTG CAACAATATT AATATATGGC TTTACTGCAT CTGAATTTGA 228 0 

TTTTTCTAAA AATATCGGAT CATTTTTAGG ATCTTTACCC GCTTTAGTTG CTACACCGTT 234 0 

ATTAATAACT GCAATATCGA CATCAGATAA AGCACGTGCA GTTTGTTGTG CATCTACTGC 24 0 0 

AGTAATTTTT AAATGTTTTG GATTTGACGT TATATCTTTC ACCGTGCCTG CTAATCCGAA 246 0 

ATCTTTTTTC AGTTTTATTA AACCAGCTGC TTCTAATAGT TTAAGTGCAC GTGCTTGGTT 252 0 

TGACACATCA TTTGGAATGA CAACTTTAGC ACCATCTTTA ACCTTTTTGA CATCTTTAAT 258 0 

TTTATCTGAG TAAATGCCCA ACGGTGCTAA AACTGTTGTA CTTAATGCTG AAATCTTTGT 264 0 

TCCTTTATGC GCCTTTTTAT ATTGATCTAA AAATGCAAAA TGTTGGAATG CATTCATATC 270 0 

AATATCACCA TCATTTAATG CTTTATTCGG TAAATTGTAA TCTGAGAAGT GCTTAATCTC 276 0 

......... ...^^^ — .j^^^f^ry^^ TCCCAAGCCT TAGTGTCATT 2820 
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50 



AAGTGCTACG ATAACTAACC CAATCAATCT TTTCATTCTA TCAATTCCTT TCAAAATCTT 2940

CACTATATAT CATTAATGTC TACGTATGAA TCTAGCTAGA ACATTCCCTA GCGTTTGAAT 3000

CACTTGGACA ATAATGACTA ATACAATAAC GGTAATAATA ATGACCGTCG TATCAAATCT 3060

TTGATAACCA TACACTAAAG CTAAGTCTCC TATACCACCA CCGCCAACAG CTCCTGCCAT 3120

CGCCGTACTT CCAATAAGTC CAATAATCGC AGTGGTAATT GCTAATACTA ACGAACCTAA 3180

AGCTTCAGGA ATTAAAAAAT ATCTAATGAT TTGTAGTGGT GAAGCGCCCA TCGnTTTCGC 3240

CGCTTCAATA ATCCCCTCGT CTACTTCCAA TAATGAGTTT TCAACAAGTC TTGCAATGTA 3300

AGGTGCCACA TATACTGTTA AAGGCACGAT GGCAGCAGTC GTACCAATTG AAGTACCTAC 3360

TACTAATTTT GTGAATGGCA CAATCGCAAT TAACAAAATA ATAAATGGTA GTGACCTTAA 3420

AATATTGATT AAAGGATTTA AAACTTGATG TATCACTATA TTGGGCCATA TGCCTTGTTT 3480

TCGAGTAATT ACCAATAAGA CACCTAATGG AATACCAATC ACTGCTCCTA AAAATAAAGC 3540

AATAGATACC ATATATAGCG TTTCGTACAA TGCTTGTAAT AACTGTGCAC TGTCTAAATC 3600

AGAACCAAAC ATATGTTAAT GcACCTCCTC AAATTGAATA TTTTTCTCTT TGAAATATTG 3660

ATTTATTGCC GTGTCTTCAA ATTGTTGATC CATATTAAAT CGAAGCCACA TATAACATAC 3720

GGTGTTACCT TGTATTTCTG ACATAGATGA AAATAAAATT TTAACCTCTC TGCCACAAAT 3780

TTGAATCAAG TCATTTATAA TCGGTTGTGT CACCTGAGTT TCCTCGACGA AGATTTTATA 3840

ATCTTTAAAA TCGCCAACTT GTTCGTCATT CAATCGACGA ATCAATGATG TACTTGGCTC 3900

AGTCTGTATA ACTGTAGACA CAAAATTTTG AGCAATCGTC GTTTTAGGAT GACTAAACAC 3960

CTCTTTAAcA GTTCCTGTTT CAACCACTTT CCCCTTTTCC ATTACAGCAA CACGATTACA 4020

AATGTCTTTA ATAACGCGCA TTTCATGTGT AATCATCATA ATTGTAATGC CAAAGGTTTG 4080

attgScattc TTTAATAACG TCAATATCGA AGCAGTCGTT GCTGGATCCA ATGCGCTTGT 4140

TGCTTCATCG CATAGGAGTA TTTTCGGATT AGTAACAAGC GCTCTTGCAA TAGCCACCCT 4200

TTGCTTCTGC CCACCAGATA ATTCATCAGG AAATTGGTCT ■rriTl'ATCAC TCAATCCTAC 4260

AAATTCAAGC ATTTCCGTTA CTCGTTGCTT AATTTCTGTT TTGCTTTTCT TACTTAAAAT 4320

GAGTGGCATT GCTACATTTT TAAATACGGT AGCTGAATTT AATAAATTGA AATGCTGAAA 4380

TATCATACCG ATATCTTTCT TAATATCCCT CATCATTTTA TCGCTATAAT TCGTAATATC 4440

ATGTCCATCT ACAATCACTT GTCCATTCGA GGCAGCTTCA AGATGATTCA CGAGTCTTAC 4500

CAACGTACTT TTTCCTGCAC CACTATATCC AATCACACCA AAAATATCAT TGCGATTGAC 4560

CGTAAATGAT ACGTCCTTCA AAGCATCTAT TTTTTGCTTC TTTTTATTAA AGGTCTTACT 4620



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10 




TAAGAGCATT 
AATTAAAAAA 
CAAATTAAAT 
TCGCACAAAA 
AGTGCTGCTA 
GTATCTTCAA 
ATATCTGCTG 
GCACCTAAAC 
TCTGTATTTT 
ACATCATGAT 
GCTTGTAACA 
AATGTTTTAA 
GAATTTGAAA 
GCAACGACAT 
CCTAAATCTA 
AAACCGTGTT 
TTAGAAGGTG 
TCTGTATGCG 
AATGCGCGAT 
AACAGCATAA 
CCATTTTCTA 
GAATATTCAT 
TAAATTGGTG 
TTAGTTTTCT 
ATCACTACCA 
CAATGCACCT 
CAAACTTTTG 
ACGTCTCTCT 



ATATGTAAAA 
TAAGTAATTC 
CCATAAAAAT 
AAGCACTAGC 
TTATTTATAA 
TACCTACAGA 
GAATGGATGC 
TTTCAGCTAA 
TCACTTCAAA 
TTAAATGACT 
TTTTAATAAT 
TACCCCTCAC 
TAAATGCTAA 
CACTATGTCC 
ATGGATTCTG 
CTTTCGCAAT 
TTTCAATAAA 
TTGTATCTAC 
^AGTACCGCC 
CAACTGCACT 
ATGTCGCAAT 
ATCCTTGACG 
TTGTAACGGC 
TGTTCATTAT 
TCTGGAAATA 
TGTAATGCTG 
ACATTTCGAA 
AAAAATATCG 



TTGCATATAT 
ATGTGACAAT 
GTCCACCAAT 
ATATTCTikAGA 
AGTATCTAGT 
AATTCTTACA 
ATGTGTCATC 
TGTGTAATAC 
TGCAATCACA 
TTCAATACTT 
TTCAATAACG 
AAGTAAATAG 
ACGTTCTGCA 
ACCTAAATAT 
ATAATAAGGT 
TTCAGCAGAC 
CAACATCTTT 
AAAATCCACT 
GTATACATCT 
GATTGCTGCA 
AACGCTTTCT 
TAAATCACCA 
ACCTGTATAA 
TATTCTCCTC 
CGACAACAAT 
CACCTGaAGA 
AGGCATCTTG 
GCCATTTCTC 



CGTcAATACA 
GACGAATTGT 
CCTCCACAAC 
ACAACAAACG 
GCTTGTTTTA 
AGTCCGTCTG 
AATGCAGGTA 
GATGTTGCTT 
CCTGTATGGC 
GGATGGAACA 
CTGCGATTAA 
CTATCTTOAG 
AGCTTGTCAT 
TTCGTTGCAG 
GTCATAAATG 
TTTTTAATGT 
GTTGTTGGGC 
TCAATGCCAA 
GAATTTAAAA 
ACACCTGAAC 
ACAGAACTTC 
ATATCATCTT 
TCGTCTGTTG 
ATAATTAAAT 
CGTACCTTCA 
ACTGCCTACT 
ATCTTTAATC 
AGAACCGATA 



ATTTGCCGAA 
GAGACTACTA 
GCAATTACTA 
TTGAACTCAA 
AATCATCGAC 
TAATACCTTC 
CTGAAATTAA 
TAATCAATTG 
CATCCGCTTG 
CTTGTTGCAC 
TTTGTTCCAT 
GTCCTAAAAT 
CCGATGTTGC 
AATGTAAGAC 
TGTTATCAAC 
CAGTAACACG 
GTATCGCTTG 
ATCGTGTAAA 
TAATATGATC 
TAAATGCAAA 
TTGTTGGATT 
GTAAATATGT 
TGTGCCCACC 
ATTTGCTTAG 
GATAATTGCG 
AACAACCCTT 
GTAAATATCC 
CCTTCAGTGT 
-^.^ATAACATT 



TTTTCTAAAA 
TGACATTTAT 
AATATTAACA 
AATATATGCC 

X'"wrw-kA w — * — - 

TTTAGCTCGA 
ACTTTCCACT 
TTTGGCACTT 
AGCCATATGG 
AGCTGGATGT 
ACGTAAACCT 
GCCACCTGTT 
AACTAAACCA 
AATATCGATA 
AACTGAAATC 
TAATAATGGA 
TACAATTGAA 
TACTTTTGTC 
TCCTTTGTCC 
GCCATGTTTG 
CGCAGTACGA 
ACTTGTTTGA 
ATGAATTAAT 
ACATATATCG 
CTTTTAAATT 
CATTTATAGC 
CATCTACAAG 
CATGTGCATG 
GCACGTGATG 



4740 

4800 

4860 

4920 

4980 

5040 

5100 

5160 

5220 

5280 

5340 

5400 

5460 

5520 

5580 

5640 

5700 

5760 

5820 

5880 

5940

6000 

6060 

6120 

6180 

6240 

6300 

6360 

6420 



ATAATCAATT TGCTGTAATG CTGAAGTCAA 
CGGATTATGT TCGGATTCAA ATTGATTCAT 
ACGTGCAGCT AATTGTGCCC CATGCATACC

ACCAAGCGCT ATCATAATAT TAATCTTTTC 
ACATTTCAAA TGATGTCTAT TCGCTGCAAT 

CGCTTCAACA ATAGTTTGAC CTGCACGCAC 
ATATTTCCCG AGTCTGTCTT TAACACTGCC 
AATTTTAACT TTATCATCAC TATAATGTTC 

ATCATAAGTA ATCATAGATG CACCCTCATC 
CTTCATGTCT CAATTTTAAT TCTTACTTTT 

TTTTGCAATT ATCTAAGTTT CGATTAATTC

AAAAAAACAC CTGAGCAACA CAAATACTTG 
TAATTGCACG ATAAAGACTA AAATAATAAC 

CCAACCACTG AATAATCTAA ATCGATCTTT

TAATAATTGT CCTACGACAA GCGTAGTACC 
AACGATGAAA TCCATATTAT CAAAAATCGT 

ACCAAAAGAT AAGGTTGCTG GAATACTAAT 
GACTTTTTTA CGTTTTGTAT TGTCATTCTT 
AGAAATAGAT GACGTTAAAG CCGCAAATAA 

TGTGCCTAGA TGCATTTGAC TAAAGACCAT 
TTCT5'GTGGT TCATAGCCAA AACTATGTAA 
AGATACAAAG ATATTCATAA CAACGATAGA 
AGAGGCATAA CTCGCATAAG taatcatacc 
TTGACCTAGC GCAAACAAGA TGCCATCAGC 
AAATTTCACA CCTTCTAAGA CGCCATCTAA
AAAGACAAAC AGCAATGGCA TCATAACTTT 
TAACATGACA ATAATCATCG TAGCGAATAT 

AGGATTTGAT ATTACCGCTT CAAAATTCAT 
AACTAATTGC CATAATACTT GTCCGATGTA 
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TTCGGGTCCC AATGTATGAA AATATGTATC 6 54 0 

ATAAACGGCA CCATATTTTT CAGCATAGGA 6 6 00 

TTCAGACTGA CTCGTCCTTG AAACTTCTGC 6 6 60 

TTCTGAAAAA CCATACGGCG CAAAGATCTT 672 0 

AGCTAACCCT ATGCCTGTAT TACCAGCAGT 6 780 

ACGCCCTTCT TGAATTGCCT TCTCTACTAA 6 84 0 

TCCAGGATTC CATTGTTCAA GCTTGGCATA 6 9 00 

TAACAGTACT AATGGTGTAT TGCCaATTAA 6 960 

TGACATGCCG ATCAAATGAA TGAAACCTTT 7 02 0 

CAGATAAGAA TTATAAACGA CATTTTGTTA 7080 

AGAACCAGTA CTAAATTTTC AATTCCAAAC 714 0 

TGTGTCAGAT GCTTCTATAT ATTAACTAAA 72 00 

GACAGGCATC GCATACTTAA TTAAGTAATA 72 60 

ACCAAAATAT TGTTGTAATA ATTTTTTATC 73 20 

TAATGCGCCT AATGGCATCA ATACATTCGA 73 8 0 

TCCCGCACCG AATCTTACAT CTTTTAAGAT 744 0 

GATAAATACT AAAATACTAC CGATCACTGC 750 0 

CGTGAAGTTA GAAACATTTA ATTCTAATAA 756 0 

GAACAGCACT AAGAATCCCA AATAGAATAA 762 0 

TGGCAGTACT TTAAATAATA ATCCAGGCCC 76 8 0 

AGCCGGAAAT ATAGCTAGAC CTGCCAATAC 774 0 

AATAGCTGAT GACTTAATCG TCATGTCTTT 780 0 

TGTAGTTCCT AATGATAACG TAAAGAATGA 786 0 

AGTAATCTCT GATACTCTTG GTTGTAAAAT 7 92 0 

TGTTAAAGAC TTAATCACAA TGACGATTAA 79 8 0 

CGATGCCTTT TCTAATCCTT TTTCAACACC 8 04 0 

GAATATACCT TGCCCTAGAA CGGTTAACCA 8100 

TTCTTGGAGA TGATTGATGC GTTGAAATAT 816 0 

AATGACAATC CAACCACCGA TAACACTATA 8220 



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TTTACCAGTT AATTTACTAT ATATTTGTGT TGTATATGTC CGTCCCATTT TCCCAACAGT 8340 

GAATTCCATA ATGAGTAATG GCAACCCAAC AAAAATGGTG AATATTAAGA ACATAGCTAG 84 00 

AAAGGCACCG CCGCCATAAA TCCCTGCCAT ATATGGGAAT TTCCACATGG CACCAAGACC 84 60 

GATTGCAGAA CcCGCACTAG CTAAAATAAA TCCAGTTGAT GACTTCCATT GTGATTGTTG 8520 

TCTTTTCATC ATTCAC 8536




(2) INFORMATION FOR SEQ ID NO: 279: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4328 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 279:

GCTTTGGCCA TTTTATGTGG CGATTGAGAC AATCtGTkGT TGTCTTATTT GATGTTGTAT 60 

TTCAACTGGT AATTCTAGTT GCGATTGAAA TAATGGCAAC TTTTCCCAAT CATTAACAAA 120 

TAATTCAATA CCTGCTATGT CTAATACTTT AGCACGTGCA TCATCAACAA GACGGCGTTC 18 0 

CAATTGATTT GCTTCTTCTT TAATACCTGG TGACGTACTT TCTAATATCA AATTAGATAT 24 0 

AGGGATGTGA CCATTAATTG CATAATATAA TGCAACACGC CCACCCATTG AATATCCAAA 300 


C/L^TGTTATT GATTTATCTT TATATTTATC TAAAATTCGG TCTAACAACG TCGTAATATA 3 60 

ATCAAAATTC CACGTTTCAT CCATTGAAGA CTGATCTTCG CCATGGCCTG GTAAGTCTAT 420 

AGTGATGACA TGATAGTTAT CAGTAAATTT TTCGATGTGA TTATGATAAG TACGGCTGTC 480

GCTAAGAAAT CCATGCAGAA ATACTAAAAC TTGATTGGTC TCAACGTTTG CTTCATAAAA 54 0 

TTT;CrAATGT GTCATGAATC ATTTCACTCA ATTTCTGGTA TAAAATTTGA TGCTGTTTAA 6 00 

AGTTATCTTC GCGATTCGTT ATCAATTCAT AAATCGTCGA AGTTTCAGAT AACAATGTGG 660 

CATTTTTAAA TTCTGAAACA CTGTTAAAAC GTTTAAAATC GAATTGATAT AACTTAGCTG 72 0 

TATACTCGAA ATCCAATCCC GTCGGTGTGC CAAACAACCG TTCAAAATAG TCAGTTGCAC 7 80 


TTTCTTTTTG TGGTAAATAT GAAAAAATAC CGCCACCATC GTTGTTCAAT AATACAATAT 84 0 

TCATCTGAAT ATTATTTAAT TTTGACATTA ATAGTCCATT CATATCATGA TAAAATGATA 900 

AATCACCTAT CAATAATGTT ATTCGTTTAT GCACAGCCAT ACCCAGTGCA GTTGAAACGA 960 


TACCATCAAT ACCATTCGCA CCACGATTCG CATAGACATC TATATTTTTA TTCAATAACA 102 0 
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TAATTTCTTT ACGCCCTTTT TTCTCTAAGC 
TAACTGTCGT GTCTTCCATT AATGACCTAA 
TATCTGGCGC TATCGGAAAG ACATCAATCT

CATCAGTTTT CTTTAACCAT TGATTTAACT 
TTACGAAATC CACATTTAAG TCTAAGCCGC 


CATTCGGATG ATCAAATTTT CTTAAATGAC 
ATATCGTTGA ATACGTTAGT ATTTGaTCAA 
TTAAACCTTT TTTCTTATTT AAAATGTGTC
AGTGCGGTAA AATCTTCATC TCAGAAGTTA 
ACGOATCTCT AAATGGCAAG TTAAAATGAA 
GACTAGCAAT TTGCATTTGA TAGTAAATTG 
TAGGCATATC GAACTCATAA CTTACATAAT 
GTGGTGCGCC TACACTTCTT AATTCATGCG 


TTCTACTAAT TTGGCTTTCA GCAATTGCAG 
TACATAATAT AGCGACAGGT CTTTCACTGC 
CACTTCGCTC ATCGGGGTGT ATCCATGTTT 


CAAGTGGCGT TGAGCGTGAT CCCGGACTGA 
ACTCAGATGC PJ^TQTPiPJ^ ACTTGCTTCG 
TCGACTCCTA ATGCATTCAT CATAGGTGTG

TCTGGATCAG AATCTTTAAC AATGCCACAC 
ATAAGCATCG AACGAATTGC PiACPiATPJiAT 
ACCGGTGCAC CATATAATCC TCGCGTACCA 
AATTCTTTTG GATAGCCACC TAAAGCAGGT 
TACGAATCAT CCTTCAGTGG CGCCTTTATT 


TTTAGAATTT TAGGCGTCTT ATCATAATGT 
AAAATACTGT CAACAACAAA TCGATGTTCG 
TCAACATTTT TTGTATCTTC GTCCTCATCT 


TTAGTCGATA GTATTTTATT ATTGACCTTT 
ATAGAATCTT GTGATTCTAA CAAGAATATA 




ATTGCCATTT TTCTAACCAA CTTACGCGAT 120 0 

AGAAATCATT CGCAGAAATC TCATATGAAA 126 0 

TATCATTGTT TTGCACTAAT ATTTGAAATG 1320 

TTTTAGAAAT CACTGGTTTC CCAACACGAA 1380 

TTCTAAACAG CAAATCATAT GTACAGATAA 144 0 

TTAAAGGATC AGCTAAAATA GGCAAATCAT 150 0 

CTTCTTGGTG CTGCATATCC CCTACAATAA 1560 

TTAATGCCGA TGCATCTATA CTTTTTTGAT 162 0 

ACAATTCTGT TGCATTCAAA TCAGGTGTTA 1680 

TTGGCCCTTT ATGTGGTCCA TATAAATATT 174 0 

CATCAATGGT CTCTTTACTA TCATCCGCAA 18 0 0 

TATTAAACAT ATTTACTTGA TTAATCGCTT 1860 

GACGGTCACT TGTTAAAACG ATTAAAGGAA 192 0 

GCGTATAATT CGCTGCTGCT GTACCTGACG 198 0 

CTTTAATTAA CCCAACTGCA AAAAACGCTG 204 0 

TAATATTTGG ATGTGCTTCA AATGCAAGTG 210 0 

TAACTACTTC CCTTACGCCG TACGCATATA 2160 

TTAAAGCTGC TTTATGATTT CCCATTCATA 2220 

AACTTAAGGT TCGTTTCTGC CAATTCACTA 22 80 

CCAGCAAATA AAGTTGCTTG TGCTTTCTTA 234 0 

TCACAATCAT CGTATATATC TATATAGCCA 2400 

AATTCTTTCT GCTCAATAAA ATCCATTGCA 2460 

GTTGGATGTA AATTATCAAT TAAACTAATA 252 0 

TCAGTGTACA AGTGATATAA ATGATCATTT 25 8 0 

AATTCAGTGA TATAAGGTTT AATATCATGT 264 0 

ATTAAGTTTT TATTATCTTT TAAAAATGCT 270 0 

TGTGAACX5TT TAATTGTACC TGCTACAGCT 276 0 

ATTAATTGTT CAGGTGTTTG TGAAAAGAAT 2 82 0 

TAACTGTTTT TTTCTTTAGA ATATGCTTGC 288 0 
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ACAArrrrrr cTrcArrArr aatagattct atagcttcta CTACAAGrrG acgccagtca 

TCrriATAAA TATCrrCATT TCTAGTAATT TCCCCAATTT GCTCGTCCAC ATCTATGTCC 

gatatattgt tgaacaaatc cattaaatcg ttcaatgcct caacagtaaa Acrrrcccrr 

TTAACrGTAT AAGTTAAAAA TGTCCCATTA rTATCAGrTC AAAITAAAAC TTCAGGTAAT 
ACAAAATGAT TTAGTCCAAA CTCTCGCCAT TCATCATCTTG ArriAi..«u. ..^ ^^^-^^ 



" AACCCTCCAA CAACTCGAAG ATGATGrTTC TCAGATTGCG GATGTATAAA TGTGATGTTA 3300 



ACTATCGG 

(2) INFORMATION FOR SEQ ID NO: 280:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1450 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



3000 
3060 
3120 
3180 



3360 
3420 



TGTrrrAATT TTTCCCAGTC TTTAAAAATA GATTGTTTAT TTTTAGAATT ATTTrTGAAT 
AArrGAATTG CnTGTAGCC AAAATATGAC GTrCGATTAT CATTCAAACG CATATAAAAG 

' CGATCTCCTG CCTCATKJTC AGTGAGATGA AATAATtSTGC TCGGGTCTAG TGACTGTGAT 34B0 

AATrrcAcrr caactgaaac ccATTCcrrr gagctgccat ATATcrcm gacaatatcg 3540 

TCCTCTAATA CGCCCGTAGC CATCCATTTC ACTTCTTTCT TCGTCTTrTT TCACTCATTA 3600 

TTATATrGTA TCATTmXK. ATAATTGTGT TACAAGAATT GCTTAAACTT ATCTTGCAAT 3660 

TTTTCACGTC AATTGACCTr TATGCTACTT TCTATTAAAA TATCTTTGTT ATAAAAAATA 3720 

- TGATTTAAAG AGGTrrTGTA TTCAATGAGT AATCAATATC AGCAATATTC TACAGTTAAG 3780 

AAATATTGGC ATTTAAT^CG TCCTCATACA TTAACTGCTT CCGTAGTACC CGrTTTAGrr 3840 

GGTACAGCAG CATCTAAAAT ATATTrTCTT GGTAGCGAAG ATCATATTAA AATCAGCCTA 3900 

'° rrCATTGCCA TGTTACTAGC ATGCTTACTT ATTCAAGCAG CAACTAATAT GTTTAATGAA 3960 

TACTATGATT ATAAAAAAGO CCTCGAT^AT CATGAATCTG TAGGCArTGG TGGTGCCATT 4020 

GTTCGCAACG GTAT^AGCCC AGAGCTTGTG CTACGATTAG CCATTGCATT TTACATCTTA 4080 

" GCAGCAATAT TAGGTTl^TT rTTAGCTGCT AACTCTTCAT nTGGTrArT ACCAGTTGGA 4140 
TTAGIATGTA TGGCTGrrGG TTACCTATAT ACAGGTGGCC CTTTCCCTAT TTCATGGACG 
CCrrrCGGTG AATTATTCTC AGGCGTATTT ATGGGTATGT rrATTATCGT TATTCCATTC 
TTTATTCAAA CTGGCAATAT TCAAAGTTAT GTAATTTGGT TAAGTGTACC TATAGTAATC 



4200 
4260 
4320 
4328 
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GTTCAATACA GAAAAAATAA ATTTAGATGT TGAAGCATCC TACAATTAAT ACAGATCCAT 6 0 

TTCAATATAT TTAAACTAAA ATCTCGGGAT TTCTAAATTT TGAAATTTCG AGGTTTTnAT 12 0 


ATTTTTATTT AAAATAGCAC ATTTATACTT TATAATAGTA AAGATGAACA TATAAGGAGG 180 

CCAAATCATG GCAAAACATC CATTCGAACA ATTTAATCTA GAATCTAGTT TAATTGACGC 24 0 

TGTGAAAGAC CTTAATTTTG AAAAACCAAC TGAAATTCAG AATCGAATTA TTCCAAGAAT 30 0 


ACTAAAGAGA ACAAATTTAA TTGGTCAATC TCAAACGGGT ACAGGGAAAT CTCATGCATT 36 0 

TTTATTACCA TTAATGCAGT TAATTGATAG TGAAATAAAA GAACCACAAG CAATCGTAGT 42 0 

TGCACCAACA AGAGAACTTG CACAACAACT ATACGATGCA GCGAACCATT TAAGCCAATT 480 

TAAAGCTGGT GTTTCAGTTA AAGmTTAT TGGTGGTACA GATATAGAGA AAGATAGACA 54 0 

ACGTTGTAAT GCACAACCAC AATTGATTAT AGGCACCCCT ACTAGAATTA ATGACTTAGC 60 0 

TAAAACGGGA CATTTACATG TGCACTTAGC ATCATATTTA GTTATTGATG AAGCGGATCT 660

TATGATTGAC TTAGGATTAA TTGAAGATGT AGATTACATT GCTGCAAGAT TGGAAGATAA 720 

TGCAAATATT GCGGTGTTTA GTGCTACAAT CCCACAACAG TTACAACCAT TTTTAAATAA 780 

25 ATATTTAAGT CATCCAGAAT ATGTAGCTGT CGACAGTAAA AAACAAAATA PJ^QPJkQKI: 840 

CGAATTCTAT TTAATACCTA CTAAAGGTGC AGCTAAAGTT GAAAAGACTT TAAATTTAAT 900 

TGATATACTA AATCCATACT TATGTATTAT TTTCTGTAAT AGTAGAGATA ATGCAAATGA 960 


TTTAGCACGT TCACTAAATG AAGCTGGTAT TAAAGTTGGT ATGATTCATG GTGGCTTAAC 1020 

GCCaCGTgAA CGTAAACAAC AAATGarACG TATACGTAAT TTaGAATTCC aATACGTTAT 1080 

TGCCaGCGAT TTAGCATCTC GTGGTATTGA TATTGAAGGT GTTAGTCrTG TCATCaATTT 114 0 


TGATGTGCCA AATGATATTG ACTTCTTTAC GCATAGAGTC GGACGAACTG GTCGTGGGAA 120 0 

TTATTrAGGT GTAGCAATTA CGCTTTATAG TCCTGATGAA GhACACPJ^Th TTTCATTAAT 126 0 

AGAAGATCGC GGTTTTGTAT TCAATACTGT TGATATTAAA GATGGTGAGT TAAAAGAAGT 1320

TAAAGCGCAC AATCAGCGTC AAGCAAGAAT GCGCAAAGAT GACCATTTAA CTAATCAAGT 13 8 0 

GAAGAACAAA GTTCGAAGTA AAATTAAAAA CAAAGTTAAA CCAGGTTATA AGAAGAAATT 144 0 

TAAACAAGAA 1450 

(2) INFORMATION FOR SEQ ID NO: 281: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1139 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



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w 






(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 281: 

AGTCAGGTAT ATCATGCCaT yCTGAATTGG TCGATATTAA TATCAGTGGT GTTAAAGAAC 6 0 

GAATTGTATA CCAATAGACG CTTTATATTG TAAAATAGTA TTAAATGCaG AATAGAGAGG 120 

AGATTTAATG CGATATGACA AATTATAAAG TTGTCGTTTT AGATATGGAT GACACATTGC 180 

TAAATTCAGA TAATGTGATA TCAGAAGAAA CTGCAAATTA TTTAACAGCA ATTCAAGATG 24 0 

AAGGTTATTA TGTTGTTCTA GCATCTGGTA GACCTACTGA AGGTATGATT CCAACTGCTA 3 00 

GAGATTTAAA ATTACCTGAA CATCATAGCT ATATTATTAG TTATAACGGT AGTAAAACGA 360 

TTAACATGAC TAATGAAGAA GTAGAAGTAA GTAAATCGAT TGGTAAGCAA GATTTCGATG 4 20 

AAATTGTAGA TTATTGTCGA GATAGAGGCT TTTTCGTTCT TACATATCAT GATGGTCAAA 4 80 

TTATTTaCGA CAGCGAACAT GAGTATATGA ATATTGAAGC AGAATTAACA GGTTTACCGA 540 

TGAAACGTGT TGATGATATC AAAGCGTATA TTCAAGGCOA TGTACCCAAG GTCATGGGTG 600 

TAGATTATGT AGCGAATATT ACAGAAGCTA GAATTGATTT GAATGGTGTG TTCAATGATA bbO 

ATGTAGATGC TACGACAAGT AAGCCATTCT TCTTAGAATT TATGGCCAAA GACGTTTCAA 720 

AAGGTAATGC AATTAAAGCG TTATGTCACA AATTGGGATA TTCGGTGGAT CAAGTCATTG 78 0 

CTTTTGGTGA TAGTATGAAT GATAAATCAA TGTTTGAAGT CGCAGGTCTA GCTATTGCTA 84 0 

TGGGGaATGC ATCAGATGAA CTTAAGCAAT ATGCAAATGA AGTTACGTTG GATCATAATG 900 

AAAATGGTAT TCCACATGCG CTCAAAAAAT TGTTATAAAT TTTAAAATAA GCCTTAACAC 960 

ATGATATTTG AATAAGATAT CTTGTGGTTA AGGCTTTTTA TTTTTGTGAA AATGACTTCA 102 0 

GTTATACTAT GGAGGATTTG AAATACATAT TTTAGATTAG TAATGATATC AAACGAATAG 1080 

AGTAAATGTA TATTTttTGA ATAAATCAAG TATTAACTAG TCACGGAAGG nAGATAAAT 1139
(2) INFORMATION FOR SEQ ID NO: 282: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2931 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 282:

TCTAAAAATG CTGTGAAATT CTTTTATAAA TATCTAAAAG GAATTAATGT TGATAACATT 6 0 


GCTGTGATAG GAAGThAGAC AGCGCAATAT TGTGAATCAC TTGGCATTCG AGTTGATTTT 120 

'^AAAATCAT TTAATCAAAC TAACCAAAAA 180 
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AATGAAGTTG TTAAAATAGA TTTATATACT TCAGTGCCTA ACAAACAAAA TATACAAGAT 3 00 

GTTAAAGAAA TGATAGAACA TCAACAAATC GATGCATTAA CATTTTCAAG TTCGTCGGCA 3 60 

GTACGTTATT ATTTTAATGA AGGATTTGTA CCAAAATTCA AGTCGTATTT TGCTATTGGA 42 0 

GAACAAACAG CACGGACCAT TAAATCATAT CAACAACCAG TAACAATTGC AGAAATTCAA 480 

ACACTCGAAT CACTAATTGA AAAGATTTTA GAAAGTAGGG GCTAAAAATG AAATTTGATA 54 0 

GACATAGAAG ATTGAGATCA TCAGCGACAA TGAGAGATAT GGTTAGAGAG AATCATGTAA 60 0 

GAAAAGAAGA TTTAATATAT CCAATTTTTG TAGTTGAAAA AGACGATGTG AAAAAAGAAA 66 0 

TTAAGTCATT GCCAGGTGTA TACCAAATCA GTTTGAATTT ACTTGAAAGT GAATTAAAAG 72 0 

AAGCTTATGA CTTAGGCATA CGTGCCATTA TGTTTTTCGG TGTTCCAAAC TCAAAaGATG 780 

ATATAGGTAC TGGTGCATAC ATTCACGATG GTGTTATTCA ACAGGCAACA CGTATTGCTA 84 0 

AAAAAATGTA TGATGACTTA TTAATTGTTG CAGACACTTG TTTATGTGAA TATACTGATC 900 

ATGGTCATTG TGGCGTGATT GATGACCATA CACATGACGT TGACAATGAT AAATCATTGC 96 0 

CACTACTTGT TAAAACAGCA ATTTCTCAAG TGGAAGCTGG TGCTGATATT ATTGCGCCAA 102 0 

GTAATATGAT GGATGGTTTT GTTGCTGAAA TTCGTCGTGG ATTAGATGAA GCCGGCTATT 1080

ACAATATTCC TATAATGAGT TATGGTGTCA AGTATGCATC AAGTTTCTTT GGACCTTTTA 1140 

GAGATGCAGC AGATTCAGCG CCATCATTTG GGGATAGAAA AACGTATCAG ATGGACCCTG 1200 

CTAACCGTTT GGAAGCACTT CGTGAATTAG AAAGTGATCT TAAAGAAGGG TGCGACATGA 1260 

TGATTGTTAA ACCTGCTCTA AGTTATTTAG ATATAGTTCG AGATGTTAAA AATCATACGA 13 2 0 

ATGTTCCAGT TGTTGCATAT AATGTGAGTG GAGAATATAG TATGACTAAA GCAGCGGCAC 1380 

AAAATGGTTG GATAGATGAA GAACGTGTCG TTATGGAACA AATGGTTTCA ATGAAACGTG 14 4 0 

CAGGTGCTGA TATGATTATT ACGTATTTTG CAAAGGACAT TTGTCGCTAT TTAGATAAAT 1500 

AAGGTTTTAT ATTTATGATT TTCCATAAAC TGTAGGAGGA ATTTACTTTA TGAGATATAC 1560 

GAAATCAGAA GAAGCAATGa AGGTTGCTGA AACTTTAATG CCTGGTGGTG TAAATAGTCC 1620 

AGTACGCGCA TTTAAATCAG TAGATACACC AGCAATTTTT ATGGATCACX5 GTAAAGGTTC 16 80 

AAAAATTTAT GATATCGATG GTAACGAGTA TATCGACTAT GTACTAAGTT GGGGACCACT 1740

TATTTTAGGA CATAGAGACC CTCAAGTTAT TAGTCATTTA CATGAAGCAA TTGATAAAGG 1800 

TACAAGTTTT GGTGCATCAA CATTACTTGA AAATAAATTG GCGCAgcTCG TTATTGACCG 186 0 

AGTACCTTCA ATAGAAAAAG TGCGTATGGT GTCATCTGGT ACAGAAGCTA CATTGGATAC 1920 

TTTAAGATTA GCACGTGGTT ATACTGGCAG AAATAAAATT GTGAAATTTG AAGGTTGCTA 19 80 
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GCCGGATTCT CCTGGTCTGC CTCAAGGTAT TGCTAAAAAT ACAATTACAG TTCCATACAA 
TGATTTAGAT GCACTTAAAA TCGCTTTCGA AAAATTTGGa AACGATATTG CTGGTGTAAT 
CGTAGAACCT GrTGCrGGTA ATATGGGTGT CGTACCGCCG ATTGAAGGTT TTTTACAGGG 
ATTAAGAGAT ATTACGACTG AATACGGCGC ArrGCTAATT rrCGATGAAG TAAT^ACTGG 
TTTCAGAGTC GGTTATCATT GTGCACAAGG TTACTTTGGT GTGACACCAG ATTTAACTTG 
CTTAGGAAAA GTTATCGGTG GAGGACTACC TGTAGGTGCA TTTGGTOGTA AAAAAGAAAT 
CATGGATCAT ATAGCACCAT TAGGAAATAT TTATCAAGCG GGTACGTrAT CAGGAAATCC 
TCTTGCAATG ACAAGTGGTT ATGAAACGrT AAGCCAATTA ACGCCAGAGA CATATGAGTA 
TTITAATATG TTAGGCGATA TACTTGAAGA CGGTTrAAAA CGTCTATTTG CTAAACACAA 
TGTACCAATA ACTGTAAATA GAGCAGGTTC AATGATTGGT TATTTCTTAA ATGAAGGACC 
TGTAACTAAT TTTGAACAAG CGAATAAAAG TGATTTGAAA TTATTrGCAG AAATGTATCG 
AGAAATGGCA AAAGAAGGTG TGTTTTTACC ACCATCTCAA TTTGAAGGTA CATTCITATC 
TACGGCACAC ACGAAAGAAG ATATTGAAAA AACGATTCAA GCATTTGATA CGGCTITAAG 



TCGTATTGTA AAATAAATAT ACGGACAAAT TGAGAGCCTG AACTTTGTTC AGGCTCnTTT 
TAAATGTATA TAAGGCATGG GCGGCGACTT GATAGTGAAA GTCCACTACT A 
(2) INFORMATION FOR SEQ ID NO: 283: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1421 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 283:



2460 
2520 
2580 
2640 
2700 
2750 
2820 



2931 



AATTATGAAT 


GCATTACCAG 


TATTATTACA 


AAAGAACAAT TAAAAATGTT 


TGTTTATGAT 


60 


TATGATACGC 


ATCTCATTAA 


AAATGTAaTG 


GTTGCAGCAG 


ACGTGTTAAA 


GGCAAATGAT 


120 


ATTCAAGGAC 


ATGAACCATT AATCGTTAAC 


CTTCAAACGA 


TTGATGAAAC 


ATTACATCGT 


180 


TTACCTATGC 


ATAATAGAAA 


AGACATGATG 


GTTAATGGCG 


GTGTACTTAT 


GGCACATTTA 


240 


AATGCCAAAA 


GTGGTCCGTG 


GTTAAAAGAT 


GTGCTAAGAC 


AAATTGAGAT 


AGCGATTGTA 


300 


ACAGGTAAAG 


TAAGCAACGA 


AGAAACTGAA ATTTTGAAAT 


GGGTGGATAA 


TCATGTCAAA 


360 


ATATAGTCAA 


GATGTACTTC 


AATTACTCTA 


TAAAAATAAA 


CCGAATTATA 


TATCTGGACA 


420 
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CCCAGATATT TGGTATCAAG GTATAATAGA CCAATKTACA AAAAGTTCTG CTTTGTTTGA 60 0 

TTTTAGTGAA GTATACGATT CAATAGATTC TACACAACTT GCTGCGAAAA AGTCACTTGT 6 60 

TGGAAATCAA TCTTCATTTT TTATCTTGAG TGATGAACAA ACGAAAGGTC GTGGGCGATT 720 

TAATAGACAT TGGAGTTCTT CAAAAGGGCA AGGACTTTGG ATGTCTGTCG TGTTAAGACC 7 80 

TAACGTTGCA TTCTCAATGA TATCTAAATT TAATTTATTT ATTGCATTAG GGATAAGAGA 84 0 

TGCGATTCAA CATTTTAGTC AAGATGAAGT CAAAGTGAAA TGGCCGAATG ATATATTTAT 900 

TGATAATGGT AAAGTGTGTG GTTTCTTAAC TGAAATGGTT GCTAATAATG ATGGTATAGA 960 

AGCAATAATA TGTGGTATAG GTATTAATTT GACGCAACAA CTAGAAAACT TTGATGAAAG 1020 

TATTAGACAT AGAGCAACAA GTATACAATT ACATGATAAA AATAAATTAG ATAGATATCA 1080 

ATTTTTAGAG ATATTACTTC AAGAAATTGA AAAAAGATAT AATCAATTTT TAACGTTACC 114 0 

TTTTTCTGAA ATTCGTGAAG AATATACTGC AGCTTCTAAT ATTTGGAATA GAACGTTGCt 1200 

ATTTACAGAA AATGATAAAC AGTTTAAAgG ACAAGCAATT GATTTAGATT ACGATGGCTA 12 6 0 

TCTAATTGTT AGAGATGAAG CGGGTGAATC ACACCGTTTA ATTAGTGCAG ATATAGATTT 1320 

TTAACACTAA AGCAAGGAGA GATAGCTATG GGTATGGCAA CCTATGCCGT TGTGGATTTG 13 80 

GnAACAACAG GCAACCAATT AGATTTTGAC GATATCATTC A 1421
(2) INFORMATION FOR SEQ ID NO: 284:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2202 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 284:

CCAAGTTGCC TAAAATGATT AAGCAAGGTT TATACCCTAT GOlAACGATT GAACAAGAAT 6 0 

CTGGAGCCAT CCGACTGCCA ACGATTTCTA GAGTGAnGCG TTCATTACAA TGGGGTAATG 12 0 

ATGCTTATAC AATGATTTTA GATCGTATGA ATATTGAAAC AAATGAATAA TAAATGAACG 180 

ATAAACAATG GTTATCTATC TGCACTAATA AGGTAGATAA TCATTGTTTT TTCACGAAAA 24 0 

AATTTACAGA GTAAAAGAAC TTAAATTTCA TATTAAGTCT TTAGAACTCG ACACTTAAAA 3 00 

ATGCTATAAT CATATGTATG TTAAAAAAGG AGTTTCGGAA AATGTATGAC ATTAAAAAAT 3 60 

GGCGCCATAT TTTTAAATTA GACCCAGCTA AACATATTTC AGATGATGAT TTAGATGCGA 420 

TTTGTATGTC TCAAACAGAT GCAATTATGA TTGGTGGAaC TGATGACGTT ACTGAAGATA 4 80 
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CAAACATCGA AAGTGTAATG CCTGGTTTTG ATTtTTATTT TGTACCTACA GTACTGAACA 600 

GTACAGATGT TGTATTTCAC AATGGTACAT TATTAGAAGC GCTTAAAACA TATGGACATA 66 0 


GTATAGATTT TGAGGAAGTA ATATTTGAAG GGTATGTCGT GTGCAATGCT GATAGCAAAG 720 

TGGCAAAACA TACCAAAGCA AATACAGATT TAACAACAGA AGATTTAGAA GCATATGCCC 78 0 

AAATGGTCAA TCATATGTAT CGATTACCGG TTATGTATAT AGAGTATAGT GGCATTTATG 84 0 


GCGACGTATC AAAGGTTCAA GCTGTCTCAG AACATCTAAC AGAAACGCAA CTTTTTTATG 900 

GTGGCGGTAT TTCCTCAGAA CAACAAGCGA CAGAGATGGC AGCTATTGCA GATACAATTA 96 0 

TCGTCGGTGA TATTATTTAT AAAGATATTA AAAAAGCTTT AAAAACAGTA AAAATAAAGG 1020

AGTCTAGTAA ATGAATGCGT TATTAAATCA TATGAATACA GAGCAAAGTG AAGCTGTAAA 1080 

GACAACAGAA GGACCATTGT TAATTATGGC AGGTGCTGGT TCAGGGAAGA CACGTGTTTT 114 0 


AACACATAGA ATTGrTTATT TATTAGACGA AAAAGATGTC TCACCATACA ATGTTTTGGC 1200 

TATTACTTTT ACAAATAAAG CTGCAAGAGA AATGAAAGAA CGTGTTCAAA AATTAGTAGG 1260 

TGATCAAGCA GAAGTTATTT GGATGTCAAC ATTCCACTCA ATGTGTGTTC GTATTTTACG 1320 


TCGTGATGCA GATCGAATTG GTATAGAACG CAATTTTACG ATAATTGATC CTACAGACCA 13 80 

AAAATCTGTT ATTAAAGACG TCTTAAAAAA TGAAAATATT GATAGTAAAA AGTTTGAACC 144 0 

TCGTATGTTT ATCGGTGCGA TCAGTAATTT GAAAAATGAA CTTAAAACAC CTGCAGATGC 150 0 

TCAAAAAGAA GCCACAGATT aTCACTCgcA AwTGGTaGCA ACgGTTTaTA GTgGATATCA 156 0 

ACGCCAATTG TCACGTAATG AAGCGTTAGA TTTTGATGAC CTTATTATGA CAACGATTAA 162 0 

CTTATTTGAG CGTGTACCAG AAGTTCTAGA ATATTATCAG AACAAATTCC AATATATTCA 1680

TGTAGATGAG TATCAAGATA CTAATAAAGC ACAATACACA TTAGTTAAAT TATTAGCAAG 174 0 

TAAGTTTAAA AACTTATGTG TTGTAGGTGA CTCAGATCAG TCAATTTATG GTTGGCGTGG 1800 


TGCTGATATT CAAAATATCT TATCATTTGA AAAAGACTAT CCAGAAGCGA ATACAATCTT 1860 

TTTAGAGCAA AATTATCGTT CGACGAAAAC GATTTTAAAT GCGGCTAACG AAGTGrTTAA 192 0 

AAATAATTCT GAACGTAAGC CAAAAGGACT GTGGACTGCA AATACGAATG GTGAGAAAAT 198 0 


TCATTACTAT GAAGCAATGA CGGAACtGAT GAAGCGGAAT TTGTAATACG AGAAATTATG 204 0 

AAGCATCAAC GTAATGGTAA GAAATATCAA GATATGGCAA TTTTATATAG AACGAATGCA 2100 

CAATCACGTG TACTTGAGGA AACATTCATG AAATCTAATA TGCCATACAC AATGGTTGGT 2160

CGCCAAAAGT TCTATGACCG TAAAGnAATC CAAAGATTTA TT 2202
(A) LENGTH: 785 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 285: 
AGTGGTGCAA AGATAGGCAT TGATaATACC GCTAAGCCAG CAAGATGATG GCACGATAAA 60

ACCTAAACAG AAGAAmATAA ATAGTAATAC GATGATAAAT AATGGTCCAC TCATATGTTG 120 

AACTAAAGAT GATGAAAAGT GTAAGATTGT ATCTGAAATC ATACCTTCAT TCAACACTAA 180 

ATTAATACCT CGAGCTAAAC CAATAATTAA AGATACACCT ACTAAACTTG ATGCACCATT 24 0 

GACAAATGCA TCTACAGTTC CTTTTTCTCC CAATCCAGAT TTACCTGTCC CAGCAATAAA 3 00 

CATTATTATA ATTGTAAATA TTAAAAATGC TGAAGCCATA ACTGGGAACC ACCAACCTTG 3 60 


CGTCATAACT CCCCATACCA TAATTGGAAA TGGTAGTACA AATAATGTAA GGATTATCTT 420 

CTTACGCAAA GTAAAATGGG CACTATCGTC ATCTTTTAAT ACAGACCATT GCTGTTCAAA 4 80 

AGCATCTTTG TCTTCATAAG AATATGACGC TTTAGGATCG TTTTTAATTT TTTTACAGTA 54 0 

CCAATATAAA TAACTAATAA CAAAAATCGC ACCGACAATA CAAGCACCTA TTCTCCAATA 600 

CAAGCCATCC GTAAAAGTTG TACCAGCGGC ATTAGAGGCA ATTACAACCG AGAACGGGTT 6 60 

AATAGTTGAA AATGTACTAC CGACAGAGCT GGcAAGGAAT ATGGCACCAA CTGGAAACGA 720

TAGAATCGTA TCCTAACGCT AATAAATATA GGGACTAAAA TCGGATAAAA TGCTACAGCT 780 

TCTTC 785
(2) INFORMATION FOR SEQ ID NO: 286:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 812 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 286: 

CTAACGnGAT AAGGTTGCAA nTTTATCTGA ACATCTGATG ACTGTAATTT TGTtAATGAT 60 

AAAATATTTG TCACTAATAG ATATAAATAC TGACTTTCTT GAAAACTATG TACAAGTAAT 120 

TGTTCCTTTT CTATGATAGA CATATCTTTA CTATGTGATA CTAAAATATC TAAATkTCCC 180 

ATAATTGTTG TTAACGGTGT ACGTATGTCA TGCGAAATTG ATCTTAAAAA ATTTGAATGT 24 0 

GTCAGTTGAC GTTCAGCCTG TAACATGGAT TCTCTCGTTT GTTTAAGTAA CGTCACATTT 3 00 
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ATCACTTGAG AACTTTGGTA ATCAATGGCT AGAATGCCTT TAATCGGAGA TGTGCCAATT 4 20 

GGTATCAACC ATTTATTAAT GCCTGGAAAT GTATCTGTTG TTGCACCAGC TTGTCTTTCA 4 80 

TTTTTAATTA CCCAGCTTAA TGCTTGTTCA TGCTGTTGAG TCGTATTATC GATATGGTTT 540

TGCAATGGTA TTGTTTTAAT TACTTTCGAT TGATTGATAA CGTATATAGT AATTGATTGT 600 

TGCAATAATT GATTAATTTG GTATCCAGCA TTTATTAGTA AGTTTTCAAC TGTATAAGTT 660 


TGTTTAATCG AATCATTAAA TTGAAATAAT AAATCTGTAC GATAAAGTTG CTTTTTAGTA 720 

ATGGaGTAwT GGAATTTAAT TTGTnTTAAT AAAGCACTCG TTAAAATACT TGTnAAAATG 78 0 

CTAACGATAA ATGTAATAGG ATAGTCAAAG CG 812 


(2) INFORMATION FOR SEQ ID NO: 287: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1732 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 287:
ATnnATTATT ATTACTGCTA TTTTAAATTT TAAAAAATGC TTTTGATTAT ATTCAACAnT 6 0 

TTGTATAAAA TTAAATTTGC TTTTGATTAA AGCATGAAAA TTGTAATCAA ACCATAAATT 12 0 


GTCGTATGAT GTAGTTAGAA TTTTAAAATG CAGGAGGTCa AGTATATGAC TGAAATaACA 18 0 

TTCAAAGGTG GACCAATCCA CTTAAAAGGT CAACAAATTA ATGAAGGTGA TTTTGCACCT 24 0 

GATTTTACAG TGTTAGATAA TGACTTAAAT CAAGTAACAT TAGCAGATTA TGCTGGTAAA 30 0 


AAGAAATTAA TTAgTGTGGT ACCATCAATT GATACAGGTG TTTGTGATCA GCAGACTCGC 36 0 

AAAT5*CAACT CTGATGCTTC TAAAGAAGAG GGGATTGTGC TTACAATTTC AGCAGACTTA 420 

CCATTCGCAC AAAAAAGATG GTGCGCTTCA GCAGGTTTAG ACAATGTCAT TACATTAAGT 480

GACCACCGTG ACTTATCATT TGGTGAAAAC TATGGCGTTG TTATGGAAGA ACTTCGCTTA 54 0 

TTAGCTCGTG CAGTATTTGT ATTAGATGCA GATAATAAAG TTGTTTATAA AGAAATCGTT 600 

AGTGAAGGTA CTGATTTCCC AGATTTTGAT GCTGCTTTAG CTGCATACAA AAATATTTAA 660 

TCATTAAAGA GATAAATCTT AAAATGTATA CATCGTGTCC ATCGTTGTCA ACAGCATTAA 720 

AATAGAATTG TTTTCTATGA TTGCTAAGAC CTATGGGCAC TTTTTATTGG AGAGGGACGA 780 


ATATGGCAGA ACAACAAACA ATTATGGAAC GCTTGTTTCA TACATTAGAT GAAAAAGCTA 84 0 
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CATTCCAATT TGCATATTTA AGTTTAATGC aGGAAGAAAA GATAcAAGCA AATCATCAAA 1020 

TTACACCAGA TTCAATTGGA TTGATACTAG GATTTTTAGT TGAGCGTTTT ATGAACAACC 1080 

AAGAAGAATT ACATATTGTT GATATTGCAA GTGGTGCCGG TCATTTAAGT GCTACTGTAA 114 0 

AAGAAGTGTT ACCTGraAtT GcGGTTATGc ATcATTTaAT TGaAGTTGAt CCAGTTTTAT 1200 

CACGTGTTAG TGTACATTTA GCAAACTTCT TAGAAATTCC TTTCGATGTG TATCCTCAAG 126 0 

ATGCCATCAT GCCACTACCA TTAGAAGAAG CAGATATCGT TATTGGTGAT TTTCCAGTAG 13 20 

GCTATTATCC AATTGATGAA AGAAGTAAGG AGTTTAAGCT AGGTTTTGAA GAAGGACATA 13 8 0 

GTTATTCACA TTATTTATTA ATAGAACAAG CAATAAATGC ATTAAAAGAT GCTGGATATG 144 0 

CCTTTCTAGT GGTACCAAGT AATATTTTTA CAGGTGAACA TGTAAAACAG CTTGAAAAAT 150 0 

ATATTGCAAC AGAGACAGAG ATGCAAGCAT TTTTAAATTT ACCACCAACT TTATTTAAAA 156 0 

ATGAAAAAGC GCGAAAATCT ATATTAATTT TACAAAAGAA AAAATCGGGT GaAACAAAGC 1620

CAGTTGAAGT ATTATTGGCA AATATTCCTg ATTTCcAAAA TTCCTTCACC AATTTCCAAG 1680 

GATTTATGGA CAGAGTTAAA ATCCAGTGGG ATGGGACCAC CAAATCGTCC TA 1732 
(2) INFORMATION FOR SEQ ID NO: 288:






(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2779 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 






(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 288: 

AAAAGAACTA GCTAAACGCA AGCAAGAAGC TATTAGTAGA ATTAAAGACT TTTCAAATGA 6 0 

AAAAATAAAT AGTATTCGAA ATAGTGAAAT TGGCACAGCT GATGAAAAAC AAGCAGCAAT 120 

GAATCAAATT AACGAAATTG TGCTTGAAAC AATTAGAGAT ATTAATAATG CGCATACATT 180

ACAGCAAGTT GAGGCTGCAT TGAACAATGG TATTGCTCGA ATTTCAGCAG TACAAATTGT 24 0 

AACATCTGAT CGTGCTAAAC AATCGTCAAG TACTGGAAAT GAATCTAATA GCCATTTAAC 300 

AATTGGTTAT GGAACTGCAA ATCATCCATT TAACAGTTCG ACTATTGGAC ATAAAAAGAA 360 

ACTTGATGAA GATGATGACA TTGATCCACT TCATATGCGT CACTTTAGTA ATAATTTCGG 420 

TAATGTTATT AAAAACGCTA TTGGTGTGGT GGGTATCTCT GGTTTACTAG CTAGTTTCTG 480 

GTTCTTCATT GCCAAACGTC GTCGTAAAGA AGATGAAGAG GAAGAATTAG AAATAAGAGA 54 0 

TAATAATAAA GATTCAATAA AAGAGACTTT AGACGATACA AAACATTTAC CACTTTTATT 600 
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AAATAATGGC GAGTCACTCX5 ATAAAGTTAA ACATACGCCG TTCTTCTTAC CAAAACGTCG 720 

TCGTAAAGAA GATGAAGAAG ATGTGGAAGT TACAAATGAA AACACAGATG AAAAAGTGTT 780 

GAAAGATAAC GAACATTCAC CACTCTTATT CGCAAAACGA CGCAAAGATA AAGAGGAAGA 84 0 

TGTTGAAACA ACAACTAGTA TTGAATCTAA AGATGAGGAC GTTCCTTTAT TATTGGCTAA 900 

AAAGAAAAAT CAAAAAGATA ACCAATCCAA AGACAAAAAG TCAGCATCAA AAAATACTTC 96 0 

TAAAAAGGTA GCAGCTAAAA AGAAGAAAAA GAAAGCTAAG AAAAATAAAA AATAATTTGT 102 0 

TTCTTTGATA AATAGaGGAG CACCGATTGA CATCACATCA GTCGGTGCTC CTTTTATTTA 1080 

TTCTTTTTAA TTAATTTATA CAATGCCTGT TGAGCGTGTT GATTCGCTTC TTTGTTTTGT 1140 

TCTCTCGGTA TCCATTTAAC AAATAATAAA TCAAAATCTT TTTCAAATAT TTCTATTTGA 1200 

TCAAAATAAG GTTTGAAATT TGCGTTTTTC ACATAACCAG CTTCAATGCT ATCTGCAATT 1260 

AGCTTTGAGT CTGTATATAA TAGTGCGTTT TGAACATTTA ATTCACGTGC ATGTTCTAGT 1320 

GCATAAATAC ATGCAGCCCA TTCTGCAGTG TGGTTATCCA TTTCOCCTAA CTCATGTGTA 13 SO 

TATGTATAAT GCTGCTCATC TTCTTTGATT ACAATGGCAC ATGTACTTAT GCCTGGATTT 144 0 

cCTTTCGTCG CAGCATCAAA ATTTATTTTC GCCATAATAA ACCTACTTTC TATTCAATAC 1500 

TTAGTTAAAG TTACTATTAC TGTAATACAA AATATGTTGG GTAATCCATT AAAAAACACG 1560 

CATCACTTAA ATAAGTAACA CGTGTTTAAA ATACTCGCTG ATTCAAAGAT GATTTTCTAA 1620 

TACGTAtACT GTaATATACT TCCTAAAAAA ATCATCTTCA GGCTGGGACA TAAATCAATG 1680 

TTCTATGCTC TACGATGTTA TATTGGCAGT AGTTGACTGA ACGAAAATGC GCTTGTAACA 1740 

AGCTTTTTTC AATTCTAGTC AGGGGCCCCA ACACAGAGAA TTTCGAAAAG AAATTCTACA 180 0 

GGCAATGCGA GTTGGGGTGT GGGTCCCAAC ACAOAAOATG ACGAAAAGTC AGCTTACAAT 1860 

AATCTGCaAG TTTGGGATGG GCCCCAACAA AGAGAAATTG GATTCCCAAT TTCTACAGAC 1920 

AATGCAAGTT GGGGTGGGAC GACGAAATAA ATTTTGCGAA AATATTATTT CTGTCCCACT 1980 

CCCTTAAAAC TTATTCTTTT GTGTAGTAAG TGCGTTAATA GCCTTGATCT AACTTATCAA 204 0 

TCTTACCTTT ACGATAAAAT GATTTAGCAA TATATCCAAA TGGTACATTG AAAACTGTTG 210 0 

AAGCTAATTT TAATACGTAC GTTGTAATAA ATATTTCAAa TACAAmTGTa CCAGGTAAAC 216 0 

TTCCGATAAA TGCGATAGCT ACAAATAAAG CTGTATCAAT TATTGrGCTT AAAAATGTAC 2220 

TACCATATGg CACGGATGAA AAACGTTTTa TCmGaACTAA ATACTTTTTT AATTAGTGAA 22 80 

AAGATAAATA CATCAATATG TTGACCAATA ATATATGCGA CGATTGAGCC TAAAGCAATG 234 0 

... r^nu^rmr^rn ccatatcttc tggtgcagga 2400 
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CAAACTGCTC TTTTTGCAAC TCTACGCCCA TAAATATCGT TTAATATATC TGTTGCTAAA 2 52 0 

TAAATAGAAG CAAACATGAC ATTACCTAAA GTTGCTGAAA TACCAAAGAT TTCTACAGTT 258 0 

TTAATCACTT GTATGTTGGC AATGATTGTG CCAATTGCAA CCCATGCAAT TAAACCTTGT 2 64 0 

TTACCAAAAA AGCGATACAT AAGTACCATA AGCACGAACG TTGCAATAAA CGTAACTAGT 2 700 

CCTAAAATTT CATTATACAT ATTAAAATGT CCTCCTAAAT TTTGATCATG CGGGTGTTTA 2 76 0 

GAAACCGCTC AATAAATAA 2779
(2) INFORMATION FOR SEQ ID NO: 289:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1999 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 289: 

ACTGATGTGC GTTCATCAAA AACAATATAA TCAAATTCAT TTTCATCAAA TTGCTTAAAA 60 

TTATCATCTC TAGATAATGT TTGAATGGTT GCAAATAAAT ATTTGGCATC GACATCTCTA 120 

TGTTTTCCTG TCAACAATCC AAAATCACTA TCATmTTA TAGGTAATAC TTTTTTAAAT 180 

TCTTCCTTAG CTCTATTTAA AATCCCCTCA TTATGAACAA TAAATAAAAA TTTATTAGGG 24 0 

TTTACTTCTC TAACATCTAA TGCACATAAA ATCGTTTTAC CTGTACCAGT TGCAGATATT 3 00 

ATTAACGCCT TATCTTTGGC TTTATCCCTA ATAGCTTTTA ATGACCTTAA TGCTTCTGCT 3 60 

TGCATTAAAT TGGGTACAAT TTCCACTGAT TTTTTCACCT TATCAGCTAG CAGCATTTGA 4 20 

GTTTGTTCAA CCTCCGCTAA TTTTTCTAAG GAACGGTACT CAAATGATTC TTTATATGAA 480 

TTAATCCATT GCTCAGTCAG TGGGGTACTC TTTTGCCATA ACAAGTCAAA TTCACTTTTT 54 0 

ACACTATCAA CTAAATCGCC ATTTTTCATA GTAGACAGTA AAACATTATG CTCATAATTA 600

ACCTTTAACG CATTAGATGT TAAATTAGAG CTTCCTATTA CCATAGAACT ATAATCCTTA 660 

TGCTCAAAAA TATATCCTTT GGCATGGAAT CCAGCAATAT CAGTTAATCT TACCTCTACA 720 

TTTTTTAATT TAAGTAATTC TCCATACATT TTAGGACTAT TAAACCCTAA GTAATTAGAT 780 

GTTAATATTT TCCCTTTAAC ACCCTTATTG CTTAAATCTA ATAGTTGAGC CTTTAAGCTG 84 0 

GcTAAACCGC TTTCTGTTAT AAAAGCCACA GAAAAATAAA ACGTTTCACA TTTTTGAAGT 90 0 

TCATCTATAA TTGTTGAAAG AACTTTTTCA TTTTTATTAT TTACTAAAAG CTTCGGTGTA 96 0 

TAATTCCCTT TATGAGAAAT ATGTTTGTCT ATAAACCCTT TATGTAAAGA TTGATTGAAA 1020 
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CGCTGGGGCC CAATTTAATT 
TTCAGTTAGA GTTGGTAACT 
TCCAAAATCA TATTCATGTT

CATTTCTTCT CTAATTTCTC 
ACnGCCAGGA AATTCCCACA 


AATTTTGTTA TCAGAAAAAA 
CACCCTTCAA TTTAAAATCA 
AAATCAATAT TAAAAATTAA 


ATACACATCT ATTATATAAT 
TTTGGGGCGT ATTTCTTTAT 
TATTAAGTCT AACTGAGATT
TAGAACGAGC TAATGATAGT 
AGCGAGAAGG TTTTGAAAGT 
GTCTTGGTGA AGGTCGACTT 
GTTATTTAAA TGCCCAAACA 
TTTCcAACTG AATTCGCCC 


(2) INFORMATION FOR SEQ ID NO: 290:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1933 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 290:

GATGACTTTT CCCCCTCATA ATCTTCATGG TCCAGGCGTC CATTAATGCG TCAAAGGATG 60 

GCACATTTTA CCTGGAACAA ATGATTCATA TGGTTCATAA AAATCACGCG TCGTAATATA 120 

ATCTTCTAAA TCAAATGCAT AGAAAATCAT TGGCTTTTTA AATACTGCAT ATTCATATAT 180 

TAAAGATGAA TAGTCACTAA TTAATAAATC TGTTATGAAC AGTATATCAT TAACTTCTCT 240 

AAAGTCAGAA ACGTCAACAA AATATTGTTT ATGTTTGTCT GCAATATTAA GTCTATTTTT 300 


CACAAATGGA TGCATTTTAA ATAATACAAC CGCGTTATTT mTCGCAAT ATCTTGCTAA 360 

^^^^^p^^^ ^Ar^CT'^AAA 420 



TATCAAGTTC GTTTATTGAC AACCATTCAA TACTCTTATG 1140 

CTTTGTTTAA AGTACATTTG TATGTTGTTA ACCTAACAAT 12 00 

CTGTAGTTAT AACTTTGTCT CCAACAATTA AATCACATTT 1260 

TAATCAAAGC GTCTTTTTCA GTTTCATTCT TTTCAACCTT 132 0 

TTAAAGGCAG ACTCATTTTT TCACTTCTCT GTGCACAAAG 1380 

TAATAGCTCC TACTACATTG ATTACTTTTT TCATAAGACT 1440 

TCTTAATTGT TATTCTATCA AAAATTACAA AACTATATAT 1500 

TATTTTACAT TCACATGAAC GCTCTACTCC ATGCATTTTC 1560 

ACTTGTGAAA AGTATTGTCT TGGGGCTGTG TTTTTTTACT 1620 

AATTCATTAC ATAAATGTAA GGGCTTTAGT TTTCATGTTT 1680 

TTGAAAGGAT GTTTAGCAAC AATGGATAAA GAATTATGGA 1740 

TTAGTTAAAC ATTTTTATGa GCAGCAATCT GACATTGAAC 1800 

AAATTAACAT TTGGTACTGC GGGTATACGC GGAAAATTCG 1860 

AATAAGTTTA CTATTGAAAA ATTGGCATTA GGTTTAGCGC 1920 

AACAGTCCAA CAATAGTCAT TCATTATGAT ATTAGACATC 1980 

1999 
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TTGTTTGATC TGTGTCGCAT AAGCTTCATC AAATAGTACA TCAGTACGTG GAACACCTGT 54 0 

AGGCACTACA TTTTTCTCTT TAATACCAAA TGCTTCAGCG TAGAATGGAA TATCGGTTTC 600 

AGATGATACA TAAGCTTTTG TATAGCTACG ATGATTTAAT GAATCAATAA ATGGTCCACC 660 

CTTTTTACCA GTACGACTAA AGCCAACTGT TTTAAAGGCA CCAACGGCAT GCCATACTTG 720 

AATAACTTCT TGAGAACGTC TAAAACGCAC TGTATAAATC AATGGGTGAA AGTCATCAAC 780 

AAAGATGTAG TCTGCCTTCC CAAGTAAATA TGGCAATCTA AACTTGTCGA TGATGCCACG 840 

TCTATCTGTA ATATTCGCTT TAAAAACAGT GTGAATATCA TACTTTTTAT CTAAATTTTG 900 

ACGTAACATT TCGTTATAGA TGTATTCAAA GTTTCCAGAC ATCGTTGGTC TAGAGTCTGA 960 

TGTGAACAAC ACCGTATTCC CTTTTTTCAA GTGGAAAAAT TTCGTCGTAT TAAATATCGC 1020 

TTTAAAAATA AATTGTCTTG TATTAAATGA TTGTTTGCGG AAATACTTAC GTAATTCTTT 1080 

ATATTTACGA ACGATATAAA TACTTTTAAC TTCCGGAGTC GTTACAACAA CATCAAGGAC 1140 

AAATTCATTA ACATCGCTAG AAATTTCAGG TGTAACAGTA TAAACCGTTT TCTTCGAAAT 1200 

GCCGCCTTTT CTAAATTCTT TTAGGTAAGT CTGCAATAAG AAATTGATTT TACCATTTTG 1260 

TGTTTCTAAT TCGTTGTATT CTTCTTCTTG TTCTGGCTTT AGATTTTGAT ATGCATCATT 1320

AATCACATCT GGGTTTAACT GTGCAATATA ATCAAGTTCT TGCTCATTCA CTAATAAGTA 1380 

CTTATCTTCA GGTAAGTAAT AACCATTATC TAAGATAGCT ACATTGAAAC GACAAACGAA 1440 

TTGATTCCCA TCTATTTTGA CATCATTCGC CTTCATTGTA CGTGTCTCAG TTAAATTTCT 1500 

TAATACAAAA TTACTATCTT CTAAATCTAG GTTTTCACTA TGTCCTTCAA CGAATAACTG 1560 

AACACGTTCC CAATAGATTT TATCTATATA TATCTTACTT TTAACCAACG TTAATTCATC 1620 

CTTTTCTATT TACATAATCC ATTTTAATAC TGTTTTACCC CAAGATGTAG ACAGGTCTGC 1680 

TTCAAAAGCT TCTGTAAGAT CATTAATTGT TGcAATTTCA AATTCTTGAC CTTTTAACAA 1740 

CGCTAATTTA nCTACAATAT CTGGGTATTG AATGTATAAG TCTACAACAT CTTGGAAATC 1800 

TTTTGAACCA CTTCGACTAC TACCAATCAA CGTTAGTCCT TTTTCCAATA CTAGACGTGT 1860 

ATTAACTTCT ACTGGGAACT CACTTACACC TAACAGTnCA ATGCTTCCTT CTGGTGAAAT 1920 

GTAATCGATC ATT 1933 

(2) INFORMATION FOR SEQ ID NO: 291: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 2049 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 291: 
.GTnCGGnCA GATATATTGG TGGTCTTTAG TAAGTGTATC AAATTCATCA GATGTCAAGG 
GCATOrrATC ACCTCCTTAG GTTGATAACA aCATTATACa CGaAAGGAGC ATAAaCAAaT 
GAACACAaGA TCAGAAGOAT TGCGTATAGO CGTCCCACAA GrTTCTAGCA AAGCTGATGC 
TTCTTCATCC TATTTAACGG AAAAGGAACG TAACTTAGGA GCGGAAATAT TAGAA...AT 
TAAAAAAAGT GATTACAGCT ACTTAGAAAT AAACAAAGTT TTCTATGCAT TAGATAGAGA 
ACrrCAATAC AGOGCGAATA ATAACAAACT rrAaCATTTA TCTAAAGGAG TGATAGAGAT 
GCCAAAAATC ATAATACCAC CAACACCAGA AAACACATAT CGAGGCGAAG AAAAATTrGT 
GAAAAAGTTA TACGCAACAC CTACACAAAT CCATCAATTG rTTGGAGTAT GTAGAAGTAC 
AGTATACAAC TGCTTCAAAT ATTACCGTGA AGATAATITA GGTGTAGAAA ATTTATACAT 
TGATTATTCA GCAACGGGAA CATTGAnAA TATTTCTAAA TTAGAAGAGT ATTTGATCAG 
^AAGCATAAA AAAT.3GTATT AGGAGGATTA TCAAATGAGC GACACATATA AAAGCTACCT 
ATTAGCAGTG rTGTGCrrCA CGGTCTTAGC GAnOTACTC ATGCCATTGC IGTACTTCAC 
TACAGCATGG TCAATTCCAG GATTCGCAAG TATAGTCACA TTCATATTTT ATAAGGAATA 
CTTTTATGAA GAATAAAAAA ACTGCTACTT GTTGGAGCAA GTAAC^TXJC AAGATGAGCA 
ATTGTCTTAA ATAATTATAT AAGGAGTTAT TAATATGACC TTACAACAAA AAATACTATC 
ACATTTTGCA ACATATGACA ATTTCAATTC TGATGATCTT GTTGAAACTT TTGGGATATC 
TAAAACACAT GCAAAATCCA CACTTTCAAA ACTTAAGAAA AAAGGAAAGA TTGCAATGGA 
AAGTTGGGGT GTCTGGCGTG TTATTGAATC GCAATTGCAT TTAAGTCTAG TCGAACGTAA 
AAAAGAAATT TTAGAAGAAC AATI^AATT GTTAGCAAGA TTAAAl^AAC AAAGTGATGA 

ccctJIgagaa atagaagaac gtatcaagtt aatgattcgt ctagctaacc AArrrrAAGG 

AGGAGTTAAT CAATGGCAGT ATTAGAAGGT ArrTTTGAAG AATTAAAACT ATTAAATAAG 
AACTTACGTG TGriAAACAC TGAACTATCA ACTCTAGATT CATCAATTGT ACAAGAGAAA 
GTTAAAGAAG CACCAATGCC AAAAGAAGAA ACAGCTCAAC TGGAATCAAT TGAAGAAGTT 
.^.GGAAACrr CTGCTCATTT GACTAAAGAT TAT^nTTAT CAGTAGGAAA AGAGTrCCTT 
;^GCAG ACACrrCTGA TAAGAAAGAA TTTAGAAATA AACTTAACGA ACTI^TGCG 
GATAAGCTAT CTACTATCAA AGAAGAGCAT TATCAAAAAA TTGTraArrr TATGAATGCG 
AGAATAAATG CATGAAGCTA GATCACTCAA ATAGAGCTCA TGCAAAGCTT AGTGCAAGTG 

7\?vcTnAAnn'^ ATTCrAGATA 
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GTCTTAAATA TGAAGGCCTA ACACAGTTTG AGTTTAATAA AGCTTTTCAA AATTATAAGC 18CC 

GAAATCAATA TTACAGTGAA GAGTTGCGCG AATATGTTGa AGAGTACGTA GCTAATGTAG 1860 

AAGAAAAGTA TAACGAGGCT TTGaGTAGAG ATGACGATGT AATAGCTTTA TTTGAAACAA 1920

AATTGGATyT AGGTAAATAC GTCCCTGAAT CTTTTGGTAy TGGTGATGTC AtTATATTTT 1980 

CAGGTGGTGT ACTTGAAATT ATTGACCTTA AATACGGTAA AGGCATTGAA GTTTCAGCTA 2040

TAGATAATC 2049
(2) INFORMATION FOR SEQ ID NO: 292: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 942 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 292:

ATGATGTTTC TATATTCGTA TTAGGAAAAC CTGTTGGTAT ThCPiACAPiAC GCCCTAAAAT 6 0 

TACAATACCG CTGCTCCTAT ACCAATTGCA ACAACAGTTT TAACTGAAAT ATCTTGTTTT 120 

TTCATCTTCA TTACTCCTtt ACATAAAAAA TTCATTATAT TGATGGTGCT TTAGATAAAT 180 

GAATCGTCCA ATCATTTCCA GTACCAATAT GATATAAATC TGAAAATGAG TCTTGATTGA 24 0 

CTGCTACACC AATATTTACT AGCGAGTTAA CATACACAAG AGGTTCACCC ACATTAACAT 3 00 

CTGCAAACGA TCGCGCAAAT TTAATAATAT TTTGATAGAC TTTCTTATCT TGATGATAAA 3 60 

TTGTTACCAC CAAATTATTA CCATGAACAA TTTCCAAGGA TTTTAAGAAT GCCAATGGAA 420 

TATTTGTCCA TAATGACCCA AATCTGATAT CTAAAATATC AATGCTTCCT GTAACAGAAT 4 80 

CCTC^TTTTT TGTCACTTCT CTTATTTCTA ATGCCTCAAT ACTATCAACA TTAAITGCCT 54 0 

GACCGAGACG TTCAAACGCT ATCTTATTTG CAGCTAATCT CGCACCATTG TATGCATAAA 600

CATCTCTACC ATGAAAAGTA TGACTTTCTT CCGAATGAGG CAATCGGCTT TTCACTTCAT 660 

CAATTTCGAT AACTTTTTTA ATACCTTCGT AATGTTTAAT ATGACTTAAA GAACCATTAT 72 0 

CAGGTGTAAT AATGTAATGA CCTGAATATG TTAAGCAAGC AATGTCCGCC TATCACTACC 78 0 

TACACCCGGG TCTACCACTG ATACAAAraC TGTGCCTTTA GGCCAGTATT TTACAGTTTG 84 0 

ATATAAACGA TATGACGCTA CCCAAATGTC ATACGGTGGT ATATCATGCG TTAAGTTTTC 900 

AACACGTATA TCATCATTAC AGTATATGCA ACTCCATACA TT 942
(2) INFORMATION FOR SEQ ID NO: 293: 
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(A) LENGTH: 1268 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 293:

TGTAATAAAA TTTTATGnAA CATGCTGnGA TGCTACCATG AACCTTCTTC ATTTCTTTAT 60

GTGAGATTGT GaAATTAATC AAATAATAAT ACGGTGGATA CTTTCCTAAT TTACGATATT 120 

CCATTTCCTG ACGATAAAAT GTTAAATAAT CATTTTTTTG AACATCCAAT ATTGAATAAT 180

GATCTGGATT ATACGTTTGA ATGATGACTT GACCTGCCTT TTCATGACGA CCAGCTCTAC 24 0 

CAGCCACTTG CGTTAATAGT TGATAAGTAC GTTCGCTCGC CCGAAAATCA GGTAAATTTA 3 00 

ACATTGTATC TGCATTCAGC ACACCAACTA AAGTAATATT TGGATAATCT AATCCTTTCG 3 60 

LAATCATCTG AGTACCTAGT AAAATGTCAC CGTTACCTTT TTCGAATTCA GTCAATAACT 420 

TTTCATGTGC ACCTTTCTTT GAGGTTGTAT CTACATCCAT CCTAATTATG CGCGCATCTT 4 80 

CAAATTCTTG TTGCAATAGT TCTTCAACTT TCTGAGTACC AGTACCTAGT TGTCGAATGT 54 0 

GTTCACTCTC ACAATTTGGA CATTGATTCG GTGGCGTCTC TTGGTAACCA CAATAGTGAC 600 

ATTTTAATAA GTCTGTCGTT TTATGATACG TTAATGAAAT ATCACAGTTT GGACATTGCG 660 

GTACATATCC ACAATCCCGA CATAACATAA ACGATGCATA ACCACGTCGA TTTAAAAATA 72 0 

AAACAACTTG TTCCTGTCGA TCTAATCTTA ATTGTATGGC TTCACGTAAA TCTTTTGAAA 78 0 

ACATTGACCG ATTACCTTCA CTCAATTCTT CACGCATGTC TACTATATCA ATTTCAGGTA 840 

AAGCTTGTTG GTTCACTCTG TTTGGTAATG ATAGCAAATG ATAAACGCCT TTTTCAGCTC 900

GTGCATAACT TTCAAGACAT GGTGTTGCAC TTCCTAAAAT GACTGGACAG TGATGATATT 960 

CAcTS'CGCCA TTGGGCAATT TCTCTAGCGT GATATCTCGG ATAATCTTCT TGTTTATATG 102 0 

TAGATTCATG TTCTTCATCA ATGATGATTA ACCCTAAATT TTTGAAAGGT GCGAACACAC 10 80 

TTGACCTTGC ACCAACACTT ACTCTCGCAC GACCATCCCT AATTTTTTGC CATTCATCAT 114 0 

AACGTTCCCC ATTAGATAAG CCAGAATGTA ATACAGCAAC GTCATCACCA AATCGACGTT 1200 

TGAAGCGTAA AACCATTTGC GGTGTTAGAG CGATTTCAGG AACTAACATC ATCGCCTGTT 1260

TTCCTTGG 1268
(2) INFORMATION FOR SEQ ID NO: 294: 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 294: 

TACCACCAAA TAATATATTA GCTGGCATTT TAATAACATT TAnATTTGTC ATGATATCAT 60

CAATAAAATG TTGAAACTTC GTAATTTCAC CTTCATAATC ATCAATTGCT GCTAATTGCG 120 

CACTCGATGC TTGCTGATCT AAATTTAAAA TATTCGACAT GCGTTGACTA TAATAAACTA 18 0 

AATGTTCTAT TAAGCCATCG TCACTCTTTT CCTTTGGTGC TGACATGACA GCGATACGTT 24 0 

TCAAAGGATA GTGTTGCGCC AATTTTAATG TCATTAATCC ACCTAAAGAC ACACCCGTTG 3 00 

CACTGATAGA TTCATAACCT TCATTGACTA AAAATTGGTA AGCTTTCTCA ACTTCTTCCC 360 

ACCAATCATC TACATTATAT GTCATGAAAT CTTTCAACAA TAAACCATGA CCTGGATAAT 420 

TCGGTGCATA ACAACTAAAT CCTTGGTCAT TTAACTCAGC TGCAAGATGC TTCACATCCC 4 80 

GATTTGTACC TGTAAATGAA TGTAATAATA ATATCGCATG TCCATTTGTG CCTTTTAAAT 54 0 

ACGtGGACTC GGTGTTTTAA TTCTCATTTT TCtaTATAcC TCCACTAtGT CTAAAGakGT 600 

TkGCTAAACG CGTTGtCGTC GATGATTAA 629
(2) INFORMATION FOR SEQ ID NO: 295:
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2817 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 295:

TATGAAAGTA ATGAATGGTA ATATTATTAA ATTTGATGGA AAAGTAGATA TTGATAATGC 60 

AGA'H^TATC GGTTTTTTAA TTGAGCATCC TAAATTATAT GATAATAAAT CAGGATTGTA 12 0 

TAACTTGAAA TTATTTGCAC AAGTATTAGG TAAGGGTTTT GATAAAGCAT ACACAGACAA 180 

AATTATAGAT GCATTTGGTA TGAGACCTTA TATTAAAAAG AAAGTTAAGA AATATTCAAT 24 0 

GGGGATGAAG CAAAAGTTAG CAATTGCAGT ATCTTTArTG AATAAACCTA AATTTTTAAT 300 

CTTGGATGAG CCTACAAATG GkATGGATCC AGATGGCTCa ATTGATGTGC TGACTACAAT 360

TAAGTCTTTA GkAAATGaAC TTGATATGAG AATTCtAATA TCAAGTCATA AGTTAGAAGA 420 

TATTGAATTA ATTTGTGATA GAGCTGTATT TTTAAGAGAC GGnCATTTTG TTCAAGATGT 480

AAACATGGAG GAAGGTGTTG CATCTGACAC AACGATAGTT ACTGTTGATC ATAAAGACTT 54 0 

TGATAGAACT GAAAAATATC TTGCAGAGCA TTTCCAATTA CAAAATGTCG ACAAAGCAGA 60 0 
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ATTAGATATT TATCCGAAAT ATATTGAAAC ACGTAAAAGT TCATTGCGTG ATACGTACTT 720 

CAATATAAAT CAAAGAGGTG ATAAATAATG AGAATTTTAA ATTTAGTTAA GTATGATTTT 7 80 

TATAGTATAT TTAAaArTCC TTTAACATAT TTAGCGaTAC TAGTCGTATC TAGTTTGATT 840

GCAACTCAAA GTATACTTAT GGCAAATTCG ATGGATAACC CGAAACATAT TATTGTCTAT 90 0 

GGATCTGTAT TTGCTGCAGC AAAATGGTTA TTGTTAATAA TTGGATTAAT GTTTGTTGTT 96 0 

AAGACAATTA CGCGTGATTT TTCACAAGGT ACAaTTCAAC TATATATGAG TAAAGTTAAA 102 0 

ACACGCGTTG GAtACATTAT TTCGAAAACA ATTTCAATTa TTTTAATTTC AATATTATTT 10 8 0 

GCATTAATTC ATTATGTGAT TTTGATTGTT GTGCAGGCAT CTAGTAATGG AAAAAATTTG 114 0 

GCGTTTTCTA AATATGTAGA TAATTTATGG TTCTTCCTAA TCTTTTTACT ATTCTTTGGC 1200 

TTGTTTTTAT TCTTAATCAC ACTTGCATCA CAAAAAACAG CAATGATATT TTCATTAGGT 126 0 

GTATTTTTAG TACTCATTGT ACCGTTTATT AAACCTTTTA TTACATTTAT CCCAAGATAC 1320

GGTGAAAAAG TTTTAGATGC TTTTGATTAT ATCCCTTTTG CTTACTTAAC TGATAAAATG 13 80 

ATTAGCTCTA ACTTTGATTT TAGCAATTGG CAATGGGTAA TTTCATTAGG TTCTATAGTG 144 0 

ATATTCTTCA TTTTGAATAT CTTATATGTC GCTAAAAAAG ACATyTAATA AAAATAATTT 150 0 

TGAGGTTGGG AATTTTAAAT TTTCCCAACC TCAAAGTTTG TCTTATTGTA AATTTATTTA 1560 

TTTTCTAATT TATTTAGGAT GGAATTATAA ACTGCTTTCC AAAATGAAGC GTCAGTTTTA 1620 

TAGCGGTTTG ATATAACTAA GTGTGTTTCT TTTTCTAAAT CTGCATAGTC TGGATGATCT 16 80 

TTGCTCGGTA ATTTATCAGC ACGAACATCA GTTACAAATT TTTGGACTTC ATTTGCTCTT 174 0 

GGTCCCCAAA CTGTTTCTTG TTCGAATTGA TCATTCAAGA ATACGAAGAT AGGAATTGCA 18 00 

CGTGATTTAC CATTTGTTAA ATATTGATCG ATCAGTTTTG TATCATCATC TCTATGGAAC 1860 

ACGCGTACTT CTAAAriTAA TGCTTCACTG ATGTGTTTTA GAATTGGGAG ATTCATCATT 1920 

GCATCTCCAC ACCAGTCTTC AGTAATTACT AATACTTTAG AATAATTCAT CTCTTTTATT 1980

TTTTTGATGC GTGAATCATC TTCTGGTAAC TCAAATGATT GATAGATACT GAGAACGGTA 204 0 

TCTTGATTTG TCTTCATTCC ATCAATGTAT TCATTTAAGG GTTGGCTATT TTTGAAATAA 2100 

GTTTCTAAAT TTGTCATTGT AAAAACCTCC TTTAGCATTT ACAACATTAT ACCAATTTAT 2160

AGTAATAAAA GGTAATGAAA TAAATTAATT GCAAATTCTT TGTTAATTTT TGTTAAGGAT 2220 

GAAACGGGAA GCACCTTATG CTATATTTAA ATAAGTACAA AGAAAGGGTG ACATCAGTGC 2280 

GTATTCAAAA TCGCTGGGTT GTGTTTATAT TATTTTTAAT CTGTTCTTTT GGTGTATTAA 2 34 0 

' — - nr, ^, ^ "^GT^tGATTT GTCTAATCTT GAAATAAACG 2400 
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TTGATCGTTT TAAATTTTAT AACAGTPiAAG CTCACCCTGA TCTTACCGTT AAAGTGAGAG 2 52 0 

AAAAGGATAA CATCGTTAAG GGGATAATAT TAGTAAGAGA TGAAAAGATA CATACTAATT 2 580 

TTGAtGGGGG AATTGGTTCG CCGATAAATA ACGCgATTGA AAATCTTGGa TTCgGATATA 264 0 

AAAGrACaAA AGTTGGcAAT GrtTkCtCAT CgGTAAAGTA TATTGATAGA GATAACCATT 2 700 

TAAAATTAAA CTTACTTTAT CAAGATTTAG AAATTAAACG TATTGAATTT TTTAGTAAAT 276 0 

AGCTTTAGGT CTTAAAGTTw TAAAAAACGA ATGAaTAATT TTATTGGGAT GAGTGAC 2817
(2) INFORMATION FOR SEQ ID NO: 296: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1607 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 296: 

TCTGTTAAAA TGATTTTTCT TTTAnAAAGG CCGnAAATCA ATGTTCGATT nTTATTTGCA 6 0 

TTATGGTCTC GATATTGGTA GAATATCAAA TGGTTAAATG AGAAAAACTT GGAGGTGCTC 120 

ACATGTCATC AATCGTAGTA GTTGGGACAC AATGGGGAGA CGAAGGAAAA GGAAAAATAA 180 

CGGATTTCTT GGcAGAACAG TCAGATGTTA TCGCGCGTTT TTCAGGTGGT AATAATGCAG 24 0 

GCCATACCAT TCAATTTGGC GGAGAAACAT ATAAATTACA TTTAGTACCA TCTGGTATCT 3 00 

TTTACAAAGA CAAATTAGCG GTAATCGGTA ACGGAGTCGT TGTTGATCCA GTTGCACTAT 36 0 

TGAAAGAATT AGACGGATTA AATGAACGTG GCATTCCTAC AAGTAATTTA CGTATATCTA 42 0 

ATCGTGCGCA AGTGATTTTA CCATATCACT TAGCACAAGA TGAATATGAA GAACGTTTAC 4 80 

gTGGtGACAA TAAGATTGGT ACAACTAAAA AAGGTATCGG TCCAGCATAT GTAGACAAAG 54 0 

TTCAACGTAT CGGTATTCGT ATGGCAGATT TACTTGAAAA AGAAACATTC GAAAGATTAT 600 

TAAAATCAAA CATTGAATAT AAACAAGCAT ATTTCAAAGG TATGTTTAAC GAAACATGTC 66 0 

CATCATTTGA TGATATCTTT GAAGAATATT ATGCAGCAGG TCAACGTCTA AAAGAATTTG 720 

TAACAGACAC ATCAAAAATC TTAGACGATG CATTTGTAGC AGATGAAAAG GTACTTTTCG 780

AAGGTGCGCA AGGTGTAATG TTAGATATCG ACCATGGTAC ATATCCATTC GTTACATCAA 84 0 

GTAATCCAAT TGCAGGTAAC GTTACTGTTG GTACAGGTGT AGGTCCTACA TTCGTTTCAA 900 

AGGTAATTGG TGTATGTAAA GCTTATACAT CACGTGTTGG TGATGGTCCA TTCCCTACTG 960 

AATTATTCGA TGAAGATGGA CATCATATTA GAGAGGTTGG TCGTGAATAC GGTACAACAA 1020 
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TAAGTGGTAT TACAGATTTA TCTATTAACT CAATCGATGT TTTAACAGGC CTAGACACAG 1140 

TGAAAATCTG TACAGCTTAT GAATTAGACG GTAAAGAAAT TACTGAGTAC CCAGCAAACT 1200 

TAGATCAATT AAAACGTTGT AAACCAATCT TTGAAGAGTT ACCAGGTTGG ACAGAAGACG 1260

TAACAAATGT GCGTACTTTA GAAGAATTAC CTGAAAATGC ACGTAAATAT TTAGAGCGTA 1320 

TTTCAGAATT ATGTAATGTA CAAATTTCTA TCTTCTCAGT TGGTCCAGAT AGAGAACAAA 1380

CAAACCTATT AAAAGAATTG TGGTAGAACT TTATATAAGT CATACACAAT GATTATAAAT 144 0 

ACATGAGCCT TCTATCTTTA TTGGTAGGAG GCTTTTGTTA TGCTTGCTTC TGTATCGATT 1500 

CGATTATTTA GATAAAAAAT ACTAACGTAA AGGCGATATT TGCTAGTCAT AATTTAGAAG 156 0 

rTTAgATGAt AtTtAACGAA AAtTAAgATG anATACtTGA ATGGTAA 1607 
(2) INFORMATION FOR SEQ ID NO: 297: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3055 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 297: 

TTAGAAGTAA GCACTTTAAT ATCTTTACCT ACCCATGTAC CAACACCTTC TTTAGGCTTT 60 

GGATTTTCAG CATGGTTATT TGATTTATTC ACCTGTTCAG ATCTATTTAC TTTATTACCT 120

TTATTCGGGT TCTCTAATAC ATCAAATTTT AATCTCGGTG AATAAAAAAG ATATATTAAA 180

AATCCAACTA AAAATAATAC ACCGACGACT CTTATAATTA ACTTTTTCAT CAATCAACCA 240 

CCTAAAAAGT ATTAATACTA TTGTAAAAAA CAACACATTA ATTAGCAAAT TTTCAACACT 300 

GACATAACTG TGTCGTTTCG ATAAAACAAA ATCATCTTCA GGCATAATTT TAACGTCATA 3 60 

AAGATTTTCC CTACACTCTA TATCATAACC TATCTTTATG TTTTCAGGTT GAATTTCATT 42 0 

ATTAAGATTA AAATATGTAT AAAAAAATGG ACAGTTAAGG TATCAAATGA ATACCATCAA 480
CTGTCCGACT ATTCTTCATC AAAAAACCTG ATAAAACAAA TTGCCTTATC AGATTAGTAT 540
CATTTGTATA AGCATATTAA TGACCAAATG TTGCTTTAAT CAGTGATGTC GTTTCTCCAC 600
CTGGATATAA TACATATAAT AATAAGTAAA CTGCTACACC TGTAATTGCA GTACAGAACC 660
ATATAACTGA AGCGAATGGC CCGAATTTGC GGTGTACATT ATATTTATCT TTAAATGCAG 720
TAATAATTTG AACTAGGCCT AGAATACCAC CAATTGTTGC TAAATTAATG TGGAAAAATA 780
nn-RA^A-^^^ TTAATTGATG CTGGTCCGCC AAATGCTGTA TTACCGATAA 840 



EP0 786 519 A2 



TAACATTTTT GTGTTTATTT ATTTCCCTTT TCCAAATAAG TCTCCAACCA ATGGCAATTA 960 

AAATTGCACT AATGACAATA CATGTCGTAC TAATCGTTGG TAAAATTGGA ACGCCCATAT 102 0 

TTTTCATCCT AACTTAATTA ATCTAGATCA AAGTAAGTAA TGAAACAATC ACAGCTAACA 10 8 0 

CGAAAAAGAT CACTAAATAA TTTAGTGAAT ATATAAACAT TTGTGTTGCC CATTTTGTTT 114 0 

GATCTGAATT TTTCTTAAAT GTTGTTAAAC CTAATGCAAT CCATCCTAAA TTTAATAAGG 12 00 

TrGCTAACAC TaCGAATACG ACACCTAAAT TTATTAGTAA TAAAGGTACT GGCAATAAAA 12 60 

TAATCAACCA GATAAACATA CTGACACGTG TACGTTTAAA GCCCTTAACT GATGGTAACA 1320 

TTGGAATATT TGCAAGTGCA TATTCATCTT TACGTTTAAT AGCTAAGGcA TAAAAATGAA 13 80 

TTGGtTGCCA ACmAAATACA ACTAAAAACA GCGCAATCGC TGTTAAACTA ATTTGTCCTT 1440 

CAATTGCAAC CCATCCAATT AGTGGTGGTA CTGCTCCAGG AAAACTCCCA ATCACTGTGT 1500 

TCCATGTTGT ATGTCTTTTA GACCATATTG AGTAATAAGA CACATAACCT ACAATCCCCA 1560

TAAGACCAAG TACGCCTGAT GGTATATTCA ATAAAAACAA ACAAATTTCT CCAACTAACA 162 0 

TCATACCAAA ACTTAATAGT AATAAATTTT GATCTGTAAT TCTATTATTT ACAGTTGGTC 168 0 

TATTTTGTTT ACTAGGCATA ATACGATCAA TATCTTGGTC GTAATAATTA TTTAACGCAC 1740

ATGCGCCACC CATAATTAAA GTAGATCCAA ATAGCATTAA TAAAATTTGA GGTATTGATG 18 00 

ATAAGAAGGA ATGATTTGTC ATTACAACTG CTAGCCATGC GCcCGCAAAA GCCGGAATTA 186 0 

AGTTACCTTG AACAAGTCCC ATTTTAATTA TCTGTTGCAA TTCTTTGAAG TTAACTCTGC 1920

TAATATTTTG TGACmAAGTA TGCTCTTTGC TCATAATCCC CCTCCTTAAA TTTGTTCATA 198 0 

TAAGATTATG ATATCTTAGA TTGCATAAAA AGACTAGGTT TAATAAAATT AAATTGTGAC 2040 

AAATTAACGA CAAGAGAAAA TGTCAATTTT GTGACACAAA TAACATTTAA TTTATTGCTA 2100 

TAATGTATAT GTTAGAAAAT TTTAATAAGT AGAATCATGC ATCTAAAAGA GATTAATATT 216 0 

TAAGCTTCAA ATTTGAGTAA ACGTGGATTA CATAATTATC CCAATAAAAA AATCATTACG 2 22 0 

ATTAAGTTCT TTTTATGTCG TCCACATACA ATACTTGTAA AATTAAATCA TATTTCCTGC 22 80 

GTTGGATCCC ATCTTTTCAT ATCCTACAAT CAGGTCTATT TATAGTATCA TCTCAAATCC 2340 

GGCTATTAAT TCTAATTCTC AGTGATGCGT TTTTTTATGA TGGGGTGTAT AAATTGTTTG 2400

GCAAAAAGAA TTTAAAATGG TTAGGTGTCG TAGCAACGTT AATGATGACA TTTGTACAAC 24 60 

TTGGTGGAGC CTTAGTTACC AAAACCGGAT CAGCTGATGG TTGTGGTTCT TCTTGGCCAC 2520 

TATGTCATGG TGCGTTGATT CCAGAATTCT TTCCTATTGA TACGATTATT GAGTTAAGTC 2580 

ATAGAGCCGT TTCAGCTTTG TCTTTATTAA TGGTCTTATG GTTAGTTATC ACTGCATGGA 264 0 
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TATTGCAAGC ATTAATCGGA GCTGCTGCTG TTATTTGGCA ACAAAACGAT TACGTTTTAG 2 760 

cATTGCACTT TGGTATATCA TTAATCAGTT TCTCATCTGT ATTTTTAATA ACATTGATTA 2820 

TTTTCTCTAT AGATCAAAAA TATGAAGCTG ACGAATTATA TATCAAAAAG CCATTAAGAC 2 8 80 

GTTTAACATG GTTAATGGCA ATCATCATTT ATTGTGGTGT TTATACTGGT GCACTAGTGa 2 94 0 

GACATGCGGA TGCAAGTTTA GCATATGGTG GTTGGCCATT GCCATTACCA CGATCTTGTA 3000

CCACATTCAG GAACAAGATT GGGTTCAACT CACGCATCGT ATCAnGGTCn nTTAA 3055 
(2) INFORMATION FOR SEQ ID NO: 298: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 748 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 298: 
TTCTTATTTA AAGAAGTCAT TTTTAGAAAT TGTTGAGACT TTAAAAAATG ATCCGTATAA 60 
AATAACACAA TCTTTTGAAA AATTAGAGCC TAAATATTTA GAGCGATATT CAAGAAGAAT 12 0 

TAACCATCAG CACAGGGTCG TCTATACCGT AGATGATCGA AATAAAGAAG TATTAATACT 180 
ATCGGCATGG TCACATTATG ATTAATGAAT ATTCAATATC TGAATAACTT TAATGATAAG 24 0 

TTAATTAAAG AAACTAGTAT TTAAGTGTAG GGAAAATAGC GACGTTAATG CGTTGTTATC 300 
TCTACACTTT TTAATTTTAT AATAGCGCAA GACTAAACAG ATTGAAATTA GTAACAATAA 360 
AAGAATAACG TATTATAATA AGGAATTTTA AATTGTGACT TTTTCGGAAT ATTAAATTTT 42 0 

AGAAATATGA GGTTTTTAAG CGGATTCCTC ACAAAATTTT AAAAATATTT AAGCCTGAAA 480 

ATGATAAAGC GGTAGGQAAC GTTTTTCTGA AaGTTAGTGA TACAATAGTT TTAAGTTGAA 54 0 

ATACAGGAGG ATGAATAACA TGAATCAGTC AGTCAAATTA CTTAAACATT TAACAGATGT 600 

AAACGGCATT GCTGGTTATG AAATGCAAGT TAAAGAAGCA ATGCGTaACT ATATAGAGCC 6 60 

TGTCAGTGAT CaAATTATTG AAGATAACTT GGGTGGCATT TTTGGAAAGA AAAATGCTGA 720 

GAATGGTCAA TACTCAATTA TGGAnTTC 748

(2) INFORMATION FOR SEQ ID NO: 299: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4718 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 299: 

ACCTCCGAAT AATTGATTCC ATTAACTTTT TTTGTGaAAA AtTTAAACAG GCGAAGTCTT 6 0 

CAATAAGTGA AGATGTCCAA ATTATAAAAA ATACATTCCA AAAAGAAAAG TTAGGTACAG 12 0 

TAATTACtAC ycTGGCGCAA GTGGTGGTGT TACGTATAAA CCAATGATGA GTAAAGAAGA 18 0 

GGCGACTGAA GTTGTTAATG AGGTCATTAC TCTATTAGAA GAGAAAGAAC GTTTGTTAcC 24 0 

TGGCGGATAT TTATTTTTAT CAGATTTGGT AGGTAATCCA TCGCTACTAA ACAAAGTTGG 3 00 

TAAGTTAATT GCCAGTATTT ACATGGAAGA AAAATTAGAT GCTGTTGTTA CCATTGCGAC 3 60 

AAAAGGTATT TCATTGGCAA ATGCGGTTGC TAATATTTTA AATTTACCAG TAGTAGTGAT 4 20 

TAGAAAAGAC AACAAGGTGA CTGAAGGTTC TACAGTTTCA ATTAATTACG TTTCAGGATC 4 80 

TTCAAGAAAA ATAGAAACAA TGGTACTTTC GAAGAGAACT TTAGCAGAAA ATTCAAATGT 540 

TTTAGTTGTC GATGATTTTA TGAGGGCTGG TGGCTCTATT AATGGTGTTA TGAATTTAAT 600

GAATGAGTTT AAAGCCCATG TAAAAGGGGT ATCAGTACTT GTAGAATCAA AAGAAGTTAA 660 

ACAAAGATTG ATTGAAGATT ATACTTCCTT AGTGAAATTA TCTGATGTAG ATGAATATAA 720 

TCaAGAGTTT AACGTAGAAC CTGGCAACAG TTTATCTAAG TTTTCATAAA AGGAGTTTTA 780 

GTATTATGAA AATCATTAAC ACAACAAGAT TACCGGAAGC ACTTGGACCA TATTCGCATG 84 0 

CAACAGTTGT GAATGGTATG GTTTATACTT CTGGTCAGAT TCCATTGAAT ATTGATGGAC 900 

ATATCGTAAG CGCTGATGTT CAAGCACAGA CAAAACAAGT TTTAGAAAAT TTAAAGGTTG 960 

TTTTGGAAGA AGCAGGATCT GATTTGAATT CTGTTGCGAA AGCGACCATT TTCATTAAAG 102 0 

ATATGAATGA TTTCCAAAAA ATAAATGAAG TGTATGGTCA ATATTTTAAT GAACACAAGC 1080 

CAGCGCGTAG TTGTGTAQAG GTTGCGCGTT TGCCAAAAGA TGTGAAAGTA GAAATTGAAT 114 0 

TAG'»AGTAA AATTAAGGAA TTATAATTTT CGATTAATAT GTTTAATCAA GCTTCTAAAT 1200 

AAAACAGAGA GATATATACT ATAGGGGGGC TCACTACATG AAAGTGACAG ATGTAAGACT 12 60 

TAGAAAAATA CAAACAGATG GACGAATGAA AGCACTCGTT TCCATTACAT TAGATGAAGC 13 20 

TTTCGTAATT CATGATTTAC GTGTAATTGA AGGAAACTCT GGCTTGTTCG TTGCAATGCC 13 80 

AAGTAAACGT ACACCAGATG GTGAATTCCG CGACATCGcg CATCCTATTA ATTCAGATAT 1440

GAGACAAGAA ATTCAAGATG CAGTGATGAA AGTATATOAT GAAACAGATG AAGTAGTACC 1500 

AGATAAAAAC GCTACATCAG AAGATTCAGA AGAAGCTTAA TCAATTTTAT ATTTAGCGAT 1560 

GTAATACATT TGCAATAAGT TGATTTGATA CTGTCGATAA AGCATAAAGC TTTGTCGGCA 162 0 

GTTTTTTTAG TTTGTATTAA TGTTTTTTTA TTTTTAATGA AAGGCTAATA AATATATACG 16 8 0 
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15 



TGaTGCTCGT 
CAATGCCTTT 
AATTTAATGG 
CACAAGAATG 
CGAACATGTA 
ACATGGTGCT 
GGAACAACTC 
GGTACGACAA 
TTGATTGCGC 
CAACCATATG 
GAAGAGAAAG 
GCGTTTAATA 
GGTGAATATT 
GTCTATCGTA 
CAGGCTGAGA 
ATCATCGATC 
ATTGAACCAG 
CAGTACTCTG 
GTTAATGATG 
GGCGCGCAAT 
CTTAAAGATG 
CGTACTAATA 
ACTATCGTCG 
ATTGGTGATG 
AGTTTAGCTG 
ACGTATTTAA 
GATTTAATGA 
AAATAATTGG 



ATTTTTGAAG 
TATATGTTGA 
GGGTTAATGT 
AAATCTAAAA 
TTGGAAAGTG 
GAAAGTGTAA 
GGTACTGCGC 
TCGTTGTATG 
ATCACGAGGA 
GATACGGAAG 
ATGCAACGCA 
ATAAAACGTT 
ACCTCCCTGa 
CCAATGATGT 
AGGCGATGCA 
CTGACAGCAC 
GCGTACGAAT 
AAATTAACAA 
CTAGCGTAGG 
TAGGTGCAGA 
GTGCCAAGGT 
TTGGTTGCGG 
GCAAAGATTC 
ATGTATTGGT 
TGGCAAGAGC 
AATGGCTAGG 
TTCGCATAGT 
AGGACTATAA 



TAAGAAAAAA 
AAGAGTATTG 
TCATGCGAAG 
AGTATAAAGT 
TGAAAGGCrC 
AAGGACATTT 
ATGCatGCAA 
TGGTGACACA 
TGCTAATGCT 
AATCGTTCGA 
AGCTGAAAAG 
GTTTGAAAAA 
TGTATTGTCG 
TGAAGAAATC 
ACGTCGTACG 
TTATATTGGT 
TAATGGTCGT 
TAGTACGATT 
AGCGAATACT 
TGTTAAGGTT 
TTCACATTTA 
AACGATTACA 
ATTTGTAGGT 
GGCAGCTGGT 
AAGACAAACA 
ATAAAAGGAT 
AATGGAGTTA 
ATGTTAAATA 
GCATTAGCGC 



GTTGTTTTTA 
CaGATTAAAT 
ACACGCGATA 
GCTACACGAG 
TGGTGTCGAT 
AGGCGAGCGT 
ATGGCGAAAT 
CCGCTCATCA 
CAAGCAACTG 
AATGCGTCAG 
GATATTAATG 
TTAACACAAG 
TTAATTTTAA 
ATGGGTGTAA 
AATCATTATC 
CCAGACGTTA 
ACAGAAATTG 
GAAAATGGTG 
AAGGTCGGAC 
GGAAATTTTG 
AGTTATATTG 
GTTAACTATG 
TGCAATGTTA 
TCCACAATCA 
ACAAAAGAAG 
AATCCTATGT 
CATyTTATAT 
ATGAATATAA 
AAGAAGTTGC 



AAATTACAAC 
TaTAATAATG 
ATTTTGGCAG 
GTTGCTGGGA 
CAAGTTGTAA 
TCTTTATACA 
CACACTTAGA 
CAAAGGAAAC 
TATTATCTGC 
GTCGTTTAGA 
AAATTAGTTC 
TGAAAAATGA 
ATGATGGCGG 
ATGATCGTGT 
ACATGCTAAA 
CAATTGGTAG 
GCGAAGATGT 
CATGTATTCA 
CGTTTGCGCA 
TAGAAATTAA 
GCGATGCTGT 
ATGGTGAAAA 
ATTTAGTAGC 
CAGATGACGT 
GATATAGGAA 
AATATTAATG 
ATAATAGTAA 
GAATTCGTCA 
TGACCAAGTA 



GAATTAAAAA 
ACGAaGgTAA 
CAGGTAAAGG 
AACCTATGGT 
CCATCGTAGG
GTTTTCAAGA 
AGACAAGGAA 
ATTAGTAACA 
ATCGATTCAA 
ACGCATAGTT 
AGGTATTTTT 
TAATGCGCAA 
CATCGTAGAA 
AATGCTTAGT 
TGGTGTGACA 
TGATACAGTC 
TGTTATTGGT 
ACAGTCTGTT 
ATTGAGACCA 
AAAAGCAGAT 
AATTGGCGAA 
TAAATTTAAA 
ACCTGTAACA 
ACCAAATGAC 
ATAATCATTT 
TAATCTTTAT 
TTGCGTAAGT 
TTAAAGATTT 
GGAATTGAAC 
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GTATTCGTGG TTGTGACGTA TTTATTATTC AACCAACATC ATATCCTGTG AATCTACATT 3 600 

TAATGGAATT ATTAATTATG ATTGATGCTT GTAAACGTGC TTCTGCAGCA ACAATCAATA 3660 

TTGTAGTGCC ATATTATGGA TATGCAAGAC AAGATAGAAA AGCCCGTAGC CGTGAGCCAA 372 0 

TCACTGCTAA ATTAGTTGCA AACTTAATCG AAAcAGctGG CGCAACTCGT ATGATTGCGT 3780

TAGACTTACA TGCACCACAA ATTCAAGGAT TCTTTGATAT TCCAATTGAC CACTTAATGG 3 84 0 

GTGTGCCAAT TCTTGCTAAA CATTTCAAAG ATGATCCGAA TATTAACCCA GAAGAATGTG 3 900 

TCGTTGTTCA CCAGACCATG GCGGsnTTAC ACGTGCACGT AAATTAGCTG ACATTTTAAA 3 96 0 

AACTCCAATT GCAATTATAG ATAAACGTCG TCCTAGACCA AATGTTGCTG AAGTGATGAA 4020 

CATTGTTGGT GAGATTGAAG GACGTACGGC AATTATTATT GACGATATTA TTGATACAGC 40 80 

AGGTACAATC ACTTTAGCTG CACAAGCATT AAAAGATAAA GGTGCTAAAG AAGTATATGC 414 0 

TTGTTGTACA CACCCTGTTT TATCAGGACC GGCTAAAGAA CGTATCGAAA ATTCTGCTAT 4200

AAAAGAATTA ATCGTAACAA ACTCAATTCA TTTAGATGAA GATCGCAAAC CATCTAACAC 4260 

TAAAGAATTA TCTGTTGCTG GTTTAATCGC ACAAGCTATC ATTCGTGTAT ACGAAAGAGA 4 32 0 

ATCAGTTAGC GTATTATTTG ACTAATATTT AAAAGGCGTT TGACGAACAT ATTCCAAACG 43 80 

TGTATAATAG TTTCGTTCGT GATTATACGA ATAAATAAAC ACTTGCAAGC AACGATGATG 4440 

TTGATGGGTA AGTGAGGTGC TCGTTTTGAG CAAAAATGAA AGGTGGAAAT GAGAATGGCT 4 5 00 

TCATTAAAGT CAATCATCCG TCAAGGTAAA CAAACACGTT CAGATCTTAA ACAATTAAGA 4560 

AAATCTGGTA AAGTACCAGC AGTAGTATAC GGTTACGGTA CTAAAAACGT GTCAGTTAAA 4 620 

GTTGATGAAG TAGAATTCAT CAAAGTTATC CGTGAAGTAG GTCGTAACGG TGTTATCGAA 4680

ttaggcgttg gttctaaaac tatcaaagtt atggttgc 4718 
(2) INFORMATION FOR SEQ ID NO: 300:
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 3181 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 300: 
AAAATGCATA TAAATACATA TTAAGGAGGA TTTTATGAAT TTTCTTAAAC CTGCAAAGCA 6 0 

TATTAAGCCT TTGCCAGAAA ATCAGATAGA TGATACCTAT AAACGATTAC GTCTCCAAGT 120

ATTTCTTGGT ATTTTCATCG GTTACGCTGG GTACTATTTA TTACGTAAAA ACTTTTCGTT 180 
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TGCTGTTTCC ATCGCATATG GATTTAGTAA 
CAATGCTCGG ATATTCTTAG TTCTTGGATT 
GGGATTTGTA CCGTTCTTTA CATCAGGTAT

TGGATGGTTT CAAGGTATGG GCTGGCCACC 
TGTAAGTGAA CGCGGAAGTA AGACTGCCCT 

TATTATGGCA CCTATTGCTG CTTGGGGTAT 
TTTAAAAGGT TTCGAAGGTG TATTCATTTA 
AATTTCATAC GTATTGATTA GAGACACACC 

TTATAAAAAT GACTTTGCTA CAAGCGATAA 
AGAAATTTTA TTTAAATATG TACTGAACAA 
TATATTTGTT TATTTCGTGC GTTATGGTGT 
AGAAAAACAT TTCGACTTAA AAGCATCAGG 
AATTCCTGGT ACATTATTAT GTGGTTACAT 
ACCTGCAGGT TTCTTCTTTA TGTTAGGTGT

TCCTCCAGGC AATGCTTGGT TAGACAATGT 
TGGACCAGTT ATGTTAATTG GTTTACAAGC 
CACAgcAGCT GGATTAACAG GATTATTTGG 
CGTCTTAGGT GCTGTAGTTG ATAAATTCGG 
AATTAGTGTG TTTGCAATGT TGAGCTTTAT 
CGTTCATCAT TAAATGATAA AAAATAAAGT 
TTCATCTCTT ATAAGTTCAA CCATATGACT 
TGCCACTTTT GTGTTTGCCC TGCTGTTTTA 

CTAAAAGCCA TTTTCCCTTG ATACTTAGAT 
7CCCTAGGAA ATAGAAAGTA GCCATTTAAT 
TCAGCAAACT CTTCAATTTT GTATGGCCGA 
GTCACGAAAT ATCCCGATTT AGTCGGTGTT 
TCTTTAAAGC TAAATGTTAA TGCyTCGTAA 
TTAACTTCTG ATTCTTCACT AAAAATATTT

rAATCTCACT TTATTTTTTC ATAATCGTAT 



GTTCTTTATG GGTACTGTAA gTGATCGGAG 300 

AGCACTCACT GCTATCGTCA ATTTGTTAAT 36 0 

CGGTATTATG TTTGTCCTAT TATTCTTAAA 42 0 

TTCAGGCCGT GTTCTCGTTC ACTGGTTTAG 480 

TTGGAACGTT GCGCATAATG TTGGTGGAGG 540 

TACAACAACT GCATTTATCA ACTTTGGTTA 600 

CCCTGCACTC TTAGCACTTA TCATTGCCGC 660 

TCAATCTCAA GGTTTACCTC CAATCGAAAT 720 

GAAAACATTA GAAACAGAAT TAACTACAAA 78 0 

TAAATGGGTA TGGGCAATTG CCTTTGCAAA 84 0 

ACTTGATTGG GCGCCAGTCT ACTTAAGTGA 900 

TTGGGCATAC TTCTTATACG AATGGGCTGG 960

TTCTGATAAA TTATTCAAAG GTCGTCGTGG 1020 

CACAGTATTT GTATTAATTT ATTGGTTAAA 1080 

CTCATTAATT GCCATTGGTT TCTTAATATA 1140 

ATTAGATTAT GTACCTAAAA AAGCAGCTGG 1200 

TTATCTGTTT GGTGCTGTAA TGGCCAACAT 1260 

ATGGGATGTC GGTTTTATTT TATTAACAGC 1320 

CCTCACTTGG AATAAAGTAG GACAAGAAAC 13 80 

CATATGGTTA TCTTATCGAA AGATGATATA 1440 

TTTTATTAGT ATTCAAAAAA ATATTTACAT 1500 

TTCAATTGAT TACACCACTT AGGATAAACT 1560 

GAAGCTAAGA TACCTTTTTC TACCAATAAT 1620 

TCATCATCGA TAACAGCAAC AATCAGGTAA 16 80 

TTATAATTAT CtTCGTCTTT TGTCCAACAT 174 0 

TTCTTAGCTA ACCTACTTTG ATATGTTTCC 1800

TCTTGATTAT ATTTTTCTTC AGTTAAATCT 1860 

TTCAACAGTA TTTTAGATTT ACACATATGC 1920 

CATATATTTA mTTTTCGA AAAATACACT 1980 
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10 



TGTATCAACT GTCAAACGAT CACGTAATAA ATAGACGATT AACATTGcTG CTAAAGCACA 2100 

AAGTGACTCA GCAATTAATA ATGACCAAAT GACACCTGTT AGTCCAAACA AAGCATTCAT 2160 

AATAAATAAT ACTGGAATGA TAATTGCACC TTGTAAAATG GCCATAATTG TAGCACCACG 2220 

ACCTTGCCCA GTCGCTTGAA GCATACCAGT AAACAAGAAA CCTATACCAT TTAATAATAA 2 2 80 

TGATGCCATT GTTACTTTCA AAATAAATGt CGCCATCTCA ACAATGGCTT GATCAGTAGT 2 340 

AAATAGTCCG ACCATATGAT GTCCAATTGT AAATACAGCA CTCATACATA CAACAAAGAT 24 00 

AACGCCGATA GACATGATAA CTGCTTTGAT AACGTCTTTC ATACGGCCTT TATTTGCCAT 24 60 

AAAGTTATAT GCAATTAGTG GTACAACACC TTCACATAAT CCCATGATAA TAAGTTCTGG 2520 

aAATTGCACA AGTCTAAATG AGATACCATA ACTTGCaATC GCGAAGTTTC CATAATGTGC 2580 

TAAAAATAAA TTTAAAACTA ATCCTGTGAA TCCCATTAAG ATACTCATTA AAAATGCAGG 2640 

AATACCGATT TTAAAGATTT CAGAAAGCAT TTCTTTATTA GGTTTCGCAA GTTTAATATT 270 0 

AACTGACACA ACGTCACTAT TTTTCATAAA ATAAATGATA AAGAACAGAG CAGCAGCAAC 276 0 

ATTACTGATT GCAGTACCCA AAGCTGcACC AACAACGTTT aAATCAAAAC CAAAAATTAA 2820 

25 AATTGGATCT AAAATAATAT TTAAGCCTAC ACTAGCTAAC ATACCAATCA TAGAAACCAT 2 880 

TGGTGCCCCA ATTGCACGTG CAAATTGTTC TAATATGAAG AACAAAATTA CAAAAGGTGC 2 940 

ACTTAAAAAC ATTACTTTCA AATAATTACT TGTTAAAGCT AACGTTTCAC CTCTCGCCCC 30 00 

TAAAATTGCT GCGATTTGAT CACTGAATGG TAAAGTAACT AAAATCACGA TAAGTCCTAG 3060 

TGCAATACCA CCATAAATAG AGAAACTACT TACAAATTTA CTCTTAmTAT AGTCTTTCGC 312 0 

ACCTAATAAA CGTGAAATAT AAGTTCCTGC ACCAACGCCA AATAAATTAC CTAACCCCAT 3180 

T 3181 
(2) INFORMATION FOR SEQ ID NO: 301:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4029 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 301: 

TCATCACCTT CAAATAAATA GGCATGCGAT AATTTATTTG AATGATATGC ATTCGTCAAT 60 

TGTTGCTGTT CATCCATTCA TTACAAACTC CCTTTTGCTT TTATATAAAA AGGTTGCCAA 120

AGAGCAACTA ATTACTGCAA CAATAGCGAC CATGATTTGT GATTATTTTT AGAACCACAA 18 0 
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w 



TTATTAATAC ACTATTTTAA GTATCTCTAA TTGATCiATTG TATTATAGAA TTAAAATTGA 3 00 

TGGAATGCAT CAACTGGCAT AACAAATACA GTAGCACCGC CAACTTCAAC TTCAACTGGA 3 60 

TATGGAATGT ACGAATCCGC ACTGCCTCCC ATAGGTGTAA TAGGTGAAAC CAACTGTTCT 420 

CTATTACCAC ACGTTTGATT AATCACAGAC AATATTTCAT CTACACGGTC ATCATTGACA 4 80 

CCACATAAGA ATGTTGTATT ACCCGCTCTT AAAAACCCAC CTGTTG1"TGC CAATTTTGTT 54 0 

GCTCTAAAGT TATTTTTAAC AAGTTGATCT GCAAGTTCCT GACTATCTTG ATCTTGTACG 6 00 

ATCGCTATAA TCATTTTCAT TTTATAACAC CTCTTCTAAC AATTATATCA TATCTTTTCT 66 0 

AAATATTTGA TGATAGTTTG ATACGTGTCT TCAACAACAT TTTCAAGAGG TTGATCTGCA 720 

TTAACGCTTT TGAACCGTTG TGATTCATTA TGAATGATTT CTTGGTAACC TTCAATTACT 780 

TTTTCGTGAA cTTTAAATCT TCTTGATCTA ATCTATTTTG ATCTCTTGAA TTTTTAATAA 84 0 

TACGTTCGCG ACCTACTTCA GCACTAACAT TTAAATAAAT CGTCAAGTCT GGaTATAATC 900 

CATTTATTGC AAATTCGTTT AATGCTCTTA CTTCTTCAAC GCCAATCCCT CTAGCATAAC 960

CTTGATAAGC TAATGAACTA TCGATATAGC GATCACACAA CACAACCTTA CCTTCTTTTA 102 0 

AAGCTGGTAT GACCTTTAAT ACAAGATGTT CTCTTCTAGA TGCAGCAAAT AACATTGcTT 1080

CaGTtCTAAT GTCCATATCA TTGCCcTTCT AATACAATTT TACGTATTTC TTCACCAGTA 1140 

GGAACACCAC CTGGTTCTCT AGTCATAATG ACATCATAAT CTTTTACTAA TCTATGGTAA 1200 

ACTTCATTAA TTACAGTTGT TTTTCCAGAG CCTTCTGGGC CCTCAAAAGT TATAAAAGCT 12 60 

GACATTTTAT TCATCCTCAA CTAAAATTTT ATTATTTTTA ATTCCTTCAA CTATCATTCC 1320 

AGTTTCCAGA TATTCATTTA CCAATTCTAT CATATTTTCA GTTATTGTTT CTCCTTTGAA 13 80 

AATAATAGGA ATcCCTGGcG GATACGGGAC AATATGTCGC GCCAGAACTT TACCTTTgCC 1440 

TTTTCAAATC ACACCAAGTA ACATATTCAA AGCGTTTTGG TTTATAGTTA CCTTCAGTyG 1500 

TTAAAAGTTG TGTTTGTTTA ACTTTAGAAA CTGATTTTTT CGGTAAAATC ATATCTTCAA 1560 

TTTTACGTAA AAGCGAATCA AATAAATACG TATCATCATG ATGCCATAAC GGCAATATCG 1620 

CTAATGCTTG ATAGTCGTCC GCTAATTCTA AATAGATATG TGCATTCATA AACCAATTTT 16 80 

GAATATCATG ACCTGTAAAA CCTTCATATT TTATCAGCAA CTTCAACGGA TCATCAACTT 174 0 

GAAGCATTTC AAAACCCTTC TTCTCCAAAC ATTCGATTAA TTGCGCTCTC TTATCAAAAA 1800

ACACGGTACT ATCATATGTT TTATAGAACT CGGCAGCTGA CTCTAAACTA GCCATAATCA 1860 

AATACGAAGG ACTAGATGTT TGGAAGTAGC TTAGATATTC TATAATAGTT TCTCTATAAG 1920 

^TGCATTTTT ATGAATATAA AGTACCGAGC CCATCGTTAA AGCTGGTAAC GTTTTATGAA 1980 
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CAAAGTGCGC GCCGTGTGCT TCGTCAATGA GTACAGGAAT ATTTAATTGG TGCAAAGATT 210 0 

TGATAACCTC TTCTACATTA AATGTTTCAC CGTAATAGTT AGGATAAGTC AACACAGCAA 2160 

GTTTGTGACC GTCATTATTC AAACGGCTTA AATTAACTTT ATTATAATGA TTCGTTAACG 222 0 

GACTTTGATG CGTTTCAATA AAATGCCCTT CTTGTTGGCT AATATCGAGC GCATGTAACA 22 8 0 

CAGATTTATG TACATTTCTT GCCATTAAGA TATCGCCTTT TTTCTGTGAA AAAGACTGGA 23 4 0 

TGACAGATAA TATTCCTGAA GTGGTGCCAT TCACTAAGAA ATAAGCATCA TAATCTGAAT 24 0 0 

GTTTCTCCAC CTGCTTCATA CTTTCCAAAA TGACTTCTTC AGGATGATGT AAATCAnCTn 2460 

AATCCAGGTA TTTCAGTTTT ATCCATTGTC ATTGATAATT GAGATAAATG ACCGATAGTC 252 0 

ATATTTTTAT GACCCGGAAC ATGCAAAGAA ATCGCTTCTT CTTGATTTAA ACTTTCTAAT 2580 

TTATTTAAAA TAGGTTGCTT CATGATATAC GCTTCCTTTA TTTACACTGT TTTGGAATTA 2 64 0 

GTTACTTTCA AAAGTATTAA TTATATAGTA ACACTTCTTT GACAAAAGTT AGTGTTACTT 2 70 0 

ATGCAATAGC TTGTCTATTG TATAATAATT AATTTCCTTT TTGTACTTCG ATTTAAAAGA 2760 

TATTAGACAT AAAATCTAAA AACAGCAGTA AGATGATTTA TGATTAAAAA CTATCTTACT 2 820 

GCTGTTCACT TTTTATAATA CTTCTGAATG TCTTCACTTA TACTTCTAGT CACAGATTTA 2880

AATAATCAAA AGTGCACATT ATTAAAATAT CAATTTCACA CTCAATGCGG CTCATCGCAT 2 94 0 

TCATTTCTTG TCTAGCAACG TTCTACTCTA GCGGAACGTA AGTTAGCTAC CATCCTCGCT 3 00 0 

AAGAACCTTT CTTGACTTGT GACAATCGCT TGCTTCTTTC CTCTCCTTCG GCTCTCGCTT 3 06 0 

ACTCATTTAG CTCTACTAAA CTCGTTGCGC TCTTTTCTCG TTTCGTCAGA TTCAAACGTT 3120 

TTCACTTCGC CAAGCCATTT TTCTTTGTGT TTACTTTTTA TTTTGACGTT TTAGACATAA 3180 

AAAAAGAGAC cTCACGGTCT CAACTTGCCT GGCAACGTTC TACTCTAGCG GAACGTAAGT 324 0 

TGGCTACCAT CGTCGCTAAA GACCTTTCTT GACTTGTGAC AATCGCTTGC TTCTTTCCTC 33 0 0 

TCCTTCGGCT CTCGCTTACT CATTTAGCTC TACTAAACTC GTTGCGCTCT TTTCTCGTTT 33 60 

CGTCAGATTC AAACGTTTTC ACTTCGCCAA GCCATTTTTC TTTGTGTTTA CTTTTTATTT 3420 

TGACGTTTTA GrCATAAAAA AAAGAGACcT TGCGGTCTCA ATGCGGCTCA TCGCATCCAT 34 80 

TTTTTGCCTG GCAACGTTCT ACTCTAGCGG AACGTAAGTT GGCTACCATC GwCGCTAAgA 354 0 

aCCTTTCTTG ACTTGTGACA ATCGCTTGCT TCTTTCCTCT yCTTCGGCTC TCGCTTACTC 36 00 

ATTTAGCTCT ACTAAACTCG TTGCGCTCTT TTCTCGTTTC GTCAGATTCA AACGTTTTCA 3 660 

CTTCGCCAAG CCATTTTTCT TTGTGTTTGC TTTTTATTTT GACGTTTTAG ACATAAAAAA 3720

AAGAGACCTT GCGGTCTCAA TGCGGCTCAT CGCATCCATT TTTTGCCTGG CAACGTTCTA 3780 
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CATGGGAACA GGTGTGACCT CCTTGCTATA GTCACCAGAC ATATGAATGT AATTTATACA 3 900 

TTCAAAACTA GATAGTAAGT gAAAAGTGGA TTTTGCTTCG CAAAACATTT ATTTTGGATT 3 960 

AAGTCTTCGA TCGATTAGTG ATTCGTGCAG CTCCACATGT GCACCATGCT TGCCACCTCG 4 020 

GAACCTATT 4 029 
(2) INFORMATION FOR SEQ ID NO: 302:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7159 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 302:

GATGCAATAG TTGAGTAGTT ATAAGTAGCT ACATACAACC GCTCAAATAT AGGTTCAAGA 60

ACATTTTTAA TGAAGAAAGC GATTTTCCCm CAGAGTGCAA ACGCTTGACT CAATAACGAA 120 

ATGTCACCTA TGTATGGCGT GACTTATTAT ACAGGaGGTG CAAAGTATGT TTGCTATTAT 180 

TGAAACAGGT GCAAAACAAA TCAAAGTAGA AGAAGGTCAA GAAATCTTCG TTGAAAAATT 240

AGACGTAAAC GAAGGAGATA CTTTTACATT TGATAAAGTA TTATTTGTAG GTGGAGATTC 3 00 

AGTTAAAGTT GGAGCGCCAA CAGTTGAAGG TGCAACAGTT ACTGCTACTG TTAATAAACA 3 60 

AGGTCGCGGT AAAAAAATCA CTGTATTCAC ATACAAACGT CGTAAAAATT CAAAACGTAA 420 

AAAAGGCCAT CGTCAACCAT ACACTAAATT AACAATCGAT AAAATCAACG CGTAATTATT 4 80 

ATGATTACTG TTGATATTAC AGTTAATGAT GAAGGCAAAG TAACAGACGT TATTATGGAT 540 

GGCCATGCTG aCCATGGTGa ATATGGTCAT GATATCGTTT GTGCTGGAGC TTCAgCTGTA 600 

TTG'FTTGGTA GTGTTAATGC GATTATAGGA TTGACATCTG AGAGACCAGA TATCAATTAT 660 

GACGACAATG GTGGTCATTT TCATATAAGA AGCGTTGATA CAAACAACGA TGAAGCGCAA 720 

CTAATTCTTC AAACAATGCT TGTGTCTTTA CAAACTATTG AAGAAGAATA TAATGAGAAT 7 80 

ATTAGATTAA ATTATAAGTG AGGTGCATTC CGATGTTAAA ATTAAACTTA CAATTCTTCG 84 0 

CATCTAAAAA AGGGGTAAGT TCTACAAAAA ACGGACGTGA CTCTGAATCA AAACGCTTAG 900 

GTGCTAAACG TGCTGACGGT CAATTCGTAA CAGGTGGTTC AATTTTATAT CGCCAACGTG 960 

GTACTAAAAT TTACCCTGGT GAAAATGTAG GTCGTGGTGG CGATGATACA TTATTCGCTA 102 0 

AAATCGACGG CGTTGTTAAA TTCGAACGTA AAGGTCGCGA CAAAAAACAA GTTTCTGTAT 1080

ATO^ArzTAGf' TGAATAATTT TGTCTAGTTA ACACCAGAAG TGAATCTTCT GGTGTTtTTT 114 0 
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AGACGTTATA CTAAATGTGC AcTGTATAAG AGCCCCTAAT CACTAAACTA TAAGGGGGAC 12 6 0 

AAAGGAATAC AGTTGCAGCG TTTAAAGAAT AAACTGTACC ACAATTGGTG CTGAGAAATA 1320 

TAAGTATTTT AAAGCAAAGA TTTATAAAAG TAACTGCATA AGAGCCCCTA ATTATTTACA 13 8 0 

ATATATAAGG GGCTCTAATA TGCTATAATT ATTGGGAAAA TGAAAATTAT ATGTAAAAGA 14 4 0 

GGTGAGATAT ATGTTTGTCG ATCAAGTCAA AATATCTCTT AAAGCCGCTG ATGGTGGTAA 15 00 

TGGTATTACC GCATACAGAA GAGAAAAATA TGTACCATTT GGTGGACCAG CTGGCGGTGA 1560 

CGGTGGTAAA GGTGCTTCAG TCGTATTTGA AGTGGATGAA GGTTTAAGAA CGTTATTAGA 1620 

TTTTAGATAT CAACGTCATT TTAAAGCAAG CAAAGGTGAA AATGGCCAAA GTAGTAATAT 1680

GCATGGTAAA AATGCGGAAG ATTTAGTATT AAAAGTTCCA CCTGGTACAA TTATTAAAAA 1740 

TGTTGAAACA GACGAAgTGT TAGCAGATCT TGTTGAAGAT GGTCAAAGAG CTGTAGTAGC 1800 

GAAGGGCGGT CGAGGTGGCC GAGGTAATTC ACGTTTTGCA ACACCTAGAA ACCCTGCACC 1860

TGACTTCAGT GAAAAAGGTG AACCAGGTGA GGAATTAGAT GTATCTTTAG AATTGAAATT 1920 

ATTAGCTGAT GTAGGATTAG TAGGTTTCCC TAGTGTGGGT AAATCGACTT TATTATCTAT 198 0 

CGTTTCAAAA GCTAAGCCTA AAATTGGGGC ATATCATTTT ACAACGATTA AACCAAATCT 2 04 0 

AGGTGTTGTT TCAACGCCTG ATCAACGTAG TTTTGTTATG GCAGATTTAC CAGGTTTAAT 2100 

TGAAGGTGCA TCTGATGGCG TTGGATTAGG ACATCAATTT TTAAGACATG TAGAGAGAAC 216 0 

AAAAGTTATT GTTCACATGA TTGATATGAG CGGTTCTGAA GGTAGAGAAC CTATTGAAGA 222 0 

TTATAAAGTC ATTAATCAAG AATTAGCTGC GTACGAGCAA CGTTTAGAAG ATAGACCTCA 22 8 0 

AATCGTAGTA GCTAACAAGA TGGATTTACC TGAATCACAA GATAATTTAA ACTTGTTTAA 2 34 0 

AGAAGAAATT GGCGAAGATG TGCCAGTTAT TCCAGTTTCA ACAATAACGC GTGATAATAT 24 0 0 

TGATCAATTA TTATATGCAA TAGCAGATAA ATTAGAAGAA TATAAAGATG TTGACTTCAC 24 60 

AGTTGAAGAA GAGGAGTCAG TTGGCATTAA CCGAGTATTA TATAAACATA CACCGTCACA 2 520 

AGATAAATTT ACAATTTCAA GAGATGATGA TGGTGCTTAT GTGGTAAGTG GTAATGCTAT 2 580 

TGAAAGAATG TTTAAAATGA CTGACTTTAA CAGTGATCCA GCAGTACGTC GATTTGCTCG 2 64 0 

TCAAATGCGT TCGATGGGTA TTGATGATGC GCTTAGAGAA CGTGGTTGTA AAAATGGTGA 2700

TATCGTTAGA ATTCTTGGCG GAGAATTTGA ATTCGTTGAA TAGGAGCGAA ACATGATGGA 2760 

CAATAAAGAT TATAAAAAGT TTTATTTAAT TAGAGAAGAT GTCTTGCCTG AATCCGTGGT 2 820 

TAAAACATTG AAGATTAAAG ATGCCTTAAA AAGTGATCCG ACATTGTCCA TTTATGATGC 2 880 

CGTTAAACAG TTTGATCTAT CTAGAAGTGC TTTTTATAAA TATAGAGAAA CGATATTTCC 294 0 
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TGTTGGTATG TTGGCACGTG TACTAGATGT TATATCAAAG TTAGAACTAT CTGTATTAAC 
GATTCATCAA AGTATTCCAA TGGAAGAAAA AGCAACAATA ACATTATCAC TGAATGCTAA 
ATCTAAAGAA ACTTCAGTAG AAGATGTTAT TGGCGCTTTG AGAAATTTAG ATTATGTATC 
AAAAGTAGAA TTAATTAGTA TGAGTATGTA AGGATGTGCC TATAATGTAC GCGTATGTCA 
AAGGTAAGTT AACACATTTA TATCCTACAC ACGTAGTTGT TGAAACTGCT GG-l-GrTOGTT 
ATGAAATTCA AACACCAAAT TCTTATCGTT TTCAAAAGCA TCTAGATCAT GAAGnTTAA 
TTCATACATC TrTAATIGTr CGTGAAGATG CACAATTATT GTATGGATTT AGTAGTGAAG 
AAGAGAAAGA TATGTTCrTG AGTTTAArTA AAGTTACTGG TATreGTCCG AAATCAGCTT 
TAGCTATTTT AGCGACAAGT ACGCCTAATG AAGTAAAACG TGCCATTGAA AATGAAAATG 
ATACGTATTT AACTAAATTC CCAGGAATTG GTAAGAAAAC GGCAAGACAG ATTGTCTTAG 
ATTTAAAAGG TAAAGTGAAA ATTACTGAAG AAGATAGCGA TTCATTATTA CAAGTAGACG 
CTACTTCGAC GGTGCAAGAT CAATTCGTGC AAGAAGCAAT GTTAGCGTTA GAAGCATTAG 
GTTATTCTAA ACGAGAGCTT GCAAAAGTTG AGAAAACGTT AAATAAAAAT AAATATGACT 
CAGTTGATGA AGCTGTTAAG GCAGGTCTTC AATTAGTnST ATCrTAATTT TAAATAGATT 
AATAGGGGAA GTGTTGTCAT GAATGAGCGT ATGGTTGATC AATCAATGCA TAGTGAAGAA 
ACTGATTTCG AATTGTCGCT TAGACCTACG AGATTACGAC AATATATTGG TCAAAATTCA 
ATAAAAAGTA ATTTAGAAGT ATTTATTAAA GCGGCTAAAC TTCGTCATGA ACCATTAGAT 
CATGTATTGC TrrrTGGCCC CCCTGGATTA GGTAAGACAA CATTATCTAA TATCATTGCC 
AATGAAATGG AAGTTAATAT ACGTACAGTA TCAGGGCCTT CATTAGAAAG ACCTGGTGAT 
TTGGCTGCAA TTTTATCAGG ACTTCAACCT GGAGATGTTT TGTTrATrGA TGAAATACAC 
AGACTGAGTA GTGTTGTTGA AGAAGTGTTA TACCCTGCAA TGGAAGATTT CTTTTTAGAT 
ATTATCATTG GTAAAGGCGA TGAGOCTAGA AGTATCCGTA TCGACTTACC TCCATTCACT 
TTGGTAGGTG CAACT^CGCG AGCTGGCAGC TTAACAGGTC CACTAAGGGA TCGArTTGGT 
GTGCACTTAA GATTAGAATA TTATAACGAA TCAGATTTAA AAGAAATCAT TATTAGAACA 

GCTGAGGrrr taggcacagg tattgatgaa gaaagtgcca ttgaacttgc taaacgttct 

AGAGGGACTC CAAGAGTAGC AAATCGACTA TTGAAGCGGG TAAGAGACTT CCAGCAAGTG 
AATGAAGATG AACAAATATA CATTGAAACA ACGAAGCACG CATTAGGTTT ACrTCAAGTT 
GATCAACACG GACTAGATTA CATTGATCAT AAAATCATGA ACTGTATTAT TAAGCAGTAT 
^^^r-CO^C CTGTTGGrrr AGATACGATT GCCGTAACAA TTGGTGAAGA ACGTATTACA 
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GGCAGAAAAG CAACACCATT AGCTTATGAA CATTTTGCAA AGTCGAATGA GGAGAGAGAA 4 86 0 

TAACGTGAAT ATTGAAGAAT TTGACTATGA CTTACCAGAA TCATTAATTG CTCAAACGCC 4 920 

TTTAAAAGAT CGTGATCATA GTCGTTTATT AGTCATGGAT AGAGAAACTG GTGAAATGAA 4 980 

ACATTTACAT TTCAAAGATA TCATTGAGTA TTTTAGACCT GGTGATACAT TAGTGCTTAA 504 0 

CGATACGCGA GTAATGCCAG CTAGACTTTT TGGTTTAAAA GAAGAAACTG GTGCAAAAGT 5100 

TGAAATGTTA ATGTtAACTC AAATTGAAGG TAATGATTGG GAAGTCTTAC TGAAACCAGC 516 0 

TAAGCGTATT AAAGTTGGTA ATAAATTGAA TTTTGGTAAT GGCAAAATTA TAGCTGAATG 522 0 

CATAAAAGAA ATGGATCAAG GTGGACGCAT CATGCGTTTA CATTATGAAG GTATTTTACA 52 8 0 

AGAAAGATTA GATGAATTAG GGGAAATGCC ACTGCCACCA TACATCAAAG AACGTTTAGA 534 0 

TGATCCAGAT CGITATCAAA CAGTTTACGC TAAAGAAAGT GGTTCAGCGG CAGCACCAAC 54 00 

AGCAGGATTA CATTTTACTG ATGAGTTATT AATTGAAATT AAAAATAAAG GTGTTAATAT 5460

CGCATTTGTT ACATTACATG TTGGGTTAGG TACGTTTAGA CCGGTGAGCG TAGACGATGT 5520 

GAATGACCAC GAAATGCATA GTGAATATTA TCAAATGACm CAAGAAACAG CTGATTTATT 5580 

AAATGATACT AAGcCAAAGG ACATCGCATT ATATCAGTTG GTACAACTTC AACACGTACA 564 0 

CTTGAAACAA TTCGACGCGA TCATGATAAA TTTGTTGAAA CGAGTGGCTG GACTAATATA 5700 

TTTATTTATC CAGGATTTGA TTTTAAAGCA ATTGATGGCC AGATTACTAA TTTTCATTTA 5760 

CCAAAATCAA CATTAGTTAT GCTAGTATCA GCGTTTAGTA GTCGTGAAAA TGTTCTGAAT 5820 

GCTTATAAAA CGGCAGTAAA TTTAGAATAT AGATTCTTTA GTTTTGGCGA TGCAATGTTA 58 80 

ATTATATAAA AAGAATGTGA GGATTTTGAA TATGCCTGCA GTAACATACG AACACATTAA 594 0 

AACTTGTAAA CAATCAGGTG CGCGTTTAGG TATCGTGCAC ACACCACACG GTTCATTTGA 60 00 

AACACCTATG TTTATGCCAG TTGGTACTAA AGCAACCGTT AAAACAATGA GTCCAGAAGA 6060 

GTTAAGACAA ATTGAAGCAA AAATCATTTT GGGCAACACA TATCATTTGT GGTTACAACC 6120 

CGGAAATGAT ATTATCAAAC ACGCTGGGGG ATTACATAAA TTCATGAATT GGGATGGTCC 6180 

GATTCTTACA GATTCAGGCG GTTTCCAAGT GTTTAGTTTA AGTAATTTAC GTAAAATTAC 6240 

AGAAGAAGGC GTGGAATTTA GACATCATAC TAATGGGTCT AAATTATTTT TGAGTCCTGA 6300

GAAATCAATG CAAATTCAAA ATGATTTAGG ATCTGATATT ATGATGGCAT TTGATGAATG 63 60 

TCCACCGATG CCTGCTGAAT ATGATTATGT AAAAAAATCT ATTGAACGTA CAACACGTTG 6420 

GGCGAAAAGA TGTCTAGATG CACACCAAAG ACCTGAAGAT CAAGCATTGT TCGGCATTAT 64 80 

ACAAGGTGGC GAATATGAAG ATTTAAGAGA ACAAAGTGCA AAGGATTTAG TAGAATTAGA 6540 
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AATGGTTGAA CATACAGAGC AGTTTATGCC TAAAGATAAA CCAAGATATT TAATGGGTGT 6660 

AGGATCTCCa GATGCGTTAA TCGAATGTAG TATTCGCGGC ATGGATATGT TTGATTGTGT 672 0 

CTTACCGACA CGTATTGCCA GAAATGGTAC TTGTATGACA TCGCAAGGTC GTTTAGTTAT 67 8 0 

TAAAAATGCA AAATTTGCAG ATGATTTAAG ACCGTTAGAT GAGAATTGTG ACTGTTATAC 6 84 0 

ATGTCAAAAC TATTCAAGAG CGTATATACG TCATTTAATC AAGGCAGAGG AAACTTTTGG 6 90 0 

TATTCGTCTT ACTACTATTC ATAATTTACA TTTTCTGCTA AAATTAATGG AAGATATAAG 6960 

ACAAGCCATT CGAGAAGATC GTCTTTTAGA TTTCAAAGAA GAATTCTTCG AGCAATATGG 7020 

ATTAAATGTT GAGAACCCAA AAAACTTTTA AGCAAGAGGA GCGTATAAAA TGCAATTTTC 7080

ATTACTAATA TATATAGTCG TAATTTTTGC GGTTATGTAT TTCTTGATGA TCAGACCACA 7140 

ACAAAAACTG TGCGAAACA 7159

(2) INFORMATION FOR SEQ ID NO: 303: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3159 base pairs 
(B) TYPE: nucleic acid
(C) STRANDEDNESS : double

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 03: 

TCCATTTATA GAAATTTCTA AAGCAGAnAA TAAGATAGAA GATATCGGCC AAGGTGCAGA 60 

AATCATCAAA AGAACACAAG ACATTACTAG CAAACGATTA GCTATAACTC AAAACATTCA 120 

ATTTGATTTT GTAAAAGATA AAAAATATAA CAAAGATGCC CTAGTTGTTA AGATGCAAGG 180 

CTTCATTAGC TCTAGAaCAA CATATTCAGA CTTAAAAAAA TATCCATATA TTAAAAGAAT 24 0 

GATATGGCCA TTTCAATATA ATATCAGTTT GArAACGAAA GACTCTAATG TTGATTTAAT 300 

TAATTATCTT CCTAAAAATA rAATTGATTC AGCAGATGTT AGTCAGAAAT TAGGCTATAA 360

TATCGGCGGA AACTTCCAAT CAGCGCCATC AATCGGAGGC AGTGGCTCAT TCAACTACTC 42 0 

TAAAACAATT AGTTATAATC AAAAAAACTA TGTTACTGAA GTAGAAAGTC AGAACTCTAA 4 80 

AGGTGTTAAA TGGGGAGTGA AAGCAAATTC ATTCGTTACA CCGAATGGTC AAGTATCTGC 54 0 

ATATGATCAA TACTTATTTG CACAAGACCC AACTGGTCCA GCAGCACGAG ACTATTTCGT 600 

CCCAGATAAT CAACTACCTC CTTTAATTCA AAGTGGCTTT AATCCATCAT TTATTACAAC 66 0 

ATTGTCACAC GAAAGAGGTA AAGGTGATAA AAGCGAQTTT GAAATcACTT ACGGCAGAAA 72 0 
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AAAAATTAAA AGCATCACAC CTAAGTAAAC AGTTCaATCA TCTTAAAAAA TCCTGGGACA 90 0 

CTTCATACTT GTCTCAGGAT TTTTTmACAA ATTGAATCAG CCTCATAACA TTAAATTATT 960 

TTATCGTACA TTAAATTTAA TAATAACAAC TGATTTTTAT AAGAATAAAG TATCGAmCCA 1020

TAGTAGATAC ACAAATAATA CAAATGAAAC AATTTAACTT GAAAGCTTAa ATAAATATTA 1080 

TCAAGTTAAT AAACAATTAA TTTTTAGATG GATTCATCAA AAATCGTAAA AAAGCACAAT 114 0 

TTGTATTTTA CAAACATTAA TTAAAAAAGA AAGCAAGACA TTCGTGCAAT CGGTTACCTT 12 00 

AAATTGTTTA CAACTGTCAA CAATACCAAG GTTTTATTAA CTATATTTCT CACAAAATTA 12 60 

GCTTTTAGCA TTCCAAACAA AAAAGGTTAA ATCGAACGGA ATTATGGCAT TTTTAACTTA 13 2 0 

ATTGTAAAAA AAGTTGATAA TGGTCAATTG TTAATGAACA GTTAATTATA ATAACGCCCA 13 80 

AAATATATTA TTATTTAATT AAGTTAAATA AAATTATAGA AAGAAAGTGA AACTTATGCT 1440 

TAAAAATAAA ATATTAACTA CAACTTTATC TGTGAGCTTA CTTGCCCCTC TTGCCAATCC 1500

GTTATTAGAA AATGctAAAG CTGCTAACGA TACTGAAGAC ATCGGTAAAG GAAGCGATAT 1560 

AGAAATTATC AAAAGGACAG AAGATAAAAC AAGTAATAAA TGGGGCGTGA CTCAAAATAT 162 0 

TCAA7TTGAT TTTGTAAAGG ATAAAAAATA TAACAAAGAT GCTTTGATAT TAAAGATGCA 16 8 0 

AGGATTCATT AGCTCTAGAA CAACATATTA CAACTATAAA AAAACTAATC ATGTTAAAGC 174 0 

TATGCGATGG CCATTCCAAT aTaATATTGG TTTAAAAACA AATGATAAAT ATGTTTCTTT 18 00 

AATTAATTAT TTACCTAAAA ATAAAATTGA ATCTACAAAC GTGAGTCAGA CATTAGGATA 18 60 

CAATATCGGT GGTAATTTCC AATCAGCCCC ATCACTCGGT GGTAATGGAT CATTTAACTA 192 0 

TTCTAAATCG ATTAGCTATA CACAACAAAA TTATGTAAGT GAAGTAGAAC AACAAAACTC 1980

AAAAAGTGTT TTATGGGGCG TCAAAGCGAA TTCATTCGCC ACTGAATCAG GTCAAAAATC 2 04 0 

AGCCTTTGAT AGCGATTTAT TTGTAGGCTA CAAACCTCAT AGTAAAGATC CTAGAGATTA 2100 

TTTCGTTCCA GACAGTGAGT TACCACCTCT TGTACAAAGT GGATTTAACC CTTCATTTAT 2160

CGCCACAGTA TCTCATGAAA AAGGTTCAAG CGATACAAGC GAATTTGAAA TTACTTACGG 2220 

AAGAAACATG GATGTCACTC ATGCCATTAA AAGATCAACG CATTATGGCA ACAGTTATTT 22 80 

AGACGGACAT AGAGTCCATA ATGCATTTGT AAATAGAAAC TATACTGTGA AATACGAGGT 234 0 

CAATTGGAAG ACTCATGAAA TCAAGGTGAA AGGACAGAAT TGATATGAAA ATGAATAAAT 24 00 

TAGTCAAATC ATCCGTTGCT ACATCTATGG CATTATTATT ACTTTCTGGT ACTGCTAATG 24 6 0 

CTGAAGGTAA AATAACACCA GTCAGCGTAA AAAAAGTCGA TGACAAAGTT ACTTTATACA 2520 

AAACAACAGC CACAGCAGAT TCTGATAAAT TTAAAATTTC ACAGATTTTA ACATTTAATT 2580 
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ACTCAGGCTT TGTGnAACCT AATCCTAATG ACTATGACTT TTCAAAATTA TATTGGGGAG 2700 

CTAAATACAA TGTATCTATA AGCTCACAAT CTAATGATTC AGTAAACGTC GTTGATTATG 27 60 

CACCAAAAAA TCAAAATGAA GAGTTTCAAG TTCAAAATAC TTTAGGCTAT ACATTTGGTG 2 820 

GTGACATTAG TATCTCTAAT GGTTTATCTG GTGGACTTAA TGGAAATACA GCTTTTTCTG 2 8 80 

AAACAATTAA TTATAAACAA GAAAGTTACA GAACAACATT AAGTCGCAAC ACAAATTATA 2 94 0 

AAAATGTTGG CTGGGGAGTT GAAGCACATA AAATTATGAA TAATGGTTGG GGACCTTATG 30 00 

GAAGAGATAG CTTCCACCCA ACATATGGTA ATGAACTCTT CTTAGCTGGC AGACAAAGCA 3 060 

GTGCATACGC TGGCCAAAAC TTCATAGCGC AACACCAAAT GCCATTATTA TCTAGAAGTA 3120 

ACTTCAATCC AGAATTTTTA AGCGTACTAT CACACAGAC 3159 
(2) INFORMATION FOR SEQ ID NO: 304:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3821 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 304: 

GCAAAAACTT TTCTCCAACT ATGTCAGACT ATAATGGCAT GAACAAAATT GATATGATGA 60 

ATCAAATTAA AGTTGATACG ATGTTGCATG GTTATCACGC TGGATTTTTA TTTGCATTAT 120 

TGATTACAGT TGTTAGTTTC TTTTGTTCAT TTATGTTACA AGGTAAGAAA AAAGAAGTTG 180 

ATTCTCGTCA ATAAATATAA GTTGCTAGCT ATATAAAGCT TTTTAGCAAA AGTTCAACAT 240 

ATTGACTTAT CCGGCATTTC AGATTAAAAT ATTTTTATTC CGATTAGAAT AATAAGAATA 300 

AGGAGATATA TTCTATGAAA AGACTTTTAT TTGTGATGAT AGCTTTCGTT TTCATATTGG 360 

CTGCATGCGG AAACAATTCG TCGAAAGACA AGGAAGCTAG TAAAGATAGC AAGACAATTA 420 

ATGTTGGGAC TGAGGGGACT TATGCACCAT TTAGTTTCCA CGATAAAGAT GGTAAATTAA 4 80 

CTGGTTACGA TATTGATGTT ATTAAAGCAG TGGCTAAAGA AGAAGGTTTA AAACTTAAAT 540 

TTAATGAAAC TTCTTGGGAT TCTATGTTTG CAGGTTTAGA CGCAGOGCGT TTTGATGTAA 60 0 

TCGCGAACCA AGTAGGTATT AATCCTGATA GAGAAAAGAA ATATAAATTT TCTAAGCCTT 660 

ACACATTCTC AAGTGCTGTT TTAGTTATTC GTGAAAATGA AAAAGATATT AAAGATTTTG 720 

ATGATGTTAA AGGTAAGAAG TTAGCACAAA CATTCACATC TAATTATGGT AAATTAGCTA 78 0 



AAAAACCTAA TGcTAAGATC AAAGCAATCA 
TTGCATTTTC TAAAAAAGCA GATGATGAAA 
^ AAATCGAGGA AAACGGTGAA TTAGCTAAAA 

CTAAATCTAA ATAGCGAACA ACAACATGCG 
ATGCTAGAAG GTTTGGTCAA ATATTCAATT 

10 

TTGATTATTG CATTATTTAC AGCATTAATG 
ATTTCACGTG TCTATGTATC TATTATTCGA 
^5 ATATTTTATG GTATTCCAGA ATTAGGTAGA 

ACATTGGCAC CTGTAGTGGC TGCTATTATT 
TCGGAAATTA TTCGTGGCGG TATTATTTCT 
TcCaTCGGTA TGACGTATGG TCAAACGATA 
GTGTCGATTC CTGCACTAGG TAATACATTT 
GGATTTATTT TAGTGGCTGA AATGTTTAGA 

25 

GAATATTTAA CAATTTATGT GTTAGTTGCG 
TCAATTATCC AAGGTATCTA TGAATCTTAT 
TGAACAATAT CCATAAATCA TTTAATGATG 

30 

TTGAACAAGG TGAGGTTGTA ACCTTAATCG 
TACGTATGAT TAATGCATTA GAAATTCCAA 

35 CATATACATC TAAAGATAAA AAATCACAAA 

TTCAAAGTTA TAACCTTTTT CCGCATAAGA 
TCACAGTTAA AAAGTTGAAA AAGGATGAGG 

''^ AAGTTGGTTT AACACATGTC AAAGATCAAC 

AACGTGTTGC TATTGTCAAG AGCACTAGCA 
CCAACATCTG CACTTGATCC TGAACTTGTG 

45 

GCTAATGAAG GCATGACAAT GGTCATTGTG 
TCTAATAACA TTGTATTTAT TCmTGAAGGC 
ATGTTCAATC GTCCGAAAAC AGAAGAATTA 
TAATCAAATA GAACCACGTA TCATGTTTTA 
TGGTTCTTTG TTATGTTGTC ATAATCTTGG 
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AAGGTAATGC TGAACAAAGT AGATCTGCAT 960 

CAGTTCAAAA ATTCAATGAT GGCTTGAAAA 102 0 

TAGGTAAGAA ATGGTTTGGT CAAGATGTTT 10 8 0 

CTAGATGCTG CAAAACAAGC TTTCGGACCT 114 0 

CCTATTACAT TAGTTACATT TGTTTTAGGA 120 0 

CGAATTTCAA CGAGTAAAAT TTTAAGAAGT 1260 

GGAACACCAA TGATAGTACA ACTATTTATC 132 0 

TTATTAACAA ATGACGCTGA CAACCAATGG 1380 

GGTTTATCAT TAAATGTAGG TGCGTATGCT 144 0 

ATACCGAAAG GACAAACAGA AGCTGCaTAT 1500 

CAACGTATCA TTTTACCGCA GGCAATTCGA 1560 

TTAAGTTTAA TCAAAGATAC ATCATTATTA 1620 

AAAGCTCAAG AAGTTGCGTC TACAACATAT 1680 

CTAATGTACT GGGTGGTATG CTTTATTATT 174 0 

ATTGAAAGAG GGTATCGCTC ATGATTCAAT 1800 

TTGAAGTCAT CAAAGGTATT GATTTATCTG 1860 

GTCGATCTGG TTCAGGTAAA ACAACATTGT 1920 

CTGAAGGTAC AGTTTATGTT AACGGCAAAA 1980

TAGAAGTTCG TAAACAGTCT GGTATGGTAT 204 0 

CGGCATTAGA AAATGTAATG GAAGGTCTTA 2100 

CACGTGGGAA ATCACTTGAG TTACTTGAGA 2160 

GTCCACATGC ATTATCAGGT GGTCAACAAC 222 0 

ATGAACCCTA AAGTGATGTT GTTTGATGAA 22 8 0 

AATGATGTTT TAAAGGTTAT TAAAGATTTG 234 0 

ACACATGAAA TGCGTTTTGC TAAAGAAGTA 2400 

ATGATCGGAG AACAAGGGGC TCCAGAAGAG 246 0 

AGACGTTTCT TAAATGTTAT AAATGAAGAA 2 52 0 

GTATGGCGAT GAAGCCATAT ACATGATGCG 2580

AGCGATATTT TAACGACGTT TATGATTTAA 264 0 
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TTCTACATGT GCGTTAAAAC CTTTTTTGAA TTGTTGGACG CCATAGTCTT CTGATGACTC 276 0 

TGAAAAGTCA CCGGTAATAC CATAAAAATT ATAGCGATCA ATATGATGCG CTTTAGCAAA 2 82 0 

CTTAATCATT TCCCaCTGCA AATGGTAGGC ACCCATATAA GCATTATATT TAGGGTTTGA 2 8 80 

ACCACTAGAT AAGTAATAAA CTTCATGCTC ATTGTAGATA AATAAAGCAG AAGCTAAGTT 2 94 0 

TAAGACTGCA CCATCTTGTT CAATTTGTTC TATTGTATTG TCGATTTTAC GCTTATTGCT 3 00 0 

ATTTAGCTGT TGTTCTAGCT GTGTGCGTTT CGTTTTATTT TTCTTTGAAT TAGGACTTTC 3 06 0 

TTCCAATGCT TCTTCAACAC CTGAGAGTTC AGCTGTTAAT TGTTGTTGCT TTAATTGTAA 312 0 

CGTTTTTAAA TACTCGTTTA AATCAATATA CGCCAACTTT AACATGGCGT GGTCATCGTA 3180

TAACTTTTGC ATTTCTTCAA AGTATGGTAA CTCACGGAAT TTGAAACCGT GCTTTTCCTC 3240 

AGCCATATGG AATAAGTCGA AAAAAGTTTG CGTTTCATCA ATCGTTAAcG TTTTaGTTTT 3300 

GACArCAAtA TCaTATGTtT tTtTAATATT ACGTCTCGtT TGATAATCCA TTTCTTTTAA 3 360 

AAGTTGGTCT TCAGTCTTAT CTTTTAAATC TAACACTGAC AGCCAACGGA TTTGGCTCAT 342 0 

TGAATCATAA CCTACAGGGA AACCTTGGTG TTTATAACCT AATTTATCCA TTGTTCTaAC 34 80 

AAATGCTCGG TTATCATAAG ATTTAACAAT TTCACCGTCT GCATTGCGTA AATTTTCAAT 3 54 0 

TAAATATGGA TCTACAAGGA CATATAAAGA ATTGTGTTTC TTTAAATATG ACGTTAATGC 36 00 

TTTAAAGAAA AATGCTACTA ATGATTGATT TGTATAATCC ATCACTGGCC CGCGATGTGT 3 6 60 

ATAAAAATAT TTGAA-AA-ATT TAAGTGTGCG TGCTTCTGTC AATAAGCATC CTGCAATCAC 3 720 

TTGACCATTG TCATCTTTAA CCCCTACrAG ATGCACATCG CCTTTTAAAT CAACTCTATG 3780 

ATTGTAATGA ATAGCTGATT GTGTGTAATG TGAAAAATGC T 3821

(2) INFORMATION FOR SEQ ID NO: 305:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1422 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 305: 

GCGACACCAT TGTTTGCAGC AACTGTTAAT TTACCAGTTG ATGGTTTCGA CGGTGTTGTT 60 

GGTTTTGATG GTGTAGTAGG GGTAGTTGTA GATTGCGTAC CCCATGGCGC CACTTTACCC 120 

ATTTTTATTA AATATTTTTC ATTAATTAAG TCATATAATT GATCATAACT ATAATTATGA 180 
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10 



TATTGTAATT GTGTAGCTGC ATAGTCAGCA TAGTTATTCA TTGAACGTGC AAATGAAGCA 36 0 

TAGTCGTGTG TGTGTACGAT TTCAACATTG ATGAATCTAG GGTTACCGAC TGCACCGACA 42 0 

CCCCAAGATA AGTAATCCGT TGGTGCTGTT TCGATTATAC GATCCCCATC AACAAATGCA 480 

TGTACGAATG CGTTTTGATA GTTATTTTTC ATATAACTAA TTTCACCATT TATCGTCGAA 54 0 

CGATCATTAG CTGTATCATG AACTACGATA CCTTCAGGAC GACCTACGCC GTTACGGTAT 600 

GCGTATTTAG GGAAGTAAGA TGTATAATCT TCTTCAATTT TAGGTGCTTT TAAGTTATTT 66 0 

TTACAAATGT AATCGTTAAT TGAAGAGTTT ACTTGTGGTT TATATTTTGG CAAACTCGTT 72 0 

TTTGGTGTTG CAGCAACTGA TCTTGGTTGT GCTGAAGCGC TAAAAGTAGT TACTTTAGGT 78 0 

GTCGCTTCAG TTTTAGCTTT AGGTGCTGAT GTAGTTGCAG CTTTAGGTGC TGCGGTTTTA 84 0 

TATTGCGTyT CAAGAGCTGC AGGTTTAGCA GCTGATTTAA TTAATTCTGG ATTAATTTGA 900 

TTTTCTGAAT TATCATCTTC ATCATCAACT AAACTATAAC CAGCATTTGT AACATTAGTG 960

TTAGTTTTAG GTGCTGTAGT GCTTGTTGAC TTTGCAACAG GCTGCGTATT ATTTGTAGTC 102 0 

GCTGATTGAT TAGCACGAGT GTCACCATTT ACTTGTGCAG TATCAACTTT TTGACTTACT 10 80 

TGAGCATTGC CTGTTTTGTT ATTTGCTGTT TTTGGTTGGA CAATAGCAGG GTCTTGATAT 1140 

ACTTGAGTGC CAGAAATGTT TTGCGTTGGA TTTTTTACCT CAGCTTTTGC TTGTTCAGTA 1200 

GTTGCTTTAA CTTTATTACT ATCTAAAACG TTTTTATTAG TAGTTTGATC TTGTGTCGTC 126 0 

TCAGCTGCTT GAACTTGATG TGCAGTGACT GCTGAACCTA CAAGCGTTAA TGCAACCATT 132 0 

GAgGTAGTTT GTAATTGAAT TTTTTCGCCA TTCTATTTAT TACTCCyAAC ATTTATTAAT 13 8 0 

TATtACTAAC ATTATAGTAC CTGTnTTATA TACCTGTGCG TA 1422 

(2) INFORMATION FOR SEQ ID NO: 306: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 6076 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 306: 

ACGCGTTGGT TATTTCAATC ATAATATTAC TCTGCAAATA CACAGCCCTG TAACCGAGTA 6 0 

AATGGATAGA GATTTGAACA AATGAAAACA ATCAACTAAT GGAAAGGATA AAATATTATG 120 

CACAACAATA ATGAATTCAA CAAAAAACTG AAAGATTTTA TAGGTAGCGA TAAACGGATG 180 

GCTTTAGTAA AGGGTTATGT TAACGAGTAT AAATTAACAA CTGTTTTAAG AGCTTTAAAT 240 
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GTAGATAATC GAATTATTCC TAAAAATATA ACTCAAAACA CAATCTTCAA ATTGAGTAAT 360 

TTAACCTTAG AGGTTAACCT CTATGAAAGA AACACAATTT ATCACGGAGA CTTTGCTATT 420 

TACTATCCTG TAGAATCTGC TCTAATAATA GAAACAGATA CTCCTAGGCT AATTAATCAT 4 80 

ATAAATAACA ACCCAGTCCA AAAAATATTT ATAATAACAA CAAATGACTG GAGCTTTAAT 54 0 

ACACTTGaAT TAGAAAAAAT AGTAGACGAA ACTATAATTT ATGATTTAAA ACAAGAAGAT 6 00 

CCTAAACAAT ATGAAATTTT ATATAAAAAT AAACACX3GAA AATTGCCTTA TTGATGAATT 660 

AAAACGCAAA CTTAGAACAC CTTACCCCTT TTCCGTCCTT ATTGTAATTT TCCAACTACT 720 

CAAATTAAAA ACCACCCGTG CAAACAGGTG GTTTTATTAT ATATAATTAT TCCCACTCAA 780

TTGTGCTTGG TGGTTTTGAT GTAATGTCAT AGACTACGCG GTTGACGTGA TCTACTTCGT 84 0 

TTACGATACG ACTAGAAATC TTTTGTAAGA CTTCCCAATC GATGCGTGCG AAGTCACTTG 900 

TCATACCGTC GATAGATGTT ACTGCACGAA TACCTACTQT GTGATCATAC GTACGGTAGT 960

CTCCCATAAC ACCTACTGAT TGAATGTTTG GTAACACTGT GAAGTATTGC CAAATTTCTC 1020 

TTTCAAGACC TTCTTCTCTA ATCACTTGGC GTAAAATCGC GTCTGATTCT CTAACGATTT 1080 

CTAGTTTATC TTCAGTAATT TCTCCAAGTA CACGAATACC AAGACCAGGT CCTGGGAATG 1140 

GTTGTCTCCA TACTAAATGT TCTGGAATAC CTAACTCAAT ACCTAATTTA CGTACTTCAT 12 00 

CTTTAAACAA TGTATTGATT GGTTCGATTA ATTCGAATTC CATGTCTTCT GGTAATCCAC 1260 

CAACATTGTG GTGTGATTTG ATTGTTTGTG CTGTTTTAGT ACCTGATTCG ATGACGTCTG 1320 

TATATAGTGT TCCTTGCGCA AGGAAGTCTA CACCTTTCAG TTTTGATGCT TCATCATCAA 1380 

ATACGTATAC AAATTCATTA CCAATGATTT TACGTTTTTG TTCAGGATCT GAAACACCTT 1440 

TTAATTTATT CATAAAGCGA TCTTTCGCAT TAACACGAAT AATATTCATG TTGAAACCTT 1500 

CACCGAATTG CTCCATAACC ATGTCGCCTT CACCTTTACG AAGtAAGCCA TGGTCTACAA 1560 

AGATACATGT TAGTTGATCA CCTATTGCTT TATGCAATAG TACAGCTACA ACAGATGAAT 1620

CTACGCCGCC ACTCATCGCA CATAATACAC GACGGTCTCC TACGCGTTGA CGAATCTTTT 1680 

CAATTTCGAT TTCGATAAAG TTTTCCATTG TCCATTGACC TCTACAATCA CAAACACGAC 174 0 

'^^ GG ACAAAATT ATTTAATAAA TCATTACCAT ATTCTGTATG ACGTACTTCT GGATGGAATT 1800 

GAACACCATA AATGCGACGT TTCTTATCTT CGATTGCTGC ATAGTCTGTG CTTGGGCTAT 1860 

CAGCGATAAC TTCAAAGCCT TCTGGAATTT CAATAACTTT ATCAGAATGA CTCATCCAAA 1920 

CAGTTTGTTC TGCTGGTAAG CCAGCGAATA ACTCATCTGA CTTCGCATTA ATGATTGCTT 1980 

. ^.p^j^^^^ 2040 
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25 



TAAATGAACC TTCTTCATAA ACTGAATTTG GACCACCTGA TAAGATAATA CCTTTTGGAT 2160 

TCATTTTCTT AATTTCTTCA ATTGAAATTT CATGATCGTG TAATTCACTA TAAACGCCCA 2220 

TTTCACGAAT TCGGCGTGTA ATTAATTGGT TGTATTGGCT ACCAAAGTCT AAGACAAGGA 22 80 

TTAACTCTTG TTCTTTTGCC ATTTCCATAT TTGTCGTTCT CCTTTATCTT AATTAGAATG 234 0 

AGTAGTTCGG TGATTCTTTC GTAATTTGAA TATTATGTGG ATGGCTTTCT GCTAAACCAG 24 00 

CAGGACCCAT ACGTGTAAAT TGTGCTTCTT CGCGTAATTC TCTTAAATCG TGTGAACCAG 24 6 0 

TATAACCCAT ACCAGCACGC ACACCGCCCA TTAATTGGTA AATTGTATCT TGTAACGCAC 252 0 

CTTTATAAGC CGTACGTCCT TCGATACCTT CAGGAACAAA nTCTTAGGC GCTTTGTCCT 2 580 

CTTGGAAGTA ACGGTCGTTT GAACCTTTTT CCATCGCACC TAAAGAGCCC ATACCACGGT 2640 

ATACTTTATA TTGTCTACCT TGGAAAATTT CTGTTGCGCC TGGGCTTTCT TCAGTACCTG 2700 

CTAATAAGCT ACCTAACATA ACCGCATGTC CACCAGCAGC TAATGCTTTA ATGATATCTC 2760

CTGAGAATTT AATACCACCA TCAGCAATGA TAGCTTTACC ATGTTTGCGT GCTTCAGTtG 2 82 0 

CACAATCATA AATTGCTGTA ATTTGTGGTA CACCAACACC TGCTACAACA CGCGTCGTAC 28 8 0 

AAATTGAACC TGGGCCAATA CCAACTTTAA CAATATCTGC ACCCGCTTCA AATAAATCTT 294 0 

TTGTTGCTTC TGCAGTTGCT ACGTTACCTG CTACTAATGT GATTTCTGGG TAAGTCTTCT 3 0 00 

TAATATGTTT CACTTGATCG ATAACACCTT TAGAGTGACC ATGTGCTGTA TCGATAACTA 306 0 

AGACATCCAC ACCTGCTTCG ACTAATTTTT GAGCACGAAT ATCAGTATCT TTTGAAATAC 312 0 

CAATTGCTGC GGCTACAAGT AGACGACCAT GTTCATCTTT TGCTGCATTA GGGAATTCGA 3180 

TAACTTTTTC AATATCTTTA ATAGTAATAA GACCTTCTAG ACGTCCGTCT TTAACTAATG 3 240 

GTAACTTTTC AATCTTATGT TTTTGGAGAA TTTTTTCTGC TTCTTCAAGT GTTGTATTCA 330 0 

CTGtGAGCTGT AATTAAATTt TCTTGCGTCA TTACATCTAC AATTTTAATC GAGAAGTCTT 336 0 

CAATAAAACG TAAGTCACGG TTTGTTAAAA TACCTACTAA GTTGCGATCT TCTTTATTAT 3420

CAACAATTGG TACACCTGAA ATACGGTATT TACCCATTAA TGCTTCTGCT TCATAAACGC 348 0 

TTTCTTCTGG CGTTAAGAAA AATGGGTTTG AAATGACACC ATTTTCTGAG CGTriTACTT 3 54 0 

TTTGAACTTC GTCCGCTTGT TCTTCAACGC CCATATTTTT ATGAATAACA CCTAAACCAC 360 0 

CTTGACGAGC CATAGCAATC GCCATTTTAG ATTCAGTTAC AGTATCCATA CCAGCAGAAA 366 0 

TAACTGGAAT ATTTAATTTA ACTTTGTCTG ATAATTGTAC GCTTAAATCA ACGTCTTTCG 372 0 

GTAAAATATC AGATTGTGCT GGAATTAATA ACACATCATC AAACGTTAAT GATTCTTTTG 3780 

CAAATTTACT TTCCCACATT AAAAACAGCC TCCATTTTTC AAATTAATTA GTTATATTAT 3 84 0 
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GCAGAGATTG CGCCTAAAAC AATTCCGTTT TGAGTCAACC ATGCAAATTG TTCACCTAAA 3 960 

CCTTTAAATG CTTGTGGTAC AGCGCTTATA CCAGTACCTA AtCCTACTGA TACAGCGATA 4 020 

ATTAATAAAT TGTTTTGATT TTTAAAATCG ATATGTCCTA ATATACTAAC ACCATATGCC 4 0 80 

ATTACCATGC CAAACATAGC TATCATCGCA CCGCCTAACA CAGGTAGCGG TATGATATTT 414 0 

GCTAATGCGC CAAGCTTAGG TATACAACCA CATATAAGTA ATAACACGAC CATGCCGTAT 4 20 0 

ATAACATTGT TTTTCTTAGC GCCGGATAAA GAAACAAGTC CTACATTTTG CGAATAGGCT 4 260 

GTATACGGAA ATGAATTGAA TATAGAACCT AAcACTATCG CTAGACCTTC CGCAGTATAA 4 320 

CCTTTACGAA AATCTTTTCT TTCTAACTTC TTACCGGTAA TTTCACTTAA CGCATGATAG 4380

ACACCTGTCG ACTCAATTAA ACTAACGATA GCTACAATAA AGAACACTAA CGTCGATGTC 4440 

ACATCAAAGC TAAATCCAGA GAATCTAAAC GGCACTGGGA TGCCTAACCA ACCGGCATGA 4 500 

TTGACTTGAT TAATATCGAC CATCCCAAGT AAGCCAGCAC CTATCGTTCC TAAAACGAGT 4 560 

CCAATTAATA TGGCAATACT CTTAATAAAT CCAGTTGTGA ATCTTTGTAA AAGAAGAATA 46 20 

ATGATTAATG TCATTAAACC TAACAAAATG TTCTTAACAT CTCCATAGTC CTTTGCACCT 4 680 

TGACCTCCAG CTAAGTAATT CATTGCTACT GGCATTAAAT TGATACCAAT GATAGTAACA 474 0 

ACACTACCCG TTACTACTGG TGGGAAGAAT TTTACAAGAT GTGAAAAGAA AGGCGCGATG 4 800 

ATAATAACTA ATATCCCTGA TAAAAATAGC GAACCATAAA GTACATCTAT TCCTTTCGTT 4860 

TGACCAATTA AAATCATGGG CGCAACAGCC GTGAATGTAC ATCCAAGAAC GATTGGTAAT 4 920 

CCTGTTCCTG TTACTTTATT GGCTTGTAAA AATGTGGCAA CCCCACACAT AAATATATCT 4 980 

ACTGTAACTA AGTAAGCGAT TTGTTCAGGT GTAAACTTCA AACTTGTACC AACAATGATT 504 0 

GGAACTAAGA TAGCACCTGC GTACATAGCT AAAAGATGTT GAACACTTAG GATTAAATTT 5100 

TTCATTATTC TTCTCCCACC AATGTCACTT TGTTTCCTTC TAGTGAAGCA ACCTTGCAGA 516 0 

GAGAAGAAAC TGTTAAACCT GCTTCTTCTA AACGTTGATG CCCATTTTGG AAACTCTTTT 5220

CAACAACAAT ACCAATACCA GCTGTCTTAG CATTCGCTTG CTGTGCGATA TCGTATAATC 52 80 

CTAATGAAGC ATCACCATTT GCTAAAAAGT CATCGATGAT AAGTACAGTA TCTTCTTCTG 534 0 

ATAAAAACTC TTTTGAAACA ATGACCGTAC TTGTTTTATT TTTAGTAAAT GAATGAATAG 54 0 0 

ATGTTTCATA ATAACCATCC GTCAAAGTGC TAGGTTTTGC TTTTTTCGCA AATAAACATG 54 60 

GCACATCAAA ATGCAGTGCA GCCATGATTG CAGGTGCGAT ACCGGAAGCT TCAATGGTTA 5520 

AGATTTTAGT AATCCCTTTA TCTTTAAATT GCTCGTAAAA AGTGCGACCA ACTTCATTCA 5 580 

" - - . -^^^^^-T^^^;^^- '"^A.AAA.'^^ TT'^T'^ATrAA ^64 0 
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ATTTGTGTGA AACATTTTGC TCTTAAATTG GTGCTAGATA CAAAAAAATC CCCAAACTAA 576 0 

ATAATAGTTT CAGGCTTTAT GAGTGAACGa ACATGCATAA CGAATTTGTC ATGCAATCAA 5820 

TGTAAAAGAA GTTTCATCAA ATAAACTGTG ACCATCATAT AAAATGATAT AAATCACCCA 58 80 

CCATGGTTAC AATTTAATGG CTGAAGCTAC TCCTAGTATT GTGTTGTTAC TCATAGTCAT 5 94 0 

GTCGTTCAAG GCAACATGGT AGAAACTTCT AAAGCCATAT TCTTTAGATT ATATGAGTTT 6 000 

ATGTAAATTA TTTAACGATA ATAGCAAATT TTCGGCATTT TTTCAATAAC TGCTTAGGTA 6 060 

ATCTTTTAAT AGTTTT 6076 
(2) INFORMATION FOR SEQ ID NO: 307:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6136 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 307:

ACCGCCGATT GATAGCTTTA CTGCTGCGAT AAAACTGCCT TCAACAATTG GTGCATCAAC 60

TTTTAACACA CGATGATTAC CATCATACAT TTCAATTGCC ATATCTACAT TCATTTCTGA 120

AGATCCAATA TCGTAAAAAC ACAATGCATC ATCCTCTAAT TTAGTCAAAA CTTCTTGGAT 180

GATATCAAAT GAAGTTCCAA TTGAACCATC TGGTAATCCC CCGATTGGTA TAATATCAAC 240

GTCACCTGCC ATTTGCTTTA ACAAAGATTT TGTACCACTT GcAATTTCTT TACTGTGGCT 300

AACAAGTATA ATTTTAGGCA TTATCTTCAT CTCCAATCAA AGCGTTTAAA ATATAAACCA 360

TACTTTGAGC ACCTGGATCA ATATAACCTT TTGATTCTTC TCCAAAATAT GCAGCTCTAC 420

CTTTCGTTGC TACCATATCT TTTGTATTAT CTGCTAATTG CTGTAAATCA TTGAATGTTA 480

AAGTTTCACC ATTTTTAAGC TTCTCTGCTG CTCGCGCTAC TACATCATAC ATTGTCTTTT 540

CATTTAAAGT AACTTTACCA CGTGATTCAA CCGCTTCGGC AAATGCCTGA ATTAGTGTAA 600

TGAAATCTTG ATTATCCATA TCATCTTTGG TGACTGCAGA CATTTTAACA AAGCTAAAGC 660

CATACAGTGG TCCTGATGCA CCCCCAACAT TTGACATCAA TGCCATACCA GTTGATTTGA 720

ACAATGATTG CATTGAGCTA TCATCAAGTT TGTCTTTAAG ACTACTAAAC CCACGAACCA 780

TGTTAACCCC GTGGTCACCA TCACCAATTG CTCGATCTAA TTCAGTTAAT TCAGATTCAT 840

GTTTTTTAAA CGTTTCTTCT AAATTTAATA AACGTGCTTT CATATCATTC ACTTTCATTT 900

GTGCCACCTC ATAAATGTAT ATTTATTCAT ATTCACTTCT TATTTAAAGT ATTGACTTGT 960
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ACCTTGCATG TCTAAAGATG TCATATAATC ACCAACAAAC CATTTAGCAA CATTAACCGT 1080 

TCTTGCAGCT AAATTTTGTT GAATATATTT AGTTACGATA TTTAATTCAG ATAACGGCGT 114 0 

ACCACCCATA CCATTTACCA TTAATATGAC ATCATTGGCA GTAACTTCTT TATACAATTC 1200 

GTCTAACAAT GTTCCAACAA TATGATCAAT ATCCTTTACT TCTTCCCTAT GAATACCTTT 1260 

TTCACCATGT ATACCAATAC CGATTTCCAT TTTGTCGTCT TCAATATCAA AGCCATATTT 132 0 

TCCAGTAGTT GGAACAAGCG GTGGCTCAAT TGCCATACCA ATACTTTTAA TTTCAGGTAA 13 80 

CAACGCTTCT ACACGCGATT TTATCTCTGT TAATGAATAA CCTTTTTCAG CAAGATAACC 1440 

GGCAAGCTTA TGAACAAACA CTGTTCCTGC AACACCACGA CGTTGTACTT CGTTTGTCAC 1500

AGCAATGTCG TCACGAACAA TAACAGTTTG AACATTTATA CCTTCCATTT CTGCAAGCTC 1560 

TTGTGCCATT TCGAAATTCA TCACGTCACC TGCATAGTTT TTTACAACTA GTAATACACC 1620 

ATCACCAGTA TCTACTGCTT TAATAGCTTC TAATATTTTA TCAGGTGTAG GTGATGTAAA 1680

TACTTCGCCA CAAACCGCTG CATCTAGCAT ACCTTCTGCA ACAAAACCGG CATGCGCAGG 1740 

TTCATGTCCG CTTCCACCTC CAGAGACTAT TGCTACACCA TGTTCTTTCT TAGCTTTTTT 1800 

TACAATAACT GTATTAGCAA TCAGATCTAA CTCTGOGTGC GCAATTAACA ATCCTTCAAG 1860 

CATATCAGTT AAAAATGTTT CTTTTTTATT GATTAACTTT TTCATCATGT TGTACCTCCT 1920 

TGGTATTATC AGTCATAGTA TAAAACAGAA TAAATGAATG CGCTATCATA AAGAATTAAC 1980 

CTGATACCGT TATCAAATAT ATTTTTGATA AGATCCTCTT GATAACTTAA TGGTTCATTA 2040 

TTGAAAAAAT AAATAGTCTT GAGTGGCTCA TTAAATAAAA AATGCTATGA AAAGCCTTAT 2100 

AATAAAGTGC CTTTCATAGC aATAAGTTGT GTCCATTGAC ACTATACATT TTTCGTTTTG 2160 

TACATTAAAT ATAAGAAATA CGGTGCACCA ATAATTGCTA CGACAATACC TGCTGGAACC 2220 

CCACCTGGTT GTAATACAAT TTTGCCAATT GTATCAGCTA TAACAAGTAA ACATGCCCCT 2280 

ACTAAAATGG CAATTGGTAA AAACAACTGG TGACGTGGTC CAACGATACG TTTGGCAATA 2340

TGCGGACCCA TTAATCCGAT AAACGAAATT GAACCTGCTA cTGCTACAGC AGCAGATGAT 24 00 

AACATCACTG CGATAAAGAA TAATATTAAA CGTTCTCTGC TTAACCTTAC ACCTAGACCT 24 60 

CGTGCAATAT TATCACCCGT ATGAATAATA TTTAGTGTAT TCGATTTAAA TAGTAAGTAA 2 520 

GGAATAATAA TCAACACCCA CGGTAAAAAT GCAATGACAA ATGGCCATTC GTCACCCCAA 2 580 

ATATTACCTG CAAACCAAGC AGCGATGAAA TCAGATTGCT TATCATCAAA TTTTGACATA 264 0 

ATTGTAATTG AGCCACCATA TAATGCTGTT TGTAAACCTA CACCTATTAA TACCATACTC 270 0 

— — , ^O^aaA'^AA AAATAATPaA TGCAGTGqTG 2760 
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CCAATTGCAA TAAATAATGC GATTGCAAAT CCGCCACtGC GTTAATACCT AATATACCTG 2 8 80 

GTTCAGCTAT TGGATTTTTT GTGACACTTT GCACTATTGC ACCACTAATA CTAAgCtGCG 2 94 0 

CCAGCCAAAA TAGTAATCAT CATCCGAGGT AACCTGAAAT CTAATAAGAT TAACTCATCA 3 00 0 

ATGGCATCAC CTTGTCCAAT TAAAGTTTTG AAAAATCTTT CAACAGGTAT GTTGTATTCA 3 06 0 

CCTGAGGTAA TACTCCAAGT ACAACCTAGA AATAGTAGAA TGCTAAAAAC AGCCAGTGCT 3120 

ATCAATTGTC TGCGTTTATT ATTTGAACTA ATCATATTGA GCGTCCTCCT TTTTTAACTA 3180 

AATATAAAAA GTAAGGAACA CCGATAAATG AAATGATTGC ACCAACAGGC GCTTCTCCTA 324 0 

AATATCGTGC TATCACATCG GCAACAAGCA CGAGTATCCC ACCTAACAAG GCTGTTAATG 3300 

GTAGAATTTT AGCATAATCA GTTCCAATTA AAAATCTTGC TATATGAGGT ACCATCAAAC 3 3 60 

CTACAAATGC AACTTGTCCA GCGATAGCAA CTGCAATACC TGCTAGAATC ATAGCAATAA 34 2 0 

20 TThlAQKTKT GCCTCTGATC ATTGTTACAT TTTGACCTAA ACCTTTAGCT AATGATTCAC 34 80 

CAAGATTTAA AATGGTAAGT TGTTTACTAA TTGTTAATAT AATGAATAAC GCAATACCAA 3 54 0 

TTAATGGAAT TGCCCACTTA AGGTGTGACC ATGTTGTGCC TGAAACGCCT CCAGCAGTCC 3 6 00 

AAAATGTTAC TGTTTGATTT AGTCTAAAAG CTAATGCAAT ACCTTGACTT AGCGCTGTTA 3 660 

ACATAGCACT TACTGCTGCA CCCGCTAAAA TAATACGCAT CGGATTAAAT CCATCACGTC 3 72 0 

TAGATCGGCC TATCATTAAT ACAATAGCAC CTCCTAGAAT AGCACCTAAA AATCCAGCAA 3 780 

hCPrCQhPCVKV TPJ^J^iPJ^TQPJK, GTGTTTGGTA AAACTGCATA TGTTAATGCT PJKAGCAPATO 3 34 0 

AAGCACCTGA ATTTAAACCT ATGAGCGCCG GATCAGCAAG ACCATTACGA GTAACACCTT 3 90 0 

GTATAATCGC ACCAGAAACT GCAAGCGCCA TACCTACAAT TACTGCTGCT ATATTTCTGG 3 960 

GAATCCTAAT CTCATTGATG ATGTTTTGCT GTTGATTGCT AGGATTATAA TTAAAAATAG 4 02 0 

CCTCTATAAT TGTAGAGGCT TGAATTTTGG CGTCACCTAT TAATGTAGAA ATAAATAGTG 4080 

TGATTAGTAG TATCATACTT AAACCTATAA TATAGGATAA AAACTTCAAT GGCGTTGGGT 4140 

TCTCTCTATT TGTCATGTTA ATTGTCCTTT TTATCATATT AACTTACTTA ATTAAGAATA 4200 

AGCTCTGCGA CATAAGTCAT AAGTTACCAG TAAAGGTTTT CCAGTTTTAG GATCTTTACT 4 26 0 

TAAAACAACA TCAATATTAA AAACTTTTTC TAATATTTCC TGTGTTAATA CGTCTTCTGT 4320 

TGAACCTGTA GCGATGATAT CCCCTTCTTT CATCGCAATA AGATGATCTG AGAAACGAAT 43 80 

CGCTTGGTTG ATATCATGAA GAACCATGAC AATTGTACAA CCTTGTTCCT GATTTAGCTT 4 44 0 

CTGAACTAAT TCTAGTATTT CTAATTGATG ACAGATATCT AAATATGTTG TTGGTTCGTC 4 50 0 

TAAAAAGATA ATATCAGTTC TTTGTGCTAA TGCCATTGCA ATCCAAACAC GTTGTCTTTG 4560 
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10 



TGCCCAATCA ATtTCTTTCT TAtCCTCAGC AgTTAATCTA CCAAATCCTT TTTGATGTGG 4 680 

AAAACGACCA TATGAAACTA ATTCCCCAAC AGTTAAGCCA TCTGCTACTT CaGGTGaTTG 4 74 0 

aGGTAAAaTG GC tATTTTTt TGcAATCyCy TTCGTAGATt GTGtATGAAT ATTTTCACcA 4 800 

TCTAAAAATA CTTCGCCTTC TTTAACTGCC AATAAACGTG ACAATGCCTT TAGCAAAGTA 4 86 0 

GATTTCCCGC AGCCGTTAGG ACCAATGATT GACGTCACTT TGCCATCTGG TATTTCAACA 4920 

TCTAATTTAT TTATAATCGT GTTATCCCCG TAACCAATTT TAACTTGTTG TCCATGCAAA 4 98 0 

CGATTCATAA TTTCCCTACT TTCAATAAAA TTCTTTCTGT TTATAAAAAA TAATTTCTAT 504 0 

TTTTAAATTA TCAATTTTCA AAGACATCCC AATTGATAAT GATTATCATG AACATCATTA 5100 

TAACATTTTT CAATCTTATT GACTAACATT ACTTTTTAAA TTGGATAGCT CGATTTGTCA 5160 

TGTCTTGTAT ATTACTTTTA TAAAATAAAA AACGCCCACA GATAAGTCTT CATAGTTCAA 5220 

AAACTTGTCC GTGGACTTCT ATTTAAGTAT GTGTGCTCAT ACCATTTATT TATTCATCTG 5280 

CAAGAAAGCC ATTACCATAG ACATCTCTTA CATCATGAAT TACGAGGAAT GCATCTTTAT 534 0 

CGATTTGTTT AATTAATCGC TTTGCTTTTG AAACTTGTGT TTTAGAAATA ACAACGTATA 54 00 

AGACATCTTT TTCTTCACGC GTATAATAGC CATGTCCGTT TAAAATGGTT AAACCTCTTC 54 6 0 

CAATTTGCTC GTCTATTGCT TTGGCAAGTT TGTCGGGATT AGTTGAAATA ATCGTCATAG 5520 

CTTTTTTAGT GTTTAAACCT TCTATGACAT ATTCCATCAC TTTTGTTCCT ATATAAAGTG 5580 

ATATTACTGT TACTAATACT TTATCAAGTG GAATAACTGT AAGTGAAATT GCAACAACGA 564 0 

TCATATCGAA GAAAAGCAAA GCATATGGCG TGCTTACATC GAGGTATTTC GTTGCAATTC 5700 

TCGCCAAAAT TGTTGTACCT GCTGTTGTAC CGCCTGCAAG GATAATTACT CCGATTCCTA 5760 

GTCCAACGCT TACACCACCA AAAATGGCAT TCACAATGCT GTTTCCAGTT TCTACTTGCC 5820 

ATGATTCTGT TAAACTCAAA AATATTGAAA TAAGAATTGT TACAAGAATA GTTAAGTACA 5880 

TACTTCTCTT ACTCAAAAAt TTATAAcCTA TGGCAATCAA TACTGCGTTG ACCAAGAAGT 594 0 

TAGTGATGGC TGGTGAAATA TGAAACGCAT AATATAAAAT AATTGCTAAA CCTGTAACCC 6 00 0 

CGCCTTCACC TAAGTTACCA GAAATAATAA ATGCATTTAC ACCTGCAGCA AAGATAAATG 6060 

AACCTAAGAC AACTAGTATT AAATCTTTAA CCGTTTTATT CACGAAACCA TCCCCTTTAT 6120 

ATATTTATTA GACTAT 6136 
(2) INFORMATION FOR SEQ ID NO: 308: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2576 base pairs 



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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 308: 

GATATCGTAs CTACTGAAAA GTCATCACCA CCATGGAATG ATTTCTTTAA ACGTTTTAGC 6 0 

TTCTATGCAA TTGCAATTCA ATACTTTGTT GTACAATTTA TCATTACATT ATTCTTAATT 12 0 

TGGTTACCGA CGTATTTAAC AGAAGTATTC CACGTTAACT TTAAAGAAAT GAGCATTAGT 180 

TCATTACCTT GGTTATTAAT GTTCTTCTTA ATCTTATCAG CAGGTGCAAT TTCTGACCGT 24 0 

GTATTAGGAT TAGGTCGTTC AAAATTCGTA GCTAGAGGTG TAATTGCAAT TGCAGGATTT 3 00 

ATTGTGTTTG CAGTTTCAAT TATCTTTGCT GTACGCACAG GAAATTTATA TGTAAGTATT 360 

TTCTGGTTAT CACTAGGTCT TGGTGGTATC GGTATTTCAA TGGGTATGAG TTGGGCTGCA 420 

GCAACTGACT TAGGACGTAA CTTCTCTGGT ACAGTATCAG GGTGGATGAA CTTATGGGGT 4 80 

AATATAGGTG CATTAATCAG TCCGCTATTA GCAGGTCTAT TCGTAGAACA TTTGGGTTGG 540 

ACAATGACAT TCCAATTGTT AATCGTTCCA GCAGTAATCG CTGTGATTAT GTGGTTCTAT 6 00 

GTGAAACCAG ATCAACCTTT AATTGTTAGT GATGATAAAG CAATAGAAAA ATAATTTAAA 660 

CAAGCAGTAA GCTTTCACAT AGTTGCXKSCT TATTGCTTTT TTTGCGTTGA AATTGAAACT 720 

TTTTAAAACA GATATGGTTT AAGATGAAAA TGAAGTTATT GAAATGATAT ATGTAAAGAA 780 

ATAAGGTTTT AAAACATTAG TCAGGTAACG CTTGTaAAAG TACATATAAA TTTTAACTAG 84 0 

CGCAAAGGTG GGCGACCAAA GtTcAACGAT GTTAAATAAC aTTAGrAATT AATTTTAATT 900 

GGACTTTAAA AGTTTTTAAA TTTAGATAAT TGAGCATAAG GTGTTATAAT GACATATGTT 96 0 

GCGTAATTAA AATTTATAGC AACAAATTCA TTTTAACTAT GCTAATAAAA AGATTATGGA 1020 

AATATTTTGA CAAGGAAAGG AGAAGTCGAA ATGACATCTT TTTGACATCA CTCATAAAAA 1080 

TCAAtCGACT TAACTTAGAC TTTTATAAAG GTGTAAGACA GGGACTGTTA ATGATTATTC 1140 

CTGCAATAAT CGGTTACTTA TGTGGTAATT TCCAATTTGG ATTATTAGTT GCAACCGGAA 1200 

CACTAGCCCA TATTTATGTT TTTAAAGGTC CGTCGCGATC TAAGCTGCGA ACTGTAATAA 1260 

TTTGTAATTT AGCGTTTGCA ATATGTATGA TGCTTGGTAC GCTAACAGCC AAAACGCCAC 1320 

TCGTTTTTGG AATGACATTA TTAATTGTTA CGGTTATACC ATTTTATATA TTTACTGCCT 1380 

TAAAAATAGC TGGACCGTCA TCGACATTCT TCATTGTGAC ATTCAGTCTA CCCATTAACT 144 0 

TACCTATAGC TCCCGAAGAA GCATTATATA GAGGCTTTGC GATTTTAGTA GGCGGTATAC 1500 

TTGCCACTAT GATGGTGTTA ATCACGATCG TATTTTCTAA AAACAAAGCT GAAGAACAAG 156 0 

CAATTCAAAA TGATTTTAAA CTCATATCTA AGTTGTTACA CACTTATAAT GATAAATCTG 162 0 
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TCACTTCTAC TTCAAGTAAC GATAAATTAA GTAGACGTTT CCAAAAATTA TTATTATTAC 174 0 

ACACATCTGC CCAAGGGATT TATTCTGAAC TGTTAGAGTT GAACGCTAAA CAAATTCGAC 1800 

CATTGCCAGA TGAGTTAATT GAAATGATGG ATCATATCAT TGCACAACTA GATAATAGTG 1860 

AGGAAAATGT AAGATATTGG CGAAAAGAAG TGACAGTAAC AGAGGAATTT CAAAATTTAT 1920 

TCAACCATAT ATTGAAAATT GATGAAATGG TGCATGCAAA TGAAGCGCGT ATTGCGTATG 1980 

AAGCAGACAT GCGAAAACCT TTATATAGTA AACGCATTTA TCaAAATTTA ACaTTAGACT 204 0 

CtATkGTTTT TAGAAATACA TTGAGATATA CAGCGATTAT GATGATAGCG ATATTTATTG 2100 

CGTTAATGTT TGATTTTGAA AAAGCATACT GGATACCGTT ATCTGCACAT ACAATATTAC 2160 

TAGGAACATC AACTATACAT GCAATCGAGA GAGGTATGGC ACGAGGTTTA GGTACTATTT 2220 

TAGGTGTGTT AGTACTTTCA GTCATATTGT TGTTTTCAAT ACCAACACCT GTTGCAGTAA 2280 

TTTTAATGGG CATTGCAGCA TTGTTTACTG AAGCATTGGT GGGAGCAAAT TATGCGATTG 2340 

CAGTAGTTTT TATTACAATA CAAGTTATTT TAATGAACGG ATTAGCATCA CAGAATTTAA 24 00 

CAATTAACAT TGCGTTTCCA AGAGTTATTG ACGTTGCAAT GGGTATTGTG ATTGCAATCA 24 60 

TAGGTTTATT TGTCCTTGGA CAACGTACCO CATCCGCATT GCTTCCTAAT GTAATGGCTG 2520 

AAGTTGTTCG TAAAGAAGCA ACGCTCTTTC ATTATTTATT TTCTGAAAAT CAATAT 2576 
(2) INFORMATION FOR SEQ ID NO: 309: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 663 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 309: 
CAAGCACATT ACGATGGTCT AATACTTTGC TAATAATTTC TTCTTGTCCT GGTCGAAACG 60 
TTTCATACCC AAAGTAATGC GATAATGTTT GTTGCATCAT AAATTGACCC CTTATTGTTG 120 
TTCTTTAATT TCTTCTAACT CACTCCATCT TGTGATGTCT AAATCATATT GAATTTCAAG 180 
TTGTTCTTTT TCTTCGTTTA ATTCTTTAAT TTTCCCATAA TCTGCACTTG CCTCAATCAT 240 
GAGCACATCA ATTTCTTCCA TTCTTACTTC CGCTTGTTCT ATGCGTTTCA TCAATTGTTC 300 
ATATTCTAAT TTTTCTTTAT ATGATAAACC ATTTTTCTTA CGTACAGTTG TAGAAGATTT 360 
AGATTGTTGC TTCAATGTGG ATTTATTTTT ATCTAATGAT TTTTTATAAC TTTCATAATC 420 
■^-'?iA•^'-^ --^^r^r-r^ycm ACCATCATGA ATAAACCAAT ATGACTGTGC 4 80 
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AATATAATCT TCAAGTATTG TTAAAGTCTC AGTATCTAAA TCATTTGTCG GTTCATCTAA 600 
CAACAGAACA TTTGGCTGGT GTACGAGTAG ACGTAATAAA TACAAACGCT TTTGCTCTCC 660 

ACCAGATA 663 
(2) INFORMATION FOR SEQ ID NO: 310: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12173 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 310: 
CATCAGTTTA TTTTGAAAGG CAATGCGATC ATTTCATGTA TTTATGTTGT TTGAAACATC 60 
GTTAGATAAC AATAGTGATA TTGCACATTT TAAGCTGAAG ATGGAAAAAT ATTCGATAAT 120 
CAAATAAACA ATGAATTTTA GAAGGTACAA TGACGTTCAC TAATTTAAAT ATAGCTGAAT 180 
GTGTTGGTGA GTGATGTTCA CTATAGATTT ATATTAATAT ACAAAGACAA AGGTTGTTAA 240 

TTTTTATTAA GCGTTAGCTT GAATGTATGA GAATTTTAGA TTTATAATAG AAGATAGAAA 3 00 

CGAAAATTTT TCTTAAAAGC AGTAATGTTG ACTCAAAATA AGCTATAATA ATGACACTTA 3 60 

TTTAATTGAT TAACATTTGC TAATAAATAT CAATATAGAA TATAACTTTC CAATAATGAC 420 

TGAGAAAATC GAAATGTCAG TCTCGAATCA TATAATTAGA AAATTGATTA TTTTCTGTCA 4 80 

ATTTAGGGTT GAACTATACA TATGATATTG TTAGAATATT TTTTAACATT ATATTTTATT 540 
GCTTTAAAGT GGAATATACT TGAAATAATT AGTAGAGGTG AGTAAGGATG AGTAATAAAT 600 

TAGAATCATA CAGAAGTGAG ATTGTATCAC TGAATCATCA AATTTTAGAC TTATTATCTA 660 
AACQTGGTGA ACTAGCACAA AAAATTGGGG AAGAAAAATT AAAACAAGGT ACACGTATTT 72 0 

ATGATCCACA ACGTGAAAAA GAAATGCTTA ACGACTTAAT CGATAGTAAC AAAGGACCAT 78 0 

TCAACGATAA TACTATTAAG CAATTATTTA AAGAAATTTT CAAAGCCTCT ACAGATTTAC 84 0 

AAAAATCTGA AAATGAAAAA CATTTATATG TATCACGTAA GTTGAAACCT GAAGATACGA 900 

TTGTAACATT TGATAATGGG GGCATTATAG GAGACGGCAA TAAATCATTT GTATTTGGGC 960 

CATGTTCAGT TGAATCATTT GAACAAGTTG AAGCTGTTGC TAAAAACTTA CATGCTAAAG 102 0 

GTGAAAAATT TATTCGTGGC GGTGCATTTA AACCACGTAC ATCACCATAT GATTTCCAAG 1080 

GCCTAGGTGT TGAAGGACTT AAAATACTTA AACAGATTAA AGATAAATAT GATTTAAATG 114 0 

TTGTCAGCGA AATCGTAAAT CCAAATGATT TTGAAGTGGC TGATGAGTAT TTAGACGTAT 1200 
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10 






CTGAATACAT 
CTTATGAAAA 
GTACACACTT 
TACCAACTGC 
AGATCCATCT 
ATTTTATGAT 
TTCCAAGGAA 
GAATCGACAA 
ATTTTTTGTA 
TGTTCAATTT 
GAATTATCTG 
ACAAATGCAA 
GTTTATATAA 
TTTTCGCTAA 
GAAAATGAAA 
ACAGTTACTA 
GTTGTTAATG 
AAGCGTTTGA 
ACAGTAGGTG 
CTTGAAGATA 
CCTGAAAAGG 
TTCCTTGGTG 
GTAGTAGTAT 
ACTGAAGCTG 
TTAGTAGGTG 
GTGTTAAATA 
TATAAAGAAG 



TCTATTAAAA 
TGCTTCACAA 
GGCGACACGT 
ACCAGTCATG 
GAArsAgCAT 
GTTGCACTTA 
GAATTAAAGC 
ACTATAGACT 
TTTTAAACTT 
TAGAAATAAA 
CACAATTATT 
TAAAATATAA 
TTGATATTTG 
AATTAAAATA 
ATTTTTCCAA 
TGAAAACATG 
TATATGATGT 
GGAACCAAAA 
ATTATCGTCC 
TGATCATTCC 
TTGCAACAAT 
AAAAAGAAAT 
GTACAATTAC 
CAGGAACAAA 
CGAAAGAAAT 
GAGAACATTC 
AAAATGGCCT 
GCGTAAAAGC 



CGTGGTTTAT 
GGTAATCAAA 
AACACTTTAG 
GTAGATGTTA 
TAGCAGTTGG 
GTGATGCGGG 
CTTTAGCTGA 
ACTTAACTAA 
ACAGCCATTC 
GGTATACTGC 
AAAGATTATG 
TAGATTCACA 
AAAAGGCTTT 
TATCGTTCGT 
ACAAAAATAT 
TTATTATAAT 
AGCAAGAGAA 
TGTTAAAGCA 
AAATGCTGTT 
AGATATATCT 
GTATAAATAT 
TTTTAATAAC 
TGAAGAAATG 
TGGTAAGGAT 
TACGGGAGAA 
TAAAAAAGCT 
TCAATTAGGT 
XTTTGCCAAA 



CTGCTACAAT 
ACATTATTTT 
ATATTTCAGC 
CGCATAGTAC 
TGCTGATGGA 
TCAACAAATG 
TTTATATAAC 
TATGTCATGT 
TAAGAGTATA 
ACTTAACGAT 
ACTGATAGCA 
CATTTGTTGC 
CTCAAAACAT 
TATCATTTAG 
GTTACTGTAA 
GAATAAAACG 
GCGCGTGTCT 
GAAACTAAAA 
GCTAGAGGTT 
AATATCTATT 
CACTCAATTA 
TTATTAAGTA 
AAAGAATTGA 
GCACATATAG 
TTAATTGAAA 
CAAGAAGATG 
GATACATTGA 
ATGAAAGGCA 
^.n-TA-r^ ft^GCATAGTH 



CGAAGAGTTT 
ATGTGAACGT 
AGTACCAATT 
AGGTCGTAAA 
GTTATGGCTG 
GATTTAGATG 
GCTAAAAAGT 
TGAAGTAGAA 
TTACTITTAG 
TGTTATAATA 
GTAATTAAAA 
TGAAATGTGA 
TACAATTAAA 
CGTTTGTTTT 
ATTAAAATAT 
TTTACAAGGA 
CTATGGCCAC 
ATAAAGTTAA 
TAGCTAGTAA 
ATTCACAACT 
TTTCAAATTC 
AACAGGTTGA 
TAAATCAATC 
CATCAGTTAA 
AAGGCGCTAA 
TTTTAGAAGG 
ATTGTTCTGG 
ATTTGCCAGA 
CAATGGATGC 



GTTTATGCAG 
GGAATCCGAA 
TTAAAACAAG 

GATATCATGT 



AGGTGCATC\- 
AATTCCAAGC 
TAAAATAATA 
TATTATCTTT 
AGTGGCTATT 
CTTCGACACT 
TTATAACTAT 
ACATTTTTCA 
AATGAAAAAA 
TATTTCAAGC 
GGTAAACTAT 
GGAAATTATG 
AGTGTCGCGT 
CGAAGTCATT 
AAAGACAACA 
TGCTCGTGGA 
AGATAACGAT 
TGGTATTATT 
ATCTGTACCT 
TATTGATTTT 
ATCATTTGCT 
TTTAACTGAA 
TGCTGAAAGT 
TGCCATTTTA 
TGGTATTAAA 



1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 
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CCACAACTTT CTAGTGTTAT TCAACCATTA TATGATATCG GTGCAGTAGG GATGCGCTTA 312 0 

TTAACAAAAT ATATGAACGA TGAAAAGATA GAAGAACCAA ATGTAGTTTT ACCTCACAGA 3180 

ATTGAATACC GAGGAACTAC AAAATAAATT CACAAAATTA GGCATTCATC TAACGACCCA 324 0 

AATTATATGG GTGTTGGAAG AATGCCTTTT ATTTATCTTT TAAAATCGTT GCAGATTAGG 3 3 00 

TTACTTATTG ACGAGTAGAT TCGTACCAAC TCGCTATATG TAAAGCTAAT TTTTTATTTT 3 3 60 

TTTCACTAAT TTCTTTTGTG CGGGGGACAT AGGTATAATC ATTTAAACGA TCTTCCCATC 34 2 0 

TTTTAGGTAA TAATTCAGAT GAATAATGTT TCCATTTATT AATCCATTCT AACGGTAAAT 34 8 0 

AACCACTTTG AATTGGTTGA TCAATTAAAC TTAAGAATAC ATGACTCCAT GCACGTGGTA 3 54 0 

CGACTCTCCA AATATTGTAG CCTCCGCCAC CAAACATAAT TACCTTTCCA TTCGTATAAG 3 600 

AATCAGCTAA ATATTTTACA AAATATGGAA TTTCATATAA TGAATGTAAC GTACAATTTA 3 660 

GATGAGTTAG TGGATCACGA TAATGTATAT CGACACCATT TACGCTTAGA ATAATATCAG 3720 

GTTTAAAACT CTTTACGACA GGCTCAACTG TTAATTTAAA ACACTCCAAA AATGATGCAT 3780 

CTTCTGTATA CGGTTCAAGT GGGACATTTA CAGTGTGTCC ATAGCCGATA TCTTCACCGC 3 84 0 

GCTCAGTATA GTGACCAGAG CCTGGGAAAA GAAATTTTCC GGTTTCATGG ATAGAATAAG 3 900 

TAGTAACATG GTTATCGGCA TAGAAACTCC ATTGTGTACC ATCTCCATGA TGTGCATCGG 3 96 0 

TATCTATGAT TAAAACGCGT TGATTGTATT CTTTAGCTAA GTATTGTGCG GTAATTGCAA 4 020 

TATCATTGTA TATACAAAAA CCACTTGCTC GACCAGGTTG AGCGTGATGC AAACCACCAC 4 080 

CTAAGTGACA ACCATTTAAT ACTTTGCCTG ACATAATAAG ATCTGCTAAA GTTAAAGCGC 414 0 

CTCCAACAAT TGTGGCACTA TGGCGGTGCA TATGCTTAAA TTGACCATTC TCTTCATCAT 42 0 0 

TTAATCCATA TTTCTTAGCC TCATCTTCAC TGATAATGCC ATGTGAAGCA TGCTTAATAG 426 0 

CTTCGACGTA ATCATATTTA TGAATTAACA TTAATTCGTC ATCTGTTGCA ATTCTAGGTT 4 320 

GTACTATTTG TTCTGGAGAC AATAAATTTG CATTCAAAAG TAGCTCTGTT GTTAATTTTA 4380 

AACGCATTTG ATTGAAGGGA TGTTGGTCAT GAAATCGATA TTGTAATAAC TTATCTGAAT 4440 

AAACATATGC AGTTTTTGAT GAATGTTGTT GCATATAATC CCTCCGATAT TCCAAAAATT 4500 

AAAAGAAAAA CCGATTCATA TAACGAATAT CATCAAACGC TTGTTGCTGT TCTAATGTAA 4 56 0 

TGTTTTTGCC AATTCTTGCC ATTAAACAAT TAGCTGGATG ACTTGTTATT TCTGGATCAT 4620 

CTGTAGCGAA TATTTCAAGT CCACCAGTTG CCATTAACCG CTGCATTAAT TTTTTATAGT 46 80 

CAAATACATC TAACTTTGAA TTTTTTAAAT CCCAATGCCA GTAATATTCT GTAGTTATAA 4740 

CGATATAATT CTCGAATTCT GGTGTAGAAA GGCTAAGTTG TATCAGCTTT TCTGCAAGTT 4 8 00 
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TACCTGTAGA CCAGCGCTCA ATTTCATCAG GATAGTGGAA AGTGACATAA CCCACAATGA 4 920 

GTTGATCTTG GCGAGCAACA TAAATTCTAC CTTCTGGTAA TGTTGTAATT TCTAACAAAG 4 980 

^ CTTTATACTG ATCTTCAGCA TCTCTAAATG CXWrTAAATG CGCATCGAAA GTAAGCGCTT 504 0 

TCAAATCTtC GTGTGTTAAA GGACCTTCAA TAACAAATTG CTTGTCATGA ATGTAATAAT 510 0 

CTTCGGATTG ATACGTCTTT AAATGATTCA TATTTTCAAC TCCTCAATCG ACGTCGTGAT 5160 

TGTATTAAAT TCATTATATA GAAaATTTAC AATAATTAAT ACTAGAAAAA GGATAAAAGT 522 0 

AAAAATTTTG AATAATTAGA AATGTTATGT ATAATATTGA GAAAGAAAGC GTTTTCACAT 5280 

AACAAAGGGG GAGTTTCAAA TGAAAGTCGA AGTTTATAAA GGAGCGCAAG GTAAACATAA 5340 

CCTTAAAGAT TATGAAGAAA CATATAATAC TTTTGATTGG AAAGACGTAG AACAAGCATT 54 00 

TTCTTGGAGT GAAACTGGAA AAATGAACAT GGCATATGAA TGCATAGATC GCCATGTAGA 5460 

TCAAGGATTA GGGGATAAAA TAGCGTTAAA TTACAAAGAT GAGCACAGAA AAGAATCGTA 5520 

TACTTATAAA GATATGCAAC GGTTATCTAA TAAAGCAGCG AATGTTTTGT CTGAACATGC 55 8 0 

AGAAGTTGAC AAAGGTGACA GAGTATTTAT ATTTATGTCG CGTACACCTG AACTATATTT 5640 

TGCGTTGTTA GGTGTTTTAA AAATTGGTGC AATTGTTGGG CCGTTATTTG AAGCATTTAT 5700 

GGAAAAGGCA GTTGCGGATA GATTAGAGAA CAGTGAAGCT AAAGTGTTAA TTACTAATAA 576 0 

GGCATTGTTA CCTCGAGTAC CTGTAGATAA ATTACCAAAC TTGAAAAAAA TTGTTGTCGT 5820 

AGATGAGGAT GTAGAAGACA ATTACATAGA CTTCATTAGT TTGATGGAAA CTGCTAGCGA 58 8 0 

TGAATTTGAC ATTGAATGGT TAAAGTCGGA TGATGGTTTG ATTTTACATT ATACATCAGG 594 0 

TTCTACTGGG CAACCTAAAG GtGTATTGCA TGTTCAACAA GCAATGTTAG TGCACTATAT 6000 

TTCTGGAAAA TATGTATTAG ATTTACAAGA AGATGATGTT TATTGGTGTA CAGCAGATCC 606 0 

AGGTTGGGTT aCAGGAACAT CTTATGOTAT TTTTGCACCA TGGTTAAATG GCGCTACAAA 6120 

TTGTATAGCT GGTGGTCGCT TTTCGCCAGA ACAGTGGTAT AGTATGATTG AAGATTTTAA 6180 

AGTGACGATT TGGTATACGG CACCAACAGC TTTAAGAATG TTAATGAGTG CTGGTGACGA 624 0 

TATTGTTGAG AAATATGACT TGTCATCGTT ACGTTCGATT CTATCAGTAG GTGAGCCTTT 63 0 0 

AAATCCTGAA GTTATAAAAT GGGCGAAAAA AGTATACGGT TTAACGGTGT TAGATACTTG 636 0 

GTGGATGACA GAAACAGGTG GACATATGAT TGTTAACTAT CCAACGATGG ACGTcAAGCT 64 2 0 

TGGCTCAATG GGCAAACCAT TACCTGGTAT TCAAGCTGCA ATTATCGATG ATGCAGGGAA 64 80 

TGAATTACCA CCAAATCGAA TGGGCAACCT TGCTATAAAA AAAGGCTGGC CATCAATGAT 654 0 

•■- ' '--r.^^-r. -Arvn-'^A'^AT^ '^TTATTr/^AG ArTGGTATGT 6600 
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TGATGTAATT ATGACAGCTG GTGAACGAGT TGGACCATTT GAGGTTGAGT CTAAATTGGT 6 720 

TGAACACGAA GCAGTTGCCG AAGCAGGAAT TATTGGTAAA CCTGATCCGG TTCGCGGTGA 6780 

AATAATTAAG GCGTTTGTTG CACTGAGAAA AGGATATGAA CCAACAGACG AATTAAAAGA 6840 

AGAAATTCGT ATATTTGTTA AAGAAGGTTT GTCGGCACAT GCAGCACCAC GTGAAATCGA 6 90 0 

ATTTAAAGAT AAATTACCTA AAACACGGTC AGGTAAAATT ATGAGACGTG TATTAAAAGC 6 960 

TTGGGAATTA AATTTAGATG CTGGGGATTT AAGTACAATG GAATAATGAC ATGAATGTTA 702 0 

TTGAAGATTT TTTTCGAAGA ATAAAGGGTG ACAACATATT TCATGTCAAT GTTTAAATAA 70 8 0 

TCGTTTACTT TACGATAAGC AATATAAAGA ACTGTTAACT TGTGTCATAT CATTTCGTAG 7140 

AAAGCATTTG AAAATGATGA CATAACAATA ATGGCATATC TTTATATTGC TTTTTATTTT 720 0 

TAATATGATC TTTGGAAGAT GATTATTTTA AATAATAGAA AAATATAGTT ATCAATAGTA 7260 

TCAAGCGCTA AAAGTTGTAT AATACAAAAC TTTAATAAGT GAATTTATTG CAAAAATGAA 7320 

AGCGCTAACC CGATTTAGTC GACAAGTTTT TAACAGTTCG TTATTATATG AATGTAAGTA 7380 

AAAATTTCTT AGCTACAACT TACATATTAT AAATGCATAA ATTAAACAAA AAGGGGCGAA 74 4 0 

AAAAGTTGAC TCATTTATCA GATTTAGATA TTGCGAATCA ATCAACACTA CAACCAATTA 7500 

AGGATATTGC TGCATCAGTA GGTATTTCAG AGGATGCATT AGAACCTTAT GGTCATTACA 7560 

AAGCTAAAAT CGACATTAAT AAAATTACGC CAAGAGAAAA CAAAGGGAAA GTTGTTTTAG 7620 

TAACTGCGAT GAGCCCAACA CCAGCTGGTG AAGGTAAATC AACGGTTACA GTTGGTTTAG 76 80 

CTGATGCATT CCATGAGTTA AATAAAAACG TTATGGTTGC ATTAAGAGAG CCTGCTTTAG 774 0 

GACCAACATT TGGTATCAAA GGTGGTGCGA CTGGTGGTGG TTATGCGCAA GTCTTACCTA 7800 

TGGAAGATAT CAACTTACAT TTCAACGGAG ATTTCCATGC GATTACAACT GCAAATAATG 7860 

CAT'i;?iTCTGC GTTTATCGAT AATCATATTC ACCAAGGTAA CGAATTAGGA ATCGATCAAA 7920 

GACGTATTGA GTGGAAACGT GTATTAGATA TGAATGATCG TGCACTTAGA CATGTAAACG 7980 

TTGGGTTAGG TGGACCTACA AATGGTGTAC CACGTGAAGA TGGCTTTAAT ATTACAGTAG 8040 

CGTCTGAAAT TATGGCGATT TTATGTTTAA GTAGAAGTAT TAAAGACTTA AAAGATAAAA 8100 

TTAGTCGTAT TACTATTGGT TACACTAGAG ATCGCAAGCC AGTTACAGTT GCAGATTTAA 8160 

AAGTGGAAGG TGCACTTGCA ATGATTTTAA AAGATGCAAT AAAACCAAAC TTAGTACAAT 8220 

CAATTGAAGG GACACCTGCA TTAGTTCATG GTGGACCATT TGCGAATATC GCACACGGTT 82 8 0 

GTAACTCAAT TTTAGC7VACT GAAACAGCAC GTGATTTAGC TGATATCGTT GTAACGGAAG 834 0 

CTGGATTTGG TTCAGACTTA GGCGCTGAAA AATTCATGGA CATTAAAGCG CGTGAAGCAG 84 0 0 
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w 



GTGTAGCGAA AGATAATTTA AAAGAAGAAA ATGTAGAAGC AGTAAAAGCA GGAATTGTTA 8520 

ATTTAGAGCG TCATGTTAAT AATATTAAAA AATTCGGTGT AGAACCGGTT GTTGCAATTA 85 80 

ATGCATTTAT ACATGATACC GATGCAGAAG TAGAATATGT AAAATCTTGG GCTAAAGAAA 864 0 

ATAACGTACG AATTGCCTTA ACTGAAGTTT GGGAAAAAGO TGGTAAAGGT GGCGTTGACT 87 00 

TAGCAAATGA AGTATTAGAA GTCATTGATC AACCTAATTC ATTTAAACCT TTATATGAAT 8760 

TAGAATTACC ATTAGAGCAA AAGATTGAAA AGATTGTGAC TGAAATCTAT GGCGGTTCAA 8820 

AAGTAACGTT TAGCAGTAAA GCGCAAAAAC AATTAAAACA ATTTAAAGAA AATGGTTGGG 8880 

ATAATTACCC AGTATGTATG GCGAAAACAC AATATTCATT CTCAGATGAT CAAACGTTGT 8940 

TAGGTGCACC ATCAGGATTT GAAATTACAA TTCGTGAATT AGAAGCGAAA ACAGGTGCAG 9000 

GATTTATCGT AGCGTTGACA GGTGCAATCA TGACTATGCC TGGTTTACCT AAAAAACCAG 9060 

CAGCATTAAA CATGGATGTT ACTGATGATG GTCATGCAAT TGGGTTATTC TAATAAATCA 9120 

TGTCAATTGT TTAATAAAGA TAAGTAAATA GTTTAATAGA CCGGACTGTT GGAGATGCAT 9180 

TATTTCAGCA GTTCGGTTTT TTGCTGTGCT AAAAATAGAT TCAATTTGGC GAATCTAACG 924 0 

ACAATGTTTG AAGGTGGTTA ATTAATGTAT ATGAAGATAA AAAGTGGGCT TGAAGAATAG 93 00 

GAAAGCGATG CAATGAATAT TCCATATTAA AAAAAATTAA TAAAATAGGT TGCAATATTT 93 60 

AATTGGGATG CGCTACAATT AACACTAATA ATTGATATTG ATAATTATTA TCAATTAAAT 94 20 

ATAATCTTAT AGGAGTTGTT AACAACATGA ACAAACATCA CCCAAAATTA AGGTCTTTCT 94 80 

ATTCTATTAG AAAATCAACT CTAGGCGTTG CATCGGTCAT TGTCAGTACA CTATTTTTAA 954 0 

TTACTTCTCA ACATCAAGCA CAAGCAGCAG AAAATACAAA TACTTCAGAT AAAATCTCGG 96 00 

AAAATCAAAA TAATAATGCA ACTACAACTC AGCCACCTAA GGATACAAAT CAAACACAAC 9660 

CTGctACGCA ACCAGCAAAC ACTGCGAAAA ACTATCCTGC AGCGGATGAA TCACTTAAAG 9720 

ATGCAATTAA AGATCCTGCA TTAGAAAATA AAGAACATGA TATAGGTCCA AGAGAACAAG 9780 

TCAATTTCCA GTTATTAGAT AAAAACAATG AAACGCAGTA CTATCACTTT TTCAGCATCA 9840 

AAGATCCAGC AGATGTGTAT TACACTAAAA AGAAAGCAGA AGTTGAATTA GACATCAATA 9900 

CTGCTTCAAC ATGGAAGAAG TTTGAAGTCT ATGAAAACAA TCAAAAATTG CCAGTGAGAC 9960 

TTGTATCATA TAGTCCTGTA CCAGAAGACC ATGCCTATAT TCGATTCCCA GTTTCAGATG 10020 

GCACACAAGA ATTGAAAATT GTTTCTTCGA CTCAAATTGA TGATGGAGAA GAAACAAATT 10080 

ATGATTATAC TAAATTAGTA TTTGCTAAAC CTATTTATAA CGATCCTTCA CTTGTAAAAT 1014 0 

... ^^^^^^^^^^ ^^^^^^^^ AGTAATCAAA 10200 
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AGGCAACGAC CAATATGAGT CAACCTGCAC AACCAAAATC GTCAACGAAT GCAGATCAAG 10320 

CGTCAAGCCA ACCAGCTCAT GAAACAAATT CTAATGGTAA TACTAACGAT AAAACGAATG 103 80 

AGTCAAGTAA TCAGTCGGAT GTTAATCAAC AGTATCCACC AGCAGATGAA TCACTACAAG 104 4 0 

ATGCAATTAA AAACCCGGCT ATCATCGATA AaGAACATAC AGCTGATAAT TGGCGACCAA 10500 

TTGATTTTCA AATGAAAAAT GATAAAGGTG AAAGACAGTT CTATCATTAT GCTAGTACTG 10 560 

TTGAACCAGC AACTGTCATT TTTACMJkPJ^ CAGGACCAAT AATTGAATTA GGTTTAAAGA 10 620 

CAGCTTCAAC ATGGAAGAAA TTTGAAGTTT ATGAAGGTGA CAAAAAGTTA CCAGTCGAAT 106 8 0 

TAGTATCATA TGATTCTGAT AAAGATTATG CCTATATTCG TTTCCCAGTA TCTAATGGTA 10740 

CGAGAGAAGT TAAAATTGTG TCATCTATTG AATATGGTGA GAACATCCAT GAAGACTATG 108 0 0 

ATTATACGCT AATGGTCTTT GCACAGCCTA TTACTAATAA CCCAGACGAC TATGTGGATG 10 8 60 

20 PJ^QPAAQATA CAATTTACAA AAATTATTAG CTCCGTATCA CAAAGCTAAA ACGTTAGAAA 10920 

GACAAGTTTA TGAATTAGAA AAATTACAAG AGAAATTGCC AGAAAAATAT AAGGCGGAAT 10980 

ATAPiPAPiGPiA ATTAGATCAA ACTAGAGTAG AGTTAGCTGA TCAAGTTAAA TCAGCAGTGA 1104 0 

CGGAATTTGA AAATGTtACA CCTACAAATG ATCAATTAAC AGATTTACAA GAAGCGCATT 11100 

TTGTTGTTTT TGAAAGTGAA GAAAATAGTG AGTCAGTTAT GGACGGCTTT GTTGAACATC 11160 

CATTCTATAC AGCAACTTTA AATGGTCAAA AATATGTAGT GATGAAAACA AAGGATGACA 11220 

GTTACTGGAA AGATTTAATT GTAGAAGGTA AACGTGTCAC TACTGTTTCT AAAGATCCTA 11280 

AAAATAATTC TAGAACGCTG ATTTTCCCAT ATATACCTGA CAAAGCAGTT TACAATGCGA 11340 

TTGTTAAAGT CGTTGTGGCA AACATTGGTT ATGAAGGTGA ATATCATGTC AGAATTATAA 114 00 

ATCAGGATAT CAATACAAAA GATGATGATA CATCACAAAA TAACACGAGT GAACCGCTAA 1146 0 

ATGTACAAAC AGGACAAGAA GGTAAGGTTG CTGATACAGA TGTAGCTGAA AATAGCAGCA 11520 

CTGCAACAAA TCCTAAAGAT GCGTCTGATA AAGCAGATGT GATAGAACCA GAGTCTGACG 11580 

TGGTTAAAGA TGCTGATAAT AATATTGATA AAGATGTGCA ACATGATGTT GATCATTTAT 1164 0 

CCGATATGTC GGATAATAAT CACTTCGATA AATATGATTT AAAAGAAATG GATACTCAAA 1170 0 

TTGCCAAAGA TACTGATAGA AATGTGGATA AAGATGCCGA TAATAGCGTT GGTATGTCAT 11760 

CTAATGTCGA TACTGATAAA GACTCTAATA AAAATAAAGA CAAAGTCATA CAGCTGAATC 11820 

ATATTGCCGA TAAAAATAAT CATACTGGAA AAGCAGCAAA GCTTGACGTA GTGAAACAAA 118 80 

ATTATAATAA TACAGACAAA GTTACTGACA AAAAAACAAC TGAACATCTG CCGAGTGATA 11940 

TTCATAAAAC TGTAGATAAA ACAGTGAAAA CAAAAGAAAA AGCCGGCACA CCATCGAAAG 12000 
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CATGGTGGGG CTTATATGCG TTATTAGGTA TGTTAGCTTT ATTCATTCCT AAATTCAGAA 12120 

AAGAATCTAA ATAATTAnCT AAATATAGCA TATGTATGAT TAACTTTGTA GAC 12173 

(2) INFORMATION FOR SEQ ID NO: 311: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1316 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 311: 

CAACATTAAT ATTGATATTA AATCTTCCTG GATAACGTGC TTGTTCGAGT GATAAGTATG 6 0 

CACGCACTTG ACTTAACTCT TTATCTAAAG TAATCGTATG TTGCTTAGAG CCTTGTAAAT 120 

TCGCTCTGAA AAAATAACTC AATTCTAATA GTAACTCTCG TGCCTTTTCG CTATTTATTC 180 

TAACTAAAGC TGAGATCGTG TTAATTGAAT TGAAGAAAAA ATGTGGACTC ACTTGTGCCT 24 0 

GTAATGACTT AATCTCAGCA TCTTTCAATA ACTTACTTTG CGTTTCGGCT TCACCAAGTT 3 00 

CAATTTGGCT ACTAAAAATA TTTGCCAATC CTTCTGCAAG TTGACGTTCC ACAAAAGTTA 360 

AATCATTAGG GTTTGTAAAA TACATCTTCA ATGTACCGAC GATAGAACCA TGCATCTCAA 420 

GTGGTATCAC GATAGCTGCT CTAAGCGGGC AATTCGGATG ACTACAACCA ATCTCTTCTT 4 80 

TAGTATGAAC TTCTTTCAAC TTTCCTGATT TCAATACATC TTTAGACAGA CTTGTTAATA 54 0 

TTTCATTTGT TGGTATGTGA TGATCACTAC CTGCACCTAC ATGCGATAAG ATTTCATTTT 600 

TGCTTGTAAT TGCTACGGCA GATACTTTCA TTAAATTTTT AATAATCATC GCAATTTGCT 660 

GTGCCGATTC TCTATTCAAT CCTTCTTTAA AATACGGCAA TGTCTGGTTC ATCAATTGCA 720 

GTACATCATG TGTTTGAACA GCCTTCATTT GCTCCTCTTG CTTTAATGTT GAAATGATAA 780 

TAGACATAAA AATCGCCGTA CCAACGCTAT TAACAATAAT CATTGGTAGT GCAATTAATG 840 

ATATGAGGTC AACCGCATAT GCTTTGTCGT GGGAAAATGT TAAAATGCTC AACATTTGAA 900 

TCATTTCCAT AACAATTCCA ATCATGGCAC TTTTCGCAAT ACTCGGGTAA CGCTTGCGTC 960 

TTTGAGCTTG TAAGCCAAAA TAACCAGCAA TTATACCAAT AAATATAGAT GAGATAAGAT 1020 

AAACTTGTGC ATCCGCCCCA CCCATATACA CTCTGAAAAT ACCTGAAATA ACGCCAACAA 10 80 

ATAGACCTAC AAAAGGGCCA CCAACTAATC CTGCGACACC TATCGTTAAT ACACGTGTGT 114 0 

TAGCTAAAGA TACATCATCA TCTAAACGGA AGTACACACT TCCTGACAAA CTATGTTGAT 1200 

• ' - "T-n'^r,-.n —TAAGGCAAA rAAACTGAAA ATAATACATA 12 60 
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(2) INFORMATION FOR SEQ ID NO: 312: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 7972 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 312: 

TATAAATATT ATTTTATTAT CGTTTATCGC TATTTGTGTT ACATAATCAA AACCATAAAT 6 0 

TCTTACTCAT TCAGATTTAC CCAATATTTT TACTTTTATA ATGTAATGCG TTTTATCCAA 120 

GTTATTTTTT AAAAATMJKT ATTGAATTnG GGGCTGnTTT CATGTCATTA AGAGATGAAG 180 

CATTGGAAAT GCACAAACGT AATCAAGGTA AATTAGAAGT TAAACCAAAT GTAAAAGTTA 24 0 

CTAATAAAGA GGAATTAAGT TTAGCATACT CACCTGGCGT TGCTGAACCG TGTAAAGATA 300 

TTTATGAAGA TAAAAGAAAA GTATATGATT ACACAATTAA AGGAAATACA GTTGCAGTTA 36 0 

TTACTGATGG AACAGCGGTA TTAGGTTTAG GTAACATTGG ACCTGAAGCA AGTATTCCTG 420 

TAATGGAAGG TAAAGCAGTA TTATTCAAAA GCTTCGCTGG TATCAATGGG GTGCCTATTG 4 80 

CGTTAAATAC AACTGATACC GAAGAAaTCA TTAAAACAGT TAAGTTGTTA GAACCTAATT 54 0 

ATGGTGGTAT TAATTTAGAG GATATTTCGG CACCACGTtG TTTTGAAATT GAAGAACGAT 6 00 

TGAAAAAAGA AACTAATATT CCGGTATTCC ATGACGATCA ACATGGTACA GCAATTGTAA 660 

CATTGGCAGG TTTGGTAAAT GCATTGAGAG TTGTTAACAA AGATATTGCT AAAATAAAAG 720 

TTGTACTAAA TGGTGCTGGT GCAGCAGGAA TAGCCATTGT TAAATTACTA TACGCGTATG 780 

GTGTAAGAAA TATGGTTATG TGTGACTCAA GAGGCGCAAT TTTTGAAGGA CGTTCATATG 840 

GTATGAATCC TACGAAAGAT GTTGTAGCAA AATGGACAAA TAAAGATAAG ATTGAAGGGT 900 

CTTTAGAAGA AGTCGTAAAA GACGCAGATG TATTTATCGG GGTTTCTGTA GCTAATGCGC 960 

TGTCACAAGA TATGGTTAAG AGTATGGCAG ATAATCCAAT TATATTTGCA ATGGCTAATC 1020 

CAAATCCTGA AATAATACCT GATGATGCCA AAGCGGCAGG TGCACGAGTT GTTGGTACAG 1080 

GACGTTCAGA CTATCCTAAC CAAATTAATA ATGTATTAGC TTTCCCTGGT ATTTTTAGAG 114 0 

GTGCATTAGA GGTTGAAGCT ACACATATAA ATGAAGAAAT GAAAAAGGCA GCTGTAGAAG 1200 

CGATTGCTGA TTTAATCGAT AGTTCTGAAT TAAATGAAGA CTACTGTATC CCAGGACCGT 1260 

TTGATAAACG TGTAGCGCCA TCAGTTGCTC GTAATGTTGC TAAAGCGGCA ATGGAATCTG 1320 

GAGTAGCTAG GATTGAAGTT GATCCGCAAG ATGTGTATGA TAAAACAATG AAACTTACAG 1380 
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20 






^rr^^MTGA T0««T3A* AOTTTATCAT «A«TTC« CAGTCAAACC AATAT«ATC 
AAATAAAm AAACCCGTTT TT»ACI«TC AAOTTCAOTT TAAOOCTCTA AATOGTTAOA 
ACAOAGOTTA TTTOGAOOTT TTCCTATXTTT TAAAGATTTT TTTAATCGAA CAAAOAAAAA 
OAAATATCn ACAOTACAAO A<:TCTAAAAA TAATGAIOTC CCTGCA30TA TTAT«CTAA 
CTCTCCAAAO TGTAAOAAAA TTATGTACAC AAAAGAATTA GCTC^AAAATT TAAATG'.^O 
CTTTAATOT =atcat<:a™ TTOCITTAAC IGCGTATAAA CGTATAOAAG CAATITCTOA 
TGAACOATCA TTTACAGAAT TCOATAAOOO AATOACCTCT ■3C0AATCCAT TAGATTTTCC 
^OTTAriTA <:AAAAAA™ AAAA^CCA ACA.AAGACA <^TCTT«««= AAaCAGTTCT 
aACTCOTACA CCACAACTA3 AT«TA^ A™=«TT O^TCATOG ATT=AC=TTT 
TAOAATGOGA AOTATWOAT COGTTATC«= TGAA»*.ATA T<rrC3CATCA ™ATTACTa 
...T^AGAAC COTTTACCAT TTATXCTTtT CTCTGCAAOT <=OTa=TaCAC CTATOCAAGA 
AOCTATTArr TCCrr»T3C AAAT«»TAA AACCA.TOTA TCTTTAAAAC OTCATTCTGA 
CGCTOGACTA TTATATATAT CATATTTAAC ACATCCAACT A<™0«OTO TATCTOCAA. 
™CATCA GTT«TOATA TAAATTTAAO TOA=CCAAAA GCGTTGATAG GTTTOCAGG 
™,TCOAOrr A^AACAOA CAATAAACOA AAAATTGCCA OATOATITCC AAACTGCAGA 
ATrmATTA GAGCAT^AC AATTGOATAA AGtTOTACAT CGTAA^ATA TGCGTCAAAC 
.™,..^..A A^TAAAAA ICCATCAAGA GGTGACTAAA TAATGTTAGA TTTTGAAAAA 
CCACTTTm AAATTCGAAA TAAAAITGAA TCTTTAAAAO AATCTCAAGA TAAAAATOAT 
^OATTTAC AAGW.GAAAT TGACATGCTT GA«=CCTCAT TOGAACGAGA AAGTAAAAAA 
ATATATACAA ATCTAAAAGC ATGGGATCGT GTGGAAATTG CGCOTTTGCA AGAAAGACCT 
ACGACCCTAG ATTaTATTCC ATATATCTIT GATTCGrTTA TGGAACTACA TGGTGATCGT 
AATTTTAGAG ATGATCCAGC AATGATTGGT GGTATTGGCT TTTTAAATGG TOnGCTOTT 
ACAGTT.™ GACAACAACG TGGAAAAGAT ACAAAAOATA ATATTTATCG AAATTTTGGT 
ATGGCGCATC CAGAAGG™ TCGAAAAGCA TTACGVrTAA TGAAAGAAGC TGAAAAATTC 
Trm-ACATT TATAOATACA AAAGGTOCAT ATCCIGGTAA AOCIGCTGAA 
OAACGTGGAC AAAGTGAATC TATCGCAACA AArrrCAITG AGA^XTTC ATTAAAAGXA 
CCA=™™ CGATTGTCAT TGGIX^AAOGT GGCAGIGGAG OTGCTCTAGO TATOGTATT 
GCCAATAAAG TA^GA^rT AGAGAATAGT ACTTACTCTG TTATATGTGC TGAAGGTGCA 

. -r^nrrnAAAC AATGAAAATT 3180 



2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 
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GGTGCACATA AAGATATTGA ACAGCAAGCT TTAGCTATTA AATCAGCGTT TGTTGCACAG 3300 

TTAGATTCAC TTGAGTCATT ATCACGTGAT GAAATTGCTA ATGATCGCTT TGAAAAATTC 3360 

AGAAATATCG GTTCTTATAT AGAATAATCA ACTTGAGCAT TTTTATGTTA AATCGATACT 3420 

GGGTTTTACC ATAAATTGAA GTACATTAAA ACAATAATTT AATATTTAGA TACTGAATTT 3480 

TAACTAAGAT TAGTAGTCAA AATTGTGGCT ACTAATCTTT TTTTAATTAA GTTAAAATAA 3540 

AATTCAATAT TTAAAACGTT TACATCAATT CAATACATTA GTTTTGATGG AATGACATAT 3600 

CAATTTGTGG TAATTTAGAG TTAAAGATAA ATCAGTTATA GAAAGGTATG TCGTCATGAA 3660 

GAAAATTGCA GTTTTAACTA GTGGTGGAGA TTCACCTGGA ATGAATGCTG CCGTAAGAGC 3720 

AGTTGTTCGT ACAGCAATTT ACAATGAAAT TGAAGTTTAT GGTGTGTATC ATGGTTACCA 3780 

AGGATTGTTA AATGATGATA TTCATAAACT TGAATTAGGA TCAGTTGGGG ATACGATTCA 3840 

GCGTGGAGGT ACATTCTTGT ATTCAGCAAG ATGTCCAGAG TTTAAGGAGC AAGAAGTACG 3900 

TAAAGTTGCA ATCGAAAACT TACGTAAAAG AGGGATTGAG GGCCTTGTAG TTATTGGTGG 3960 

TGACGGTAGT TATCGCGGTG CACAACGCAT CAGTGAGGAA TGTAAAGAAA TTCAAACTAT 4020 

CGGTATTCCT GGTACGATTG ACAATGATAT CAATGGTACT GATTTTACAA TTGGATTTGA 4080 

CACAGCATTA AATACGATTA TTGGCTTAGT CGACAAAATT AGAGATACTG CGTCAAGTCA 4140 

CGCACGAACA TTTATCATTG AAGCAATGGG CCGTGATTGT GGAGATCTAG CATTATGGGC 4200 

TGGATTATCA GTTGGTGCTG AGACAATTGT AGTTCCAGAA GTGAAAACAG ATATTAAAGA 4260 

AATAGCTGAT AAAATTGAAC AAGGTATTAA ACGTGGTAAG AAACACTCAA TCGTTCTTGT 4320 

AGCAGAAGGT TGTATGACTG CGCAAGATTG TCAAAAAGAA TTATCACAAT ACATCAATGT 4380 

TGATAATAGA GTGTCTGTGT TAGGTCACGT TCAACGTGGT GGTAGCCCAA CAGGTGCGGA 4440 

TAGAGTTTTA GCATCACGTT TAGGTGGATA TGCGGTAGAC TTATTAATGC aAGGTGAAAC 4500 

AGCTAAGGGT GTTGGAATTA AGAACAATAA AATTGTAGCA ACATCTTTTG ATGAAATTTT 4560 

TGATGGTAAA GATCATAAAT TTGATTATAG TCTATATGAA CTTGCTAACA AGTTATCTAT 4620 

ATAAGATTTC AGGAGGAATT ATAAAATGAG AAAAACTAAA ATTGTATGTA CAATTGGACC 4680 

AGCTTCAGAA TCAGAAGAAA TGATTGAGAA ATTAATCAAT GCTGGTATGA ACGTTGCACG 4740 

ATTAAACTTT TCACATGGTA GTCATGAAGA GCATAAAGGT AGAATTGATA CAATTCGTAA 4800 

AGTAGCTAAA AGATTAGACA AAATTGtAGC AATTTTATTA GATACAAAAG GTCCAGAAAT 4860 

TCGTACGCAT AATATGAAAG ACGGTATCAT TGAACTTGAA CGTGGCAACG AAGTTATTGT 4920 

TAGCATGAAT GAAGTTGAAG GAACACCTGA AAAGTTCTCA GTAACATATG aAAACTTAAT 4980 
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TAAAGATATT GACCATGCTA AAAAAGAAGT TAAATGTGAT ATTTTAAACT CTGGTGAGCT 5100 

TAAAAACAAA AAAGGTGTTA ACTTACCTGG CGTAAGAGTA AGTTTACCTG GTATTACAGA 5160 

AAAAGATGCT GAAGATATCC GTTTCGGTAT TAAAGAAAAT GTTGACTTCA TTGCAGCAAG 5220 

TTTCGTACGT CGTCCTAGTG ATGTTTTAGA AATTCGTGAA ATTTTAGAAG AACAAAAAGC 52 80 

TAACATTTCA GTATTCCCTA AAATTGAAAA CCAAGAAGGT ATTGATAATA TTGCGGAAAT 534 0 

TCTTGAAGTG TCTGATGGTT TAATGGTTGC ACGTGGTGAC ATGGGTGTTG AAATTCCACC 5400 

TGAAAAAGTA CCAATGGTTC AAAAAGATTT AATCAGACAA TGTAACAAAT TAGGTAAACC 5460

AGTTATTACA GCTACACAAA TGTTAGATTC TATGCAACGT AACCCACGTG CTACACGTGC 5520

AGAAGCTAGT GACGTTGCCA ACGCAATCTA TGATGGTACA GATGCAGTAA TGTTATCTGG 5580 

TGAAACTGCT GCTGGTTTAT ATCCTGAAGA AGCTGTTAAA ACAATGAGAA ATATTGCTGT 5640 

ATCAGCTGAA GCAGCCCAAG ATTACAAAAA GTTATTGTCA GATCGTACTA AATTAGTTGA 5700

AACTTCATTA GTGAATGCTA TCX3GTATTTC GGTTGCACAT ACAGCTTTAA ACTTAAATGT 5760 

TAAAGCAATT GTAGCTGCTA CTGAAAGTGG TTCAACGGCA CGTACTATCT CCAAATATCG 5820 

TCCACATTCA GACATTATTG CGGTGACTCC AAGTGAAGAA ACTGCACGTC AATGTTCAAT 5880 

TGTTTGGGGA GTTCAACCTG TAGTTAAAAA AGGACGTAAG AGTACAGATG CATTGTTAAA 5940 

CAATGCAGTT GCAACAGCTG TTGAAACTGG TAGAGTATCT AATGGTGATT TAATCATTAT 6000

TACTGCTGGT GTACCAACTG GTGAAACTGG AACTACTAAT ATGATGAAAA TCCACCTAGT 6060 

TGGTGACGAA ATTGCTAATG GTCAAGGTAT TGGACGTGGA TCAGTTGTTG GTACTACGTT 6120 

AGTTGCTGAA ACTGTTAAAG ATTTAGAAGG TAAAGATTTA TCTGACAAAG TTATCGTTAC 6180 

TAACTCAATC GATGAAACGT TTGTACCTTA TGTAGAAAAA GCTTTAGGCT TAATTACAGA 6240 
AGAAAATGGT ATTACATCAC CAAGTGCAAT TGTTGGTTTA GAAAAAGGTA TTCCAACAGT 6300 
TGTAGGTGTA GAAAAAGCTG TTAAAAACAT AAGCAATAAC ATGTTAGTTA CGATTGATGC 6360
TGCTCAAGGT AAAATCTTTG AAGGATATGC AAACGTACTA TAATTTATAA AAAAACGTCT 6420
TTCCATTTAT CAACAATGGA AAGGCGTTTT TTGGTTcATC TGGTATTTTA TGACGTAATT 6480
AATAGGTTAT TTGATAATGA TAGTGTATGA ATGGCAATCT ATATAAATGT TTATATCTTT 6540
TATACATGTA CATTATCACC TTCAAACTTT CACTCATATT ACTTTGGAAA TTTATTATAA 6600
AATAGAAGTA TGGATGTATT TCTGAAATGA TACATTATTA AATAGATGAG AAAGTAAAAG 6660 
TTTTGAGCCA AGTACGCAAT TTAATATTAT AAGTTGCATA TAAAACAGGA TGGGACATAA 6720
— '--^'-r-^-T— — aATTAGAA AATATCTTAC TGCTGTTCTC 6780 
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tCTTCGACTG GCACTGCTCC CTCAGGAGTC TCGCCATTAA TACTACGTAT TAACATGTAA 6 90 0 

TTTTACTTTT ACATACTTTA AAAAAATAAA ACACTTTGCC CAACTTGCAC ATAAATGTAA 6960

AATTCAATAA AATGAATTTT CTGTGTTGGG TCCCTTCGTA TAATTTAATA AATACCACTA 7020 

AACTAAATTA ACGAGGTGCC TTATGTATAA AATTTATAAC ATGACCCAAC TTACACTACC 7080

AATAGAAACC TCTGTTAGAA TTCCTCAAAA TGATATTTCG CGATATGTTA ATGAAATTGT 7140

TGAAACGATA CCTGATAGCG AATTCGATGA ATTCAGACAT CATCGTGGCG CAACATCCTA 7200 

TCATCCAAAA ATGATGTTAA AAATCATCTT ATATGCATAT ACTCAATCTG TATTTTCTGG 7260 

TCGTAGAATA GArAAATTAC TTCATGACAG TATTCGAATG ATGTGGTTAG CTCAAGATCA 7320

AACACCTTCT TATAAAACTA TTAATCGTTT TAGAGTGAAT CCTAATACTG ATGCGTTAAT 7380 

TGAATCTTTA TTTATTCAGT TCCATAGTCA ATGTTTAAAG CAAAATCTTA TTGATAATAA 7440

TTCAATTTTT ATTGATGGTA CAAAAGTAGA AGCTAATGCC AATAGATATA CATTTGTGTG 7500

GAAGAAAAGT ATTCAAAATC ACGAATCGAA ATTGAACGAA AATTCAAAAA CATTATATCG 7560 

TGACTTAGTT GAAGAAAAAA TAATACCAGA GATAAAAGAA GATGGAGATA GCGATTTAAC 7620

PiKTAGAAGPiA ATAGATTTAA TTGGTAGTCA TTTAGATAAA GAAATCGAAG ATTTAAATCA 7680

TTCTATTGAG AACGAAGATT GTGCTCAAAT TAGAAAACAG ACCCGTAAAA AAATAACTGA 7740

GATTAAGAAG TTCAAAAAGA AATTTGATGA TTATTCCGAA AGAAAAAATA AATATGAAGA 7800

ACAAAAATCG ATTCTTAAAG ATAGAAATAG TTTTTCTAAA ACTGATCTGA TCATGATGCA 7860

ACTTTTATGA GAATGAAGGA AGACCATATG AAAAATGGCC AACTTAAGCC AGGATACAAT 7920

TTACAAATAG CGACAAATTC TCAAAAATGT TTTATCCTAT GACCTATTTC AA 7972

(2) INFORMATION FOR SEQ ID NO: 313:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3175 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 313: 
ATTTTTTAGT TAATTGTCTT TCTTAAAATA ATTTTAGCTT TCATTAAATT AAACAATTTT 60 
ACAAGCTTGG AACACCAATC AAAATCCTAA GTTCTAAAAT GCAATATTAG TAGTCGTTGA 120 

CTGAATGAAC ATATGCTTAT AATATTTTTT TGCAATGCTA GTCAAGTTGA TTTATGCTCA 180
CAAGGATATG CGATTTATAT TTTCTTACAA CAATGAAAAT GCCTGATACA ATGCGATCCT 240
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AATTAAATTA TTGTTGTTTT AACCATTGTA CTATCGTCC3G AATCATAAAT CCTGTTGGCC 360 

CTTTTGGACC ATTATATGAA GCTTTATTAG TCGTTGCTGG ACCTGCAATA TCGAAATGAA 420

TGTGAGGTGT TTGACCACTA AAATGTGTTA CAAAACTTGC CGCAAATAGC GCTTTACCTT 480

GTCCATTCGT ATGGTTAACT AAATCAGCGA TATCACTGTG TTTAATACTT GCACGTTCGG 540

TTGCAGTAAT CGGTAATTCA AATACCATTT CATCGACTTC AGAACTTATT TGTAATATAT 600

CGTTTAATAT CACTTTACTA TTCGATTCAA ATGCAGCAGC TTTATCATCG CCTAGTGCAa 660 

CAATTGCTGC ACCCGTTAAT GTAGCAAAGT CCATAATCAC ACTAGGCTGA TATTGATTTG 720 

CATAAAACAC AGCATCTGCA AGGACTAATC TACCTTCAGC GTCTGTATTC ATTACTTCTA 780 

CAGTTTCACC ACTTAATGCT GTAAATACAT CATCTGGcTT CATTGATGCT TCATTTATCA 840

TATTTTCAGC ACACGCAAGC ACTCCGACAA TATTTACAGG CAGTTGTAAA CGAcTAGCCG 900

CTTCAATGAT ACCAACGACA TTCGCAGCGC CACACATGTC AAACTTCATT GTAGCCATGC 960

CATTCTTCGT TTTAATACTA TAACCACCAG AATCATACGT TATACCTTTA CCAACTAAGG 1020

CAATTGGTGC TTCATCTTTG TCTTTGCCAT TATATGTGAT GGTTACTAAt CTCGGTTTAT 1080

GCTTACTACC TTTACCTACT GCTTGTAAAA GTCCGAATCC TTCAGAAACT AAAGTGTCAT 1140 

AATCTTTAAC ATCTACTTTG ACCTTTGTAT TTTTAAAATG ATTAACAATA TCTTCTGcAA 1200

ATGTTTGTGG TGtTAATACA TTCGGTGGcA TATTACTAAA GTCTCyTGcC AAATTAATGG 1260

aTTGGcCAAT ACTGATACCC TCATGAATAA AATCTAATTC AATTAATGAT TCACTAATAA 1320

GGkTTAAATT CGTCTTAAAC GGtGCCTTCT kACTTgrTTT ATAATGATCG aATyCATATG 1380

kTGcACGCTC ACTTTGGAAT ACCGCATGCC ATTAATACAT CTGgATAACT GgATCcATAT 1440

TTTGgAnATA AATGaATCCa TAAGTAAATA CGTATCTTCa ATGTGTTCTG ACTTTATGTA 1500

TTGGAAAAGA TGTCCCCaTA mrCAACAT ATCTTGATAA tACGTGTCTT TAAGTTTCCT 1560

AAACCAACTG TAATTAAACG ATATGTTTGA TCTTGTACAT CAAATGCAGT TGTATAAATC 1620

TTCCCAACTT TACTACCAAT AATATGTTGA TGTTTTAGTC TTTCAAGTGA TTCTGTAATA 1680

TCGATATGAT TAAAACTAAT GCGCTCTAAC TGATTTAAAT GTTCTGGTAT ACCAATAATC 1740

AATGTATTTA TTTCGTTGCT TAGTGTGTTA TTTAATTTAA AATTCATAAT GTACCTCCTT 1800

AGATTTGATA TGTTACTCAC TTCAAATTGT ACAACAATAA AGCCCTcAGT GACACTGAAG 1860

GACTTTATAA ATGAATAATT TAATTTTATG TGTTTAGCTG AAACTTTAIT TAAAGCCTAG 1920

AATTTACCTT TTTTGAATGC TAAACCGATA CCACCGATTT TGAATACCGC ACGTGTATCA 1980 

- — .---r- ^r^^^^T-j^f^ CCAGCGATAG GTTTACCAAA TACCATACCT 2040 
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GTTGATTCAC CGTTTAAGAT GCGTTTAATG TTTTTAGCAA CACTTTCACC TTGTTGCATT 216 0 

GCAATTTGTG CTGTAGTTGG TAATGGACGT TCTTCTCCAG CTGGGATAAA CGCTGAACAG 2220

TCACCAATAA CAAAAATGTT GTCGTAACCA TTGATTGTTA AATCTTGCTT TGTAACGATA 2280

CGTCCACGTT TAACGCCTTC AAATGATTCT TCCATTAATT TACTACCACG TACACCAGCT 2340

GCCCATACTG AAGTACCTGC ATTTAATTGT TGTTTTTCAC CATCTACTTC AACTACAAAA 2400

CCTTTTTCGT TACAAGCAAC GATTGGTGTA GCAATTTTAA ATTCAACACC GCGGTCTTCT 2460

AAGTAGCTAA cTGCGTGGTT AACTAATTCT TCTGAGAACA TTGGTAACAT TTTAGGTGCT 2520

GCTTCAACAC AAGTGATTTT AACTTTATTT TGATCCACAC CATATTTGCT ACATAATTCA 2580

GGAATTCTGT CTGTTAATTC ACCTAAGAAT TCAACACCAG TGAATCCAGC ACCACCAACT 2640

AAGATAGATA AATCGTTATC ATCTTTTTCT TTTGATGCTG CATAGTTAGC AAATTTGTCT 2700

TCGATATGAC GTGATAATTC ACGTGCTGTG ATAACATTTT CAATTTGGAA AGCATGATCT 2760

TTCATACCTT CGATGCCGAA TGTTTCACTA ACGAAACCTA ATGCTACTAC TAAAATATCA 2820

AAGTCATAAA TACCTTGATT TGTTTCTACC TTTTTAGCAT CACGGTCAAT TTTTGTTACT 2880

TCTGCtTGAA CAAAGTTCAC TTTGTCTTTC TTCAAGACAC TTTCCACAGG ATATAATACA 2940

TCTTCATAGT TTAGTGTACC TGCTGATGCT TCATGTAACC ATGTTGCTTC ATAGTGATAT 3000

TCATTTTTAT TAATAAGCGT AATTTCTGCT TCTTCTGTTG ATATCGCTTT TTGCAATTTA 3060

GTTACAGTTT GTAAACCTGC ATAACCAGCA CCAAGTACAA GTACTTTTTT ACGATCTTGA 3120

GCCATTTAAT TnCACCTAAG CnTTCATATT TTTTTAACCA AATGCTGATA ATTAC 3175 
(2) INFORMATION FOR SEQ ID NO: 314:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 702 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 314:

CATCATTATT AAAGATTTTC AATCAATACA gAATCACAAT ACGTACGCAT TGTGCACGAT 60 

AAAAATACAG ATGTGTATAT TAACTATGAA CTACAAGAGC AACTAACGAA CAAAGCTTAC 120

ATTGGTGATC ATATTTATGT TGAAGGGATA TGGCTCGAAG TACAAGCTGA TGGTTTAAAT 180

GTATTGAGTC AGAATACAGT GGCATCGTCA TTAATTCGCT TAACACAAGA GATGCCACAT 240

GCACAGGCAG ATGATTACAA TACGTACCAT CGTTCGCCAA GGATTATTCA CCGTGAACCG 300 
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TGGCGTTCCA TTATACCGCC ATTAGTAATG ATTGCTTTAA CTGTTGTCAT CTTTTTAGTG 420 

AGACCAATTG GTATTTATAT TTTAATGATG ATTGGTATGA GTACAGTAAC GATAGTATTT 480 

GGTATTACAA CGTATTTCTC TGAAAAGAAA AAGTATAACA AAGATGTTGA AAAACGAGAG 540

AAAGATTACA AAGCTTATTT GGATAATAAA TCTAAAGAAA TTAATAAAGC GATTAAAGCA 600 

CAACGTTTTA GTTTGAATTA CCATTATCCA ACGGtTGCTG AAATTAAAGA TATCGTTGAA 660

ACGAAAGCAC CAAGAATATA TGAnAAAACC ATCGGCATCA TC 702 
(2) INFORMATION FOR SEQ ID NO: 315: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4121 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 315: 
TATGTTCCGA CAACGAAACA AAGTGTAATT ACAAGAGCAA AGATAACTTT GAATGTTTGT 60

AAACGTCCAT CTTTACCTTC AGTTAAATGC ATGAACATTA ATAATTGArG TCCTGCTTGG 120 

ACGAATGCAA AGCCAAAGAT AATTGTCAAC TTCGCGTGGA ATGTTAATGA CGTGTATAGT 180 

GTTACGTAAA CTGCTAAAAG CGTTAATACG ATAGATGCGA TAAATCCTAC AGTATGTTTC 240

ATTATTGTAC TCATCCGCTA TACACCATCC CTATCATATA TACGGCAGTA AAGATGAAAA 300

CCCAAACAAC ATCTAAGAAG TGCCAGTATA AACTTACTAT AAATAATTTT GGCGCATTAT 360

ATTTGTCTAA TCCGCGTCGT TGGATTTGGA TTAATAAACA AATGGCCCAA ACGATACCTA 420 

GCGATACGTG ACAACCATGC GTTCCTAATA GGATAAAGAA ACTAGACCAG TAAGAACCAA 480

TTGTTGGGTT AACGCCTTCT GATGCATAGT GTGCGAATTC ATAAATTTCG AATCCAACAA 540 

AGACTAAACC TAAAAGTAAC GTAATGATCA TCCAAAACAT CATTAACTTT TGTTTTTCTT 600 

GGCGCATGTA GTAAATAGCA ATACCACATG TGTAAGAACT GAATAATAAT GCAAACGTCA 660

TTATTAAAAC AAGAGGCAAT TCAAATAACT CAGTAGTCAT TTTACCTGCA TAATCGCCAC 720

CATGTTGCAA AGTTAATAGT GTCGCAAATA GGGTACCGAA TAACGCAAAT TCGGCTGTAA 780

TGAAAATCCA AAAGCCAAGC TTATTTAATT CGCCTTCATG TGTGCGTGAA TCAATAGTGT 840

TTGTATCATG ACfCATGACT TACAGCCTCC CTTTCTTTAA TACGCGCTTC TCTTAATCTA 900 

GCTTCAGTTT CTGCAACTTC AGCAGCAGGG ATATGATATC CGTGATCGAT TTGGAAACTG 960 

* - ^'-jv-'^nAA'^ ZVAACAAATTA ATGCTGGAAT TACAGTTTCG 1020 
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10 



TTGTTTGGCA TATGAATGTC TTTGTAATTA TGGTTGTCTA AGTAATGACG ACCATGTTCT 114 0 

TTCATATCAA CAAATGTGTC GTAGTCATTC CAATCTGGTG TAATGGCAAA GTTGTATTTA 12 0 0 

GGTGGAATTG CTGATGCAGT AGTCCACTCT AGAGTACGAC CAAGGCCATC CCAGTTATCT 1260 

CCAGTTGCTT CACGTGGAGA TTTGAAGTGA CTGTATACGA TACTAACAAC AAGGAATAAG 132 0 

AATCCGATTG CCATTAATAA TGCACCGATA GTTGAAATTA AGTTTAATAA GAACCAACCA 13 80 

TCTGATGGCA TATAAGTGTA TAAACGACGT GGCATACCAT CTAATCCAAG AATGAATTGT 14 4 0 

GGTAAGAAAC AAACGTTAAA TCCGATCATG AAGAACCAGA AGCACCATTT GTTTAATGTT 1500 

TCGTTTAATT TGTAACCCAT CATCTTTGGA TACCAGAAGA TTAAACCAGC TAAGCAGGCA 15 60 

AATACAACAC CAGTAACCAA TGTATAGTGG AAGTGAGCTA CTAAGAAGTA CGTATTGTGA 1620 

TATTGATAGT CAGCTGATGC CATTGCTAAC ATTACACCCG TAACACCACC TAAAAGGAAG 168 0 

TTAGGGATAA ATGCTAATGA GAATAGCATT GGTGACTCAA ATGTAATACG TCCTTTATAT 1740

AATGTTAATA ACCAGTTAAA CAATTTCACA CCAGTTGGAA TACCAATCAG CATTGTTGAA 1800 

ATTGAGAAGA ATGAGTTGAT TAACGCACCA TTACCCATTG TGAAGAAATG GTGAACCCAA 18 6 0 

ACTAAGAAAC TAAGGAACGC GATACCGGCA GTTGCCCATA CCATACTTTG ATGTCCGAAT 192 0 

AAACGCTTAC GAGCGAATGT CGGGATAATT TCTGAGTAAA TACCAAATGC TGGAAGGATA 198 0 

ACGATATAAA CTTCAGGGTG CCCCCATACC CAGAAGAAGT TAGCCCAAAG CATTGGCATA 204 0 

CCGCCATGTG CAACTGTGAA GAATGCTGTG TCAAATATTC TATCAGTTGT CATTAATGCT 2100 

AACGCTACTG TTAAAGGAGG GAAAGCAAGA ATAACAATTA ATGTAGTAAT AAATGTTGTT 216 0 

ACTGTAAACA TTGGCATTTG CATAAACTTC ATAGTTGGTG TTTTACATCT TAAAATTGTT 222 0 

ACAAAGAAGT TGATACCTGT AGCTAAGGTA CCAAGCCCTG AAATTTGTAT AGCTATTAAG 2280 

TAATAGTTAA CACCCGGACC AGGACTGAAT TCACCTGCTA GTGGCGCATA GTTTGTCCAA 2340 

CCAGCTGCTG GTGAACCACC AATAATAAAT GACAGGTTGA ATAAAATCAT ACCTGCAAAG 2400

AATAGCCAGA AACTTACGTT GTTTAATACT GGGAATGCAA CATCACGTGC TCCAATTTGT 24 60 

AATGGAACAA CGATATTCCA TAAACCAAAG ATAAATGGCA TTGCCATGAA GATAATCATG 2 520 

ATTACACCAT GTGTACTAAA AATTTCGTTA TAGTGGTTAG ATTCTAAAAA TTTGTTATCA 2 580 

GGTACTGTTA ATTGCGCACG AATAAGTAAC GCATCAATAC CACCACGGAC GAACATTAAT 2 640 

ACGGCACAGA TTAAATACAT AATACCGATT TTCTTATGGT CTACAGATGT GAACCATTCT 2 700 

TTGTAAAGAT ATTTCCATAA TTTAAAGTAA GTAATTACTG CGATTAAACC AATAACTAAG 2 760 

AATGGGGCAC CAATTTGTGC CATTGTAATC ATcCAGTTAC CTTTAACTAG TAATTGATCC 2 820 
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TTGAAATTTT CTTCATTTCT TTCGCATTTT TCGATTCATC TTTCTTGAAC TCATTGTTAT 294 0 

ATGGTTCGTC ATTTCCAAGA ATCATCAACT TCATACCATG TCGTTTATAG TTCGCATTTG 30 00 

TAATTTGAGC TTTACGAGCA GGTATTAATG GTTTGTCTGA TACATCTTTA AACATATTTT 3060

CTTCACTAGT GAAGTTTGGA TCTTTCAATT CGAAATTGAA ACGTTTATAT GCATAGAAGA 312 0 

TGTATTCTGG ATCGGCTGCT GGATCAACAA ACGCCATATG TGTACCATTA AATTCTAAAG 3180 

CTTTATTAGG TGTGCTTGGT AATAATTGTT TATCAAATGT ATCTTGATCT AACGTTTTCT 3240 

TACCTTTAAC TTCTTTCACC CATTTGTCGT AGTCTTTTTG ACTAACGGCA TTTACTTTAA 33 00 

ATGTTTGACG TGAGAATCCT TCACCATTGA AGTTAGAGTT ACGACCTCTG AACGTACCAG 3 360 

TTTGAGATGC TTCTAACGTC CAATTCATTG TCATGCCAGT CATGGCATAT TTTTGACCAC 3420 

CTAATTGTGG AATCCAGAAA CTTGTCATTG TATCCATAGC TTGAAGCTTA AATACAACAG 34 80 

GACGATCTTT AGGGATTGTT AATGTATTAA CAGTCTCTAT ATGTTCATCT GGATAAGCAA 3 540 

AGAACCATTT GTATCCTGCA CTTACTGCAT ATACAACCAT TGGATCTTTC TCACTCTTCG 3600

GTGGTTTTTC GTAATCGTAT AAAGTTTTAA CTGTAGGAAT AGctAAAGCA GCAACGATTA 3 66 0 

TGATAGGTAT TACAAACCAT ATTGTTTCAA TGATGGCATT ATGGTGCATC TTACCAGATT 3 720 

CGGCATTCTT ATTATAACTA TACTTGTAAA TAAAAATGGC GAACATGCCA AGTACAACGA 3 780 

AACAAATAAC AAGCATGAAG ACGATTGAAT AAAGAATCAA GAACTTCTGA CTACTTGCTA 3 84 0 

CTGGCCCTTT TGCGTTGAAA ATTTCTATAT TTGAACAACC ACTAAGTAAA ATTAGTGTGC 3900

CAAATAATAG AAGCAAAGAC TTAAATTTTG ACACTTTTTT GACCTCCTAA TACTACAAAT 3960

GTAGGGCTTA ACATTAATTT TAAGTTATTA CACAATATTT ACAAGGGCTT ATGGGAAAAA 402 0 

AATTAATAAA ATTGTATCAA AAATGTTGAT AAATCAAGGT GTGACGTGGG TTCACACATT 4 0 80 

TGTTAAAATT ATGTGTACAT TTTGTGACTA ATAGCGTTTT T 4121 
(2) INFORMATION FOR SEQ ID NO: 316: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 9310 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 316:
CGAGTGAGTA CAmACATATT TTTATTTGCA AGGGGTAAAT GGCATATAAC TATCTTTTTT 60

ATGTAAGCTG GTATAAAATT TATACTAATA GGAGGGATAG TATGAATATA GTAGGGCATC 120 
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TTGGATTACG ATTAGTTGAA AAGTCGGTTA 
TTTATGGGGA CGAAgTAGGT ACAGCCGGAA 
CGGGTCATAA GCAGCCAGGT ACTGAAACGA

AAGCGGCACT TCATTATTTT GAAAAACGTC 
TGTACTATCT TGGACAAGAA GGTGTTGTCT 

TGCTTGTTAA TGATAGTTTT GAAGTACCAC 
TACCTCAAGC ATATCAAATT TTAGGAATAG 
CGCGTACGGT AGAATTTTTG GAAAATGTCT 

TCGATGTGCT GACATTAGCA CCACAAGGTT 
AGGGACAACG TGAAAGACCT GGACGAGGTT 
AAATGAGTGA CTTAGATGCA ATTTACAAGA 

GTATAATTGA TCGCTATTTC TTTAAATCAT 
AATTTGCGAC TGAAGCGCCT GGATTTACTA 

AATTGAACTT GCCTGACTTT TTAGAAGCAG

AAATATAAAG GAGAATGTTT AATGGCCAAA 
TTTGGTTTGT ATTCCTTAGG TGATCATTTA 

TATGAGCAAC GTATTAATGA AATTATTGAA

GATGTTTTTG CAGTTGGTGA AAGTCATCAG 
GTGTTAGGTG CAATTGCCCA AGCGACAAAG 

ATTAGTGCAc AGATCCTGTA AGAGTATTTG 
ATGGTAGAGC CGAAATTGTA GCTGGCAGAG 
GCTATGATTT AAAAGACTAT GATGAATTGT 

TAAATAAAAC TGAGCGTATT ACTTGGTCTG 
AAATATTCCC AAGACCAATC GATAATATAT 
CTGCAAGTGC TATTAAAGCG GGAAAACAAG 

GCCCAGCAAT GAACTTTAAA GGTTCTATAG 
GTTTCGATgc TTCGCCTAAG TCTTTACCAG 
AAACAACTCA GGATGCTATG AGAGAATTTT

TTCGTGGTGT TGGTTATCCG AAACAGCAAT 



ATCAAGACAA TCCTTCAATG TATCATTTGT 24 0 

CAATTTTAAG CTTTTTTGAA ATTCCCAATG 30 0 

TTTATCGATT TTCATTATTA GTACCAAATC 360 

TTGAGAATAA TGGTATTAAG TCTGAACGTT 420 

TTAAAGATGA AGACGACTTA GAAATCATAT 4 80 

ATCAATGGCA ACATAACGCT TATAGTGAAA 54 0 

GGCCAGTCGA ATTAAGAGTT AGAAATGCAG 600 

TAGGTTATCG CAAAAGAGAT AATAAATCAT 660 

TATATTCGGA TTTTGTAGTT ATTGAGCAAC 72 0 

ATATCCATCA TATTGCAGTT AATACACCAC 78 0 

AATTACAACA ACAACCACAA AGTAATTCAG 84 0 

TATACTATCG CCATAATTCA ATTATGTATG 900 

TTGATACACC TGTTGAACAA TTAGGAAGTC 960 

AACGTGAACA AATTGAAAGT AAGTTACACG 102 0 

TTAGAAATGA ATAAAAATAC GCCTCTTGAG 1080 

TTGAATCCAT TGAAAGGTGA AAAAGTTAGT 114 0 

GCAAGTAAAT TAGCAGATGA AGCAGGTATT 120 0 

GAGCATTTTA CAACACAGGC ACATACGGTT 126 0 

CATATTAAAG TTTCAAGTTC TTCAACGATT 13 2 0 

AAGACTTCGC GACATTAGAT TTGATTTCTC 13 80 

CATCAAGAAC AGGTATTTTT GACTTGTTTG 14 4 0 

TTGAAGAAAA ATTAGGTTTA CTTTTAGAGT 1500 

GAAAATATCG TCCAGAACTT AGAAATATGA 1560 

TGCCAATATG GCGTGCTGTT GGTGGTCCAC 1620 

GTGTGCCAAT GATGATTACA ACCCTTGGTG 16 8 0 

ATGCTTATCG TCAAGCGGCA ACTGAAGCAG 174 0 

TAAGTACAGC GAGTCTGTTT TATACAGCTG 180 0 

ATCCACATTT GAATACAGGG ATGTCATTTA 1860 

TTGCTAATTC GTCAGATTAT CGAGAAGCGC 192 0 
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GACTATAAAT 


GTTAGTGATA 


GCCCAATCAT 


AAAATGCTTA 


3720 
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ATTTTTAAAT ATAAATTTGG AATGAATAAT 
GGAAACTTGG ACATCAATCA AAGTAATAGG 
AAAGAAGCTG ACGAAAAGTC agCTTGcAAT

AGAGAAATTG GGTCCGTAAT TTCTACAGAC 
AATTTCGAAA AGAAATTCTA CAAGCAATGC 

CAATGCGTAA CATTAATATG TTATTATAAT 
TGAGGATGTG ATGGTATGTT TGGAATGAAA 
GAAGCTCATG ACACAGAAGC GCTTTTCAAT 

GAATGGTTAC CTTGGGTAGA TGCAACTGAG 
AGAGGACTTT TGCAATTTGC TGATGGTAAT 
ACGCTAGTTG GTGTCATCGO TTTACATGAA 
GGGTACTATT TAGATAAAGA ATTTGAGGGT 
TTGATAAAGT ATTGTTTCGA AGAGCTTGAC 
AATAATGAAA AAAGCCGGGC TATTCCTGAA

CGTGACAATG AATTACTAAA TGGTATTTAT 
TCAGAATACG ACCAAAAATG ACAAATTAGA 
CAGCGCTCTT TTATTTAATT TTTGAAAATA 
ATATAATGAT TTTGATAATT ATTATCAACT 
CAGCCGTTTT AAGTGTAATG TATTATTTTA 

AAGAACACAT CTATGGGGAT AATAGAATTT 
ACAACGCTAT TATTAGCATC AACGTTATTA 
AAGGATGATT CAAAGACATC GAAAAAAGAT 

ACAAAAGCAT ATGATAAATA TACTGATGAA 
AAATTTGTTA AAGCGATTGA AAATAATGAT 
GTTCGTATGT ATTATGAACG CTCTGAACCA 

AAAATTGATG CACGTCTTGC AGATATGAAA 
TATCATAAGA TTGAAAAAGC ATTATACGAA 
GATGCACAAC AATTATTGAA AGATGCAAAA

ATCACACCAA AATTAATGTT ACAAGGTTCT 



AAAGTAGTGA TTAAATTAAG TTGTGTGATA 3 84 0 

CACTACAACG CTTATTGGCG GGGCCCCAAC 3 90 0 

AATGTGCAAG TTGGGGATGG GCCCCAACAT 3 960 

AATGCAAGTT GGCGGGGCCC CAACATAGAG 4 020 

AAGTTGGGGA AGGACAACAA ATTTAAGATA 408 0 

GATAATTTAC AGAATTATAT GAAAAATGAA 4140 

GTGAATGAAC AAATAACATT AAAAATTTTA 420 0 

TTAGTCAATC GTTCAAGAAA TTCACTTAGG 4 26 0 

CAACCATCAG ATACGCGTGC ATTTATTAAA 4 320 

GGATTTCAGT GTGGCATTTG GTATGAAGGA 4 3 80 

ATTAATCACA TGCACAGAAA AACTTCATTA 4440 

CATGGGATTA TGACACAAGC AGTTGAGGCA 4 500 

TTAAACCGAA TTGAGATTAG TGCCGCAGTT 4 560 

AGGCTGGGAT TTACTAGAGA AGGTATGTTA 4 620 

TCATCGAGTT ACATCTATAG TTTATTAAAA 4 680 

CTTACAAAAG AGTGATGACA TTTAAAATGG 4 74 0 

AAAGGTTGTT GACAGTATTA TTTTATAACA 4 8 00 

AGATGATGTT TATGGGAGGA TGCTTTAAAA 4 86 0 

GCGTGTAGGG AATGCGAAAA TAATATTTAT 4 920 

CTATAATGAG GTGTCAAAAT GAAAAAGTTA 4 980 

ATTGCTGCAT GTGGGAACGA CGATAGTAAG 5040 

GATGGTGTTA AAGCAGAATT AAAACAAGCA 5100 

CAGTTAAATG AATTTTTAAA AGGTACAGAA 5160 

ATGGCCCAAG CAAAAGCGTT ATATCCAAAA 522 0 

GTTGCAGAAG CATTTGGAGA TTTAGATCCT 528 0 

GAAGAGAAAA AGGAAAAAGA ATGGTCAGGA 534 0 

GATAAGAAAA TTGATGATGT GACTAAAAAA 54 00 

GAATTGCATG CCAAAGCTGA TACATTAGAT 546 0 

GTTGACCTAT TAAATGAAGT TGCAACTTCT 552 0 
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GTTGAAGGCG CACAAAAAAT TTATGACTTA TTTAAACCTA TTTTAGAGAA AAAAGATAAA 5S40 

AAATTAAGTG ATGATATCCA AATGAACTTC GATAAAGTGA ATCAATTATT GGATAAATAT 5700 

AAAGATAACA ACGGCGGTTA TGAGTCATTT GAAAAAGTAT CGAAGAAAGA CCGTAAAGCA 5760 

TTTGCGGATG CTGTTAATGC ATTAGGAGAG CCACTAAGTA AAATGGCTGT GATTACTGAA 5820 

TGACAAATTA TGAACAAGT-J AACGATAGTA CGCAATTTTC AAGACGTACA TTTTTGAAAA 5 880 

TGTTAGGTAT TGGCGGTGCC GGTGTTGCAA TTGGCGCAAG TGGTGTTGGT AGCATGTGGT 594 0 

CTTTCAAATC AATGTTCAAT ACACCAGAAG ATCCGGAAAA AGATGCGTAT GAATTTTATG 6 000 

GTAAAGTGCA ACCAGGCATT ACCACACCCA CGCAAAAAAC ATGCAATTTC GTTGCGTTAG 606 0 

ATTTGAAGTC AAAAGATAGA GATGCAATTA AGGCAATGTT TAAAAAGTGG ACGGTTATGG 6120 

CTGATCGTAT GATGGATGGT GATACAGTTG GCAAGCCGAG TAACAATCCT TTAATGCCAC 6180 

CAGTAGATAC CGGTGAATCG ATAGGATTAG GTGCAAGCAA GTTAACGATT ACCTTTGGGA 624 0 

TTAGTAAGTC TTTGATGAAG AAAA'rTGGGT TATC7AGTAA AATTCCCai^-T GCrTTTAAAG 6300 

ATTTACCGCA TTTTCCGAAT GATCAGTTAA TAGACGATTA CAGCGATGGT GATATTATGA 636 0 

TTCAAGCATG CTCAAATGAT TCGCAAGTAT CCTTTCATGC GGTTCATAAT TTAGTTCGTC 6420 

CATTTCGAGA TATTGTTAAG GTACGTTGGG CGCAATCTGG TTTTATCTCT GCTAAAGGTA 64 80 

AGGAAACACC TAGAAATTTA ATGGCATTTA AAGATGGAAC AATTAATCCT AGGAAGAATA 654 0 

ATCAACTTAA AGATTATGTG TTTATTGATG ACGGATGGGC GAAACATGGA ACTTATTGCG 6600

TTGTCAGACG TATTCAAATA CACATTGAAA CGTGGGATCG TACTGCGCTG GAAGAACAAG 6660

AGGCTACATT TGGTCGGAAA CGACATAGTG GTGCACCGTT AACAGGTGGG AAAGAGTTTG 6720 

ATGAAATTGA CTTAAAAGCG AAAGATAGTC ATGGCGAGTA TATTATTGAT AAAGATGCCC 6780 

ATACGAGGCT AGCGAAAGAA GCAAATACGT CAATTTTACG TAGAGCCTTT AATTATGTGG 6840 

ATGGTACGGA TGACCGCACA GGTAACTTCG AAACAGGCTT ACTTTTTATT GCTTTTCAAA 6 900 

AAGCGACAAA ACAATTTATC GATATACAAA ATAATTTAGG TAGTAATGAT AAATTAAATG 6960 

AATATATTAC ACATAGAGGT TCTGCTTCAT TTTTAGTATT ACCAGGTGTT AGTAAGGGAG 7 020 

GATACCTTGG TGAAACATTA TTTGACTAAA TTTGTAGCAA TGCTAATAAC TGCTGCTATG 7080 

GTGTGTAGCT TTGGGTTACT GAAAAGTCAG GCAGCAGAAC AACAAAGTAT TAGTGATGTA 714 0 

TATAGTGTGA TAACGGATGC GAAATCTGCA CTTTCTAATA ATTCGATATC GAATGACAAT 7 200 

AAGCAGAAAG CAATTGAGCA AGTGGTAAGT GCAGTTAAGA AATTATCGCT TGAAGATAAT 7260 

AGTGAAAGTA ATGCTGTCAA ATCAGATGTG AGAAAGCTTG AAGATGCAAA AGCGAATGAT 732 0 



GCTAGTAAAG ATGCGGGTTC TAAAATTAAA 
GCTGCGATGA CAAAAGCGAT TAAAGATAAA 
AGTTTGAATC AGATTTGGAC AAGTAATGAA

TATGGACAAA TTGAAGTCGC ATTATTACAA 
GATACGGCAA AAGTGTCACA TGCTTGGACA 
AAAAAAAGTA ATACGTCTGC AAATGATCAA 
GAGAAGGCGA TTAAAGCTAT CGACGACAAT 
CATTTTATAG AAACTTGGCC GTATGTTGAA 

TATACGAAAA TTGAAGATAA AATACCATAT 
GCACATGTGA AAGATGGTTT AGTAGATTTA 
AGTTATAGCT TCGTCGATGT GATGATTATC 

ATTGTAATGA CATTGACTAC CATGACGCGT 
GTGATTGGTG GTGCAATTGC CGGACTTGTA 

GAAACTTTAG GGAATAGTGG CATTCTTCGT

GCGGTCATAT TAATGTTTAT CGTTGGTGTT 
TGGAATGACA TGATTAAAAA TATGTATGCT 

TTAGCGACGA TTGGTTTAAT ATCTGTGTTG

ATGGGGATGA TAGGTGAGCT AGCGACCAAA 
GTTATTTTAA TCATCTTTGC ATTATTATTT 

TATATATTTA GAGTGTTGTC GATCTTTATT 
AGTATTCAAA AGTTACAATT ATTAGGTGCG 
ACGATTAACT GGTTGGGCTT TTATCCAAGT 

ATTATGGTAG TTGCTATCTT AATCTTTAAA 
GTTTTTTTTG TTGCTATATT GGAAATATTC 
TGATTCGTAT AAGGTTAATG TGTTGGCGGT 

ATACTGAACT CAAAGAGCAA TATGAGTGGC 
GGTGGTGCAA TGATACTTGC TAATACAAAG 
TTCAATGATG TGGTATCTAT AAGACCGAAT

TGAAATAGAA GACCAAATGT GAATAACCAA 
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CTATTGCAAC AGCAAGTCGA TGCTAAAGAT 74 4 0 

AATAAAGCGG AATTAGAATC TTTGAACAAT 7500 

ACAGTGATTC GCAATTATGA CGCAAATCAA 7560 

CTTAGAATTG CAATTCATAA GTCACCATTA 7620 

ACTTTTAAAT CAAATATTGA TCATGTCGAT 76 8 0 

TACCATGTAT CACAATTAAA TGATGCGTTA 7740 

CAATTGTCGG ATGCTGcaTg TGCGCTTACA 78 00 

GGTCAAATTC AAACTAAAGA CGGTGCTTTG 7860 

TATCAAAGTG TATTAGACGA ACATAATAAA 7920 

AATAACCAAA TTAAAGAGGT TGTTGGCCAT 79 8 0 

TTTTTACGTG AAGGGCTAGA AGTGTTGTTA 804 0 

AATGTAAAAG ATAAGAAAGG GACTGCAAGT 8100 

CTGAGTATTA TCTTAGCAAT TACGTTTGTA 8160 

GAAAGTATGG AAGCGGGATT AGGTATCGTT 8220 

TGGATGCACA AACGTTCAAA TGCAAAACGT 8 2 80 

AATGCGATTA GTAATGGTAA TTTGGTATTG 8 34 0 

CGTGAAGGTG TCGAGGTTAT CATTTTCTAT 84 00 

GATTTTATTA TTGGTATTGC TTTAGCTATC 84 6 0 

AGATTTATAG TTAAATTAAT ACCTATTTTC 8 52 0 

TTTATTATGG GATTCAAAAT GCTTGGCGTA 858 0 

ATGCCAAGAC ATGTTATTGA AGGATTCCCA 864 0 

TATGAACCAT TGATAGCACA AGGTGCTTAT 8700 

TTTAAAAAAT AAAAAACAGG CCGAGTGCCT 8760 

GGTATTGCAG TATAACGATA ATCACAGCAT 8 820 

TTGCCTCGGC ATGTGAACTT AACGATGAAC 88 80 

AATGTGAGTA ATATATTTAA TGTTAAATCG 8 94 0 

CAAGCGAAAT AAATATATTT ACGrTAATGT 90 00 

TTTGCAAGAC CCATAAATAA TATTGGTAAT 9060 

CGTATGAGTT CAATCAAATA TGCTTTAAAG 9120 
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GGAAAGCCAA CATAAAATGC AAAAGCGACG CCAGCACAGA ATAATAACAC GCTGAAAAAA 924 0 

CTATATTTAT AAATAAATTG ACGTTCATTA TTATGTAATC CAGGTGCAAT GAATGCCCAC 93 00 

AATTGATAAA 9310 
(2) INFORMATION FOR SEQ ID NO: 317: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 3458 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 317:

ATTAGTATTA CACACTAATG TAAATAGATT GGTTGGAGAA GAGATATTTG CTAATAAGTG 60 

CCTTGCTAAT AATGATGTTC AAATTATGAA TTCAATAAAA AAATTAAITG AAGCTGAGTT 120 

ATTAACAACG ACTAATGATT TTGAAGTTAG TATATATAAA AAGArAAGAC CTGAATTACA 180

AAGTATTTTA AAAAGTTTTG GTATAAAAAC AACAGGTAAT AAACCTGACT TAATTAAACG 24 0 

TATTGACGAC AATTTTCATA TTATTAATAA CTTAGATTTA CCATATGTAT ATATACCAAT 3 00 

TAAGAAAGGA GAAGAAATTT TAAAGAAAAC AGAGTACTTA ACCTCTTTTA TACAAAGTTA 360 

TGGTGAAATT TCTCTTGAGC GTGCTTATTA TTTGGTTGAA AACTATATAQ ATGAAAATTG 420 

CGATGATAAA GTTGCAGAAA TATACAAGTT TGAATTTCAA AGAAAATATG ACAATGGCGA 480

GTTTGATTTT AATCATGGAT ATAATTTCGA ATTGAATATG TTGATAGATC ACTATAAAAG 540 

AGATGTAAAA GACTACGATA ATGCCAGAAA GTATTCAAAT ATTTATCTTT ACTTTGGTTT 600 

GAGAGATTTC TTAAAAAAAT TAATGAGTAA TTATTCATAT TACGATAgTA AAGGGAATAT 660 

AGATTTGAAC GAAATACAAA ACGATCTGAA TAGATTTATC AACTCTAGCG CTTCTGGTAT 720 

GTACGAGCGA TTAATATATA ATGAAAATTT ATCCAATAAT ATTATGTTTG AATTATTTAA 780 

AAAGGACACA CAAGATTATA GTGATTTGGA AGAACAATTG ATTGAAAAGT TCATAAACTA 840 

TGTAGTGTCT AATGTAAAAA AAGAAAGTAG GAGTAATACT CTTATAGAGT TATCAAAAAT 900 

TTTAGAGAAC GGATATACAA TTGATAAAGA AGAATTTAAA AAAGAAGATG ATTATCTTTC 96 0 

TAAGTACATA TTTACTGACA TAGATTATTT GAAAAAGTTA GAATCAAAAA TAAACGTTGC 102 0 

TATTGATATT CGAAGTGGAG AAATTCATTT GGTATTAGAT GATGATAGCC TTGATATATT 108 0 

AATACAAAAT CAAAAATACG GCAATGAGTT TTGAGTCATA ACTAAATATA ATATGTTGAA 1140 

GAGAGGTTTT ATTTAAATGG CTAAAATTGG TTATGATCGT GTATCAACGA AAGATTAATA 12 00 
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AACGTACAGA GCTTGTTAAG TGTTTAGATT ATTTACGAGA GAGCGACACA TTAGTTGTCT 13 2 0 

ATCAACTTGA TCGGTTAGGT AQPJkQQACPJK AACTATTaAT TGAATTaTCA CAATGATTCG 13 80 

ATGATAACGG AATTGACTTA CAAATTAGTA ACATGAACAT TTCAACGAAA GACACAATGG 14 4 0 

GCAAAATGTT TTTTACGATG ATGAGTGCAT TTTTCaGGAT TAGAAGTTAA TTTACTATGT 150 0 

GAGTGTATAA AATAGACTTA GCAGCAACAA GAGCGAGAGG CCGAAAAAGC GGGCGCCCCT 156 0 

CTTTACCAGA GAATAAAAAA TGAGAAATTA AATTTTTATA TGATGAACAA ACGATAACAG 162 0 

GGGAAGAAAT AGCTAGTTAG ACAAGGGTAT GTCACTCAAC TGTTTATCGA GTTGATTAAG 1680 

AAATGAAAAA ACTTATACTA TGAATTACTG TTTAAAAGTG TGCATGTTAT AATATTTATT 174 0 

GAGCAAGTTG GATAGATGGT GGCTAATCTC TTAATAAAGG GGTGATGCCT ATGGTTATAG 180 0 

TTGTTACTCC TAGnAAAGGA CTAGCATGTC TGATTTTGAA ATGCTTATGG TTGTATTAAC 1860 

AATCATTGGT TTAGTATTGA TTAGTACTCA AGACCATAAA AAATAaCCTT CTATTCGCTT 192 0 

TGACCGGCAT TTTTGAAGGC TATTTTTAAA TAAAATATAA GGTCACCGTC TTTTTAACGG 198 0 

GCTCATTAGG GTAACATGTT TCCGAGTGTT GCCCTTTTTG TGTTTCAAGA GTTAATGATT 2 04 0 

TTCATCTTTT GCTTCTACTT GCTACAAATA TATTTTAACA CATTTTTCTT ATGAATTGTA 2100

GTTCTGAACA TAATCAGAAT TAATAAAACC AACTTTCCAT ACAGCAGAAA ATACAATTAA 216 0 

AAGTATAGAA TGTAATCACG AATTATATAA AAAGTATTGG AGACCTCTTC ATATATAGAT 222 0 

AATTGACTTA GTTATTTTAG AAAGAAGCCC CTAaCAACTA AAGTTGAAAA ATAGAGGAAC 2280 

ACAGTTgGAT TACGCATCAA CTGCATaAGg CCCCTAAluAA CTAAAGTTGT AAGGGGCyCT 234 0 

AAAATTTATT TTGGTTGATT GTCTTCTGGT TTATCTGAAG TCATTGTTTT TGTTGTATTA 24 0 0 

TCATTTAATG AATCTTGTGC TTTTTCTGTT TTTGTTTCAA CAGATGTTGT CACTTTATCA 24 6 0 

TTTTCTACTT TTGTATTTTT TGGTTCAGTG ACAACTTTTT TATTCTTGTC AGCTGTTTTT 2520 

ACTTTATCTT TAGTAATTTT TTGACCACGT TTTAAGTAAT ATTGAACGAT ACCCATTAAA 2580 

ATGATTGCAT GAATAACAAC GAATAAAATA ATTGTCATTA CTGTATAAAC TCTTATGATA 264 0 

TTTTCTGCAA CACTTTGAGA GAGTGATTGC GTATTAAATG AAATCAAGTA ACCAACAGGT 27 0 0 

GTTTTAAATA AAATAATAAT TAAGTTTAAT AATAGTATTC CAATGAAAAA TTTGAAAAAT 27 6 0 

GTTTTTTGAC CATTTTTCAT TGCTTTAAAT CCATTAGCTA AGTGTGTTTT TACTTTATCG 28 2 0 

TTAGTTGATT CdACGAAACT AGTCaTAAAG TTGAAAATAG GTATTAACAA TAACCAAGTA 28 80 

ATAATTGCTT TAATCAATAA AACAACAATG ATTATGATGC TTTGAGTTGT AATAGCAATA 2940 

CCAATCAGGT GTGTACTATC TGCATATGAT GATTGTACTG AATTCATAAT CATTTCTGAT 3 00 0 
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ATAGATACAA GACCGATCAG AACGCTCTTT AAATAATTAC CTTTnTAAA TGCAATAAAT 312 0 

AAATCGGTAA ATTTTACTTT TTCATGACTC ATTGCTCGTT TCATAACATT TGTAATTCCG 3180 

ATAAAAATTT GAACTAAAAC AAACAGTGAA ACTACAGCTG CTATAAGTAA TACAGCGATA 3 24 0 

ACTTTTAAAT ATGCATCTAC TGGTGGTTTT TGCCCAAATT GTGAATAAAT TGCAATAGTT 3 3 00 

TGTGCATACT TAGCAAGCTG GAAATTAGCA AGTAAATATA CAACXGCAAT TACAGCAAAT 3 36 0 

GCTATAAGTG CATATATCAA TGTTATTAAT AATTGTGGTT TAGCATTTTT AAATGCTGAT 3420 

TTAAAGTAAG TAAACAAAGT GGTGCCTCCT TTTCTTCA 34 58 
(2) INFORMATION FOR SEQ ID NO: 318: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 695 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 318: 

GCTTGCAAGA TGCTTTCATT AAAGACAAAG TCATCGATAT TATGAATATG TTCCAAAATC 60

ACACTGATAT CACTTATACG TTGAATAAGT CTCATGCACA TATTATATGT ACGCCAGAaA 120

TATTTGCTAA ATTGTTACAT ACGATTGCAA CTAGAAATAT CGACATTCTA TCTGCCAATT 180

ATAGATCGTC ATCTATGAGC AAAGCTCGTA TATCATAATA AAAGACACAT CTATACGATG 240

ATCATTTACA ATGATTGCGT ATAGTGATGT GTCTTTTTTT GTATTATTTA TCTTCGGAcG 300

ATTGATCCTC AACCATTTGT TCAAATGCTT CACGAACTTG TGGTACTGTC ATACCTACTA 360

CAACTTGGAT ACTCTTACCA CTCTTTACTA ATCCGTGTGC CATTTGTTGA TGCGTGAAAT 420

ATTCCGTATC TGCAACTTTA CTTTCATCAT AAACAGTTAA GCGTAATCTT GTTGTACAGT 480

TAGTAACATC TTTGATATTT TCTTTGCCGC CTAAACCGTC GAGGTAATAT GCAGCTTTGT 540

CTTCGTATTC ATTACCAGAT GAAGCACGTT TAGAATCTAC GCTATCACCT TTTTTATTTT 600

TGTAATCTTG CTTAGAAAAT AATTTAACTT CCTCTTCTGT TTyCTTACGT CCAGGTAATG 660

GAATATCAAA TTyCAAAATT AAGAATCTGA ATAAG 695

(2) INFORMATION FOR SEQ ID NO: 319: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 875 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear
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Ui) SEQUENCE DESCRIPTION: SEQ ID NO: 319: 

AATATTGaTa TTATATATAG TCGAATCaAT GATGACCCaC ATCGACCtAA TGCAAATAAT 60

AAGACAATCA AGCAGTTAAA TGACTTGTAT TGCTCGCGGA ATTTAAGATA TAATGAAATC 120

GCATTCAAGA AATTTGATAG TCATTTGCTA TCAATTTCAG AAATATATTA TGAATTGCTA 180

AATTTAATAA AAGCGAGTGA TCAGTATTAG AGAGAATAGA GCGTTAAGAC TCTATCGCCG 240

AAGtGCAAGT AATTTATTAC GAAACTCTCA GGcAAAAGGr TAATACTGTA ACGCGTTCCT 300

GAATTGGTGA TTTATAAACA GGGTAGCGAT TGCTATCCTG TTTTTATAAT TTTAAGGGGG 360

TATTTCAATG TCAAGTGATT TAAAACAAAC ACCTTTATAT CAAAATTATG TTGATAGAGG 420

TGCAAAAATT GTGGAATTCG GAGGATGGGC GATGCCTGTT CAATTTTCAA GTATTAAAGA 480

GGAGCATAAT GCTGTTCGAT ACGAAATTGG CCTGTTTGAT GTTAGTCATA TGGGTGAAAT 540

TGAAGTAACA GGTAAAGATG CTAGTCAGTT TGTGCAATAT TTATTATCAA ATGATACTGA 600

TAATTTAACT ACTTCAAAAG CATTATATAC TGCTTTATGT AATGAAGAAG GCGGTATTAT 660

TGATGATTTA GTAATATATA AATTAGCTGA CGACAATTAT TTATTAGTTG TTAATGCTGC 720

TAATACTGAA AAAGATTTTA ATTGGATTTT AAAACACAAA GAGAAATTTG ATGTTGAAGT 780

ACAAAATGTA TCAAACCAAT ATGGTCAATT AGCAATACAA GGACCAAAAG CtAGAGATTT 840

AATTAATCAA TTAGTTGATG AAGAkGTAAC TGAAA 875
(2) INFORMATION FOR SEQ ID NO: 320: 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 5897 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 320: 

TCTTTTTACG AAGGCTACCG TTTAGGATCT GATAACTTAA CTGTAGGAGA AATTGTATTT 60

AATACAGCGA TGACAGGTTA TCAAGAAACT ATTTCAGATC CATCATATAC AGGTCAGATC 120 

ATTACTTTTA CGTATCCATT AATCGGTAAT TATGGTATCA ATAGAGACGA TTTTGAATCA 180 

TTAGTACCTA CATTAAACGG TATTGTAGTG AAAGAAGCGA GTGCGCATCC AAGTAATTTT 240 

AGACAGCAAA AGACACTTCA TGACGTTTTA GAATTGCATC AAATTCCAGG GATTGCAGGT 300

GTTGATACAA GAAGTATTAC GCGTAAAATT CGACAACACG GTGTGTTAAA AGCTGGTTTT 360

ACTGATCGAA AAGAAGATAT TGATCAACTT GTCAAACATT TACAACAAGT AGAATTACCT 420 
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AGTGTCGTAC TTGTAGACTT TGGTAAGAAG CAAAATATTG TTCGAGAATT AAACGTCAGA 54 0 
GGTTGTAACG TCACAGTTGT ACCATATACA ACTACTGCCG AAGAAATTTT AGCAATGGCT 600 
CCAGATGGCG TTATGCTATC AAACGOACCA GGTAATCCTG AAGTTGTAGA ATGTGCGATT 660 
CCAATGATTC AAGGAATTTT AGGGAAAATT CCGrrCTTTG GTATCTGTCT AGGACATCAA 720 
CTTTTTGCAT TATCTCAAGG AGCAAGCTCA TTTAAAATGA AGTTTGGTCA TCGTGGTGCC 780
AACCATCCAG TTAAAAATTT AGAGACTGGA AAAGTTGATA TTACGAGTCA AAACCATGGA 840 
TATGCAATAG ATATAGATTC GTTAAAAAGT ACTGATTTAG AAGTTACTCA TCTTGCATTA 900
AATGATGGTA CTGTAGAAGG TTTAAAACAT AAAACATTAC CAGCATTTTC TGTTCAATAC 960 
CATCCTGAAG CAAATCCAGG ACCGTCAGAT TCAAACTATC TATTTGATGA TTTTGTAGCA 1020 
ATGATGACTA ATTTTAAGGA AAAGGAGCGT CATATCAATG CCTAAACGTA ATGATATCAA 1080 
AACAATTTTA GTAATAGGGT CTGGGCCAAT TATCATAGGT CAAGCAGCTG AATTTGATTA 114 0 
TGCTGGAACA CAAGCATGTC TAGCTTTAAA AGAAGAGGGA TATCGAGTTA TTCrTGTAAA 1200 
TTCAAATCCA GCGACAATCA TGACTGATAA GGAAATTGCG GATAAAGTAT ATATCGAACC 126 0 
GTTAACTCAT GATTTTATAG CGCGAATTAT ACGTAAAGAG CAACCTGACG CTTTACTTCC 1320
AACTTTAGGT GGTCAAACAG GTTTAAACAT GGCGATTCAA CTACACGAAA GTGGTGTGCT 13 80 
TCAAGATAAT AACGTCCAAT TATTAGGAAC TGAGCTAACA TCAATTCAAC AAGCAGAAGA 144 0 
CCGTGAAATG TTTAGAACAT TAATGAATGA TTTAAACGTT CCTGTACCAG AGAGTGACAT 1500 
TGTAAATACA GTAGAGCAAG CCTTTAAATT CAAAGAGCAA GTGGGATACC CGCTAATTGT 1560 
TAGACCGGCA TTTACGATGG GTGGTACCGG AGgCGGTATT TGTCATAATG ATGAAGAATT 1620 
ACATGAAATC GTCTCAAATG GTCTTCATTA TAGTCCAGCA ACGCAATGTT TATTAGAAAA 1680 
ATCTATCGCA GGTTTTAAAG AAATCGAATA CGAgTAATgC GTGaTAAAAA CGATAATGCC 1740 
ATCGTTGTAT GTAACATGGA AAATATTGAT CCAGTTGGTA TTCATACAGG CGATTCAATT 1800 
GTTGTGGCTC CTAGTCAAAC ATTATCAGAT GTTGAGTATC AAATGTTACG TGATGTTTCA 1860
TTAAAAGTTA TTCGAGCTTT AGGTATCGAA GGTGGTTGTA ATGTTCAATT AGCATTAGAT 192 0 
CCCCATTCAT TCGATTATTA TATTATAGAA GTAAATCCGC GTGTATCACG TTCATCAGCG 1980 
TTAgCTTCAA AAGCAACAGG ATATCCTATT GCAAAATTAG CTGCTAAAAT CGCGGTTGGT 204 0 
CTAACATTAG ATGAAATGTT AAATCCAATT ACAGGAACAT CTTATGCAGC GTTTGAACCA 2100 
ACTTTAGACT ATGTGATTTC AAAAATACCA AGATTTCCTT TTGATAAATT TGAAAAAGGA 2160 
-^T'^^CACACA AATGAAAGCA ACAGGTGAAG TTATGGCCAT TGGTCGAACT 22 20 
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TTACCAAATG GTGAAAGCTT CGATCTTGAT TATATTAAAG AACGTATTTC ACACCAAGAT 2 34 0 

GATGAACGAT TATTTTTCAT CGGCGAACAA TTAGAAGAGG CACAACATTA GAAGAAATTC 24 0 0 


ATAATATGAC TCAGATTGAT TACTTCTTCT TACACAAGTT CCAAAACATT ATTGATATTG 24 6 0 

AGCATCAATT AAAAGAGCAT CAAGGTGATT TAGAATATCT TAAATATGCA AAAGATTATG 2520 

GATTTAGTGA TAAAACAATA GCGCATCGCT TTAATATGAC GGAAGAAGAA GTATATCAAT 25 80 


TGCGTATGGA AAATGATATT AAACCTGTTT ACAAGATGGT TGATACTTGC GCAgCTGAAT 264 0 

TTGAATCTTC AACACCATAT TATTATGGTA CATACGAAAC TGAAAATGAA TCCATAGTTA 2 700 

CTGACAAAGA AAAAATCTTA GTATTAGGCT CTGGACCAAT TCGAATCGGC CAAGGTGTAG 2760 

AATTTGACTA TGCGACAGTT CACGCCGTTT GGGCAATTCA AAAAGCAGGG TACGAAGCGA 2820 

TAATTGTGAA TAACAATCCA GAAACAGTTT CAACAGACTT CTCAATTTCT GACAAATTAT 28 8 0 

ACTTTGAACC TTTAACTGAA GAAGATGTGA TGAATATCAT TAATTTAGAA AAACCTAAAG 2940

GTGTCGTTGT ACAATTTGGA GGACAAACAG CGATTAATTT AGCAGACAAA TTGGCTAAAC 3000 

ATGGTGTTAA AATACTTGGT ACTTCACTAG AAAATCTAAA TCGTGCTGAA GATAGAAAAG 306 0 

AATTTGAAGC ACTATTAAGA AAAATTAACG TGCCACAGCC ACAAGGGAAA ACAGCTACAT 3120

CACCTGAGGA AGCATTAGCG AATGCTGCAG AAATCGGATA TCCGGTTGTA GTAAGACCTT 3180 

CTTATGTATT AGGTGGTCGC GCAATGGAAA TTGTAGACAA TGACAAAGAG TTAGAAAACT 3 24 0 


ATATGACCCA GGCTGTAAAA GCGAGTCCGG AACATCCGGT ACTAGTCGAT AGATATTTAA 3 300 

CTGGTAAAGA AATTGAAGTT GATGCGATTT GTGATGGAGA AACGGTCATT ATTCCAGGAA 3 360 

TCATGGAACA TATTGAACGT GCTGGTGTGC ATAGTGGTGA CTCAATCGCT GTATATCCAC 3420 


CACAAACTTT GACAGAAGAC GAGTTAGCAA CACTTGAGGA CTATACTATA AAATTAGCTA 34 80 

AAGGTTTAAA CATCATTGGC TTAATCAACA TTCAATTCGT TATAGCTCAC GATGGTGTGT 3540 

ATGTTTTAGA AGTAAATCCA CGTTCTAGTA GAACGGTACC ATTCTTAAGT AAAATTACTG 3600 


ATATTCCAAT GGCACAATTA GCTATGCGAG CAATCATTGG GGAAAAACTA ACAGATATGG 366 0 

GTTATCAAGA AGGGGTTCAA CCATATGCTG AGGGTGTCTT TGTGAAAGCA CCAGTATTTA 372 0 

GTTTTAATAA ATTGAAAAAT GTTGATATTA CTTTAGGACC TGAAATGAAG TCAACAGGTG 3780

AAGTGATGGG GAAAGATACT ACATTAGAAA AGGCGTTATT CAAAGGGTTA ACAGGTAGTG 3840 

GCGTTGAAGT TAAAGATCAC GGTACAGTAT TAATGACCGT CAGTGACAAA GATAAAGAGG 3 900 

AAGTTGTTAA ATTGGCACAA CGCTTAAATG AAGTTGGCTA TAAAATTTTA GCAACGTCTG 3 960 

GAAcAGCTAA TAAATTAGCT GAGTATGACA TACCTGCAGA AGTAGTAGGC AAAATTGGTG 4 020 




EP0 786 519 A2 



TGACTAAAGG TAAAGAAGTA GAAAGGGATG GCTTCCAAAT TAGACGTACT ACAGTTGAAA 414 0 
ATGGTATTCC ATGTTTGACA TCTTTAGATA CAGCTAATGC CTTAACGAAT GTAATTGAAA 42 00 
GTATGACATT TACAATGCGT CAAATGTAAA TCAATCAAAC TGTATCGGTG GGGCTGTAAT 4260 
TAACCATTTA CTTAAAGAAG TTTATATTAC AGCCTCATTA TTTTAATGAA TTTCTTAATA 4 3 20 
TAAAGGGAGA CaTATATGAT GAAAGATTTA CCAATTATTG CATTAGATTT TGAATCAAAA 4 380 
GAAAAAGTAA ATCAATTTTT AGATTTATTT GATGAATCAT TATTCGTAAA AGTAGGTATG 4440 
GAACTTTTTT ATCAAGAAGG TCCTCAATTA ATTAATGAGA TAAAAGAAAG AGGCCATGAT 4500 
GTATTTTTAG ATTTAAAACT GCATGATATT CCTAATACAG TTGGTAAGGC GATGGAAGGA 4 56 0 
CTAGCTAAAT TGAATGTTGA TCTGGTAAAT GTTCATGCTG CTGGTGGCGT AAAAATGATG 4620 
TCTGAGGCCA TTAAAGGATT AAGAAAACAT AATCAAGATA CAAAAATTAT TGCAGTAACA 4680
CAGCTTACGT CAACAACAGA AGACATGTTA CGACACGAAC AAAATATACA AACATCGATT 4 740 
GAAGAGGCCG TTTTAAATTA TGCCAAGTTA GCAAATGCAG CTGGTTTAGA TGGCGTTGTT 4800
TGTTCACCTC TTGAAAGTCG TATGTTGACT GAAAAGTTAG GTACATCATT TTTAAAAGTA 4860
ACACCAGGTA TTAGACCTAA AGGTGCATCT CAAAATGACC AACACCGTAT TACGACACCG 4 920 
GAAGAAGCAA GACAGCTTGG TTCGACGCAT ATTGTAGTCG GTAGACCGAT TACACAAAGT 4 980 
GACAATCCAG TCGAAAGTTA TCATAAAATT AAAGAAAGTT GGTTAGTATA ATGGCTAAAG 5040 
AAATTGCAAA ATCATTATTA GATATTGAAG CTGTAACATT ATCACCAAAT GATTTATATA 5100 
CATGGAGTTC AGGTATTAAA TCACCGATTT ACTGTGATAA CCXSTGOTACG TTAGGTTATC 5160 
CTTTAGTTCG AGGCGCAATC CGCGATGGTT TAATTAACTT AATTAAAGAA CACTTTCCTG 5220 
AAGTAGAAGT TATTTCTGGT ACTGCAACAG CTGnTATTCC ACATGCAGCT TTTATTGCTG 5280 
AAAAATTAAA ATTACCAATG AATTATGTTC GTTCATCAAA TAAGAGTCAT GGTAAGCAAA 534 0 
ATCAAATCGA AGGTGCTAAA AGTGAAGGTA AAAAAGTAGT TGTGATAGAA GATTTAATTT 540 0 
CGACAGGGGG ATCTTCAGTC ACAGCAGTTG AAGCCTTAAA ACTAGCAGGT GCAGAAGTAT 5460 
TAGGTGTTGT AGCTATCTTT ACTTACGGTT TGAAAAAAGC AGATGATACA TTTAGCAATA 5520 
TTCAACTACC TTTTTACACT TTAAGTGATT ACAATGAATT AATTGAAGTA GCTGAmAmTG 5580 
AAGGTAAAAT TTCTAGTGAA GATATCCAAA CATTAGTTGA ATGGAGAGAC AACTTAGCAT 5640 
AATATAGACA CTAGAAGGAG GAATTCAACA AATGAATGAC AAAACATCTA ATGATTTATA 570 0 
TGGGAAGATA AAACATTGTA ACGAATTTAT CAATCATTCA AATGATTCCA ATCTATCTAG 5760 
--^-^ATGAAA GTTCAACGAA GCAAAAACAT ATAAAAAATA AAACAACTAT 5820 
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TTAAAAACAA AAAAGCT 58 97 
(2) INFORMATION FOR SEQ ID NO: 321:


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7965 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 321: 

TCATTCTAAA TCAACTTATT TTCCATTGCA TAAATTGCTG CTTGTGTACG ATCGCTAACT 6 0 


TGTAATTTAC TAAATATATG ACTGACATGT GTnTAATTG TTTTTTCAGA TACAAATAAA 120 

GTTTCTGCAA TCTCTTTATT TGTTTTACCT TTAACCATTT CACGTAACAC TTCAATTTCT 180 

CTCTTTGACA ACTTATTCGT GTAGTGTGGT TTTTGGCTAA CTGTTTCGAA TACATCTTGT 240

GCCTTAGGAT GTATCATTTT TTCACCGTTC ATAACTCGTC TAATAGTTTC AATTAATTGC 3 00 

TGAGGCTCAA CGTCTTTCAT TTCATAACCA TCAGCACCTT TATTGATTGC TGAAATTACA 360

TGTTCATCAT CAACATAACT TGTTAATACC AAAACTTTAA TATCCGGATA ATGTGCCTTA 420

ATATATTCCG TAATTTCAAT ACCATTCATG CCAGGCATCA CTAAATCTAA TAGCACAATA 4 80 

TCAGGGTGCT CATGCTCTTT TAAATATTCT AAAAATGTTT CTCCATCTGC AAAGTCTTGT 54 0 


AAAACTTCTA TGTTTTCAaT CGTGGaTAAT AAAAATCGCA ATCCTTGtCG CACAATATAA 6 00 

kGGwCATCTA CTAATATnAC TTTGTTCATG GGTTATCTCC TTAAAtCAAG CTATTTTATA 66 0 

GGAATTGTGA ATTGTATTTG TGTACCCTTT GTTGGCTGAG AATGAAAGGT CACTTTACCT 72 0 


CTTAATAATT TAACTCTTTG TTTTATGTTA TTAATACCGT GTGATGAAGC TATCTGAACA 780 

TTATCGATCT CAAATCCTTG ACCATAATCA ATCACGTCAA TATATAGTAT ATCGTTCATT 840 

TGTTTTAATG TAAGATCCAT TTTATTCGTA TCAGCATGTT TCTTAACATT ATTAATACAC 900 


TCTTGTAATG CTCTGTATAT GTTTTCTTCG ATTTCATTAG ATAAATCGAT TAAACCTTCT 960 

ACATTTACAT TTAATTGTAT ATGCATTAAT TTACTATATG CTGTCAAAGC ATGAATTAAA 102 0 

CCTTGCTCAA GTCCAACTGG CTTAAGTnGC CAAATCAATG CACGCATTTC ATTAACGGCA 1080

TTTTGACTCG TTtCCTCAAT CGTCTTGAAT GCTTGTTTAG CGATGGATTC GTTTGACATG 114 0 

CCATACGCAG CATGTGCtGT TAGTTTTACA GAAAATAACA TTTGATTTAC TGAATCATGT 1200 

AAATCTCTAG CTAAACGATT ACGTTCATTA ATTTTTGCCG CTTCTTTTTC ACGGTCTGTT 1260 

AAATAAATAC GTTrGATGGC TGACCCTAAT TGAAATGCGA CAGACTCTAG CAACTCTAAA 1320 
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TGACCCGATT TAAGTGGTAC CGTTGCATGA TGTGTAATAT TGTCATTTTG GCTAGGAAAT 144 0 

GCTTTAGAGG CTAAGTTAAT ACGAGAACAA TTGACGATAT TCGACGCTTT CATTAGCCTA 1500 


CGTTGATTAA ATGCTTTCAC ACACCAACAA GACCCATCTT TAATATAGTG ACAGTGGTCT 1560 

GCTGTCAAAG ATTGTGGTAG AGCCACATGT GATACAAGTT CATGTTCACC TACGCTATTG 1620 

ATGAAAAATA TCCAGCCTGT CGTGAAATTA CTGCCCTCAA TTAAATATTT TAACGCACCT 1680


TGGGTCATGC TATACATTTC rGTTTCTTCG TTTAAAAATT CGGCAATCTC TTTTAATAAA 1740 

GCTAGTCGCG TCCTTTGTTC CATCAAATCG CTCCAATTCA TTTTTACGTG TATTAACTAT 1800 

TATACATTGA GTTATTATAT TTTTAAATCT TAGACGTAAA CATGATAAAA TGGCCTTGAT 1860 


TACrCAATAG TTATATTTCG GAGAACTGAT TTGTGATATG ATATTAAAGA CTATAGGAGG 1920 

ATTTTATGAA ATTTAAAATA CCAGAAAACT TTAATGACTT AAGTTTACGA GATATTTTCC 1980 

AACAACTTAA GGTACCTAAA AAAGATTTAC ATCATTTAAA TATGTCTAAA GATATTACTA 2040

TTAATGATAA ACCTGCGCGA TTAATGGATA AAGTGCATAC TGGCGACGAT GTATTTGTTC 2100 

CAACCATCGA TGAAAAAAGT AATTATGTTC CAAGTTATCG TTATGCACAA ATTAAATACG 2160 

AAGACGATGA TATGGCAATC GTAATGAAAC CTAAAGGTGT TAAGACTCAC CCTAATGATT 2220

TAAAAGAAAG CAATACTTTA ATGAATCATG TGATTTACAC TATTGATAGT GACTATGTCG 2280 

AACCAATTCA TCGACTGGAC CAGGAAACAG TAGGATTATT AATTGTTGCT AAAAATCCTT 234 0 

TAATGAAAAA AATTCTTGAT CGCATGTTAG AAGACAATGA TATTACGCGG ATATACAAAG 24 00 

CAAATGTTAA GGCACTTTTA CCTTTAAAAC CACAAACGAT TGATATGCCA ATTGGTAAAG 24 60 

ATAAATTCCA TTCGAATAAA CGACGTGTGT CTCCTACTGG ACAGCGTGCA ATTACACACA 252 0 


TTTTAACTTC AAAAATGATA AAAGAAGCTG TGTGCCmACT TGAAATCAAG TTGGATACTG 2580 

GACGTACTCA TCAAATwCGT GTGCATTTAG CTGAAATTGG TCACCCTGTT ATTGGTGATC 2640 

CTTTATATGG TGATTCAACG TTAAGACAAT TAGAACTTGA AAGTTACAAA ATAGAGTTTG 2700 


TGcATCCCTT GACTAAGGAA GTCATTTCCG TTTCTTTGGA TGACTAATTT GATTAGTTTT 2760 

GCATGATATT tAAACATGCA ATACCGCATT GTAACTAAAT CAAGTATCAA CtTAAAcGGA 2 820 

TAGATGGAAA ATTATTAATT TTTTCAGATG TTCGGTTTTT TTGTTTTTTA CGATGCTTAG 2880

GATTTTATAT TTTGATATTT TAGTAATTAT TCATTTTATA ACATCCTTGG ATAATGACTT 2 94 0 

GTAGTCTTTT TCAACTGCGT TACGTGTATC TATGGACAAT ACATGACATC ATAAGATTTT 3 000 

TATCACAGGT TGTTTGGCCA ATACATGTAC AACAATTCAT CATATAAAAA ATAGGTTCTA 3060

-r.A'-AAAA^G GACTCCATGA AAAGTTTTTC CTTTTCATGG CTCTATATCA AATCAGACTT 3120 
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CCAAACTTTA TTTTATATTA ATATTTAATT AATGAGGATC TACCATATCT TCTGGTTTAA 3 24 0 

TCCATGCTTC AAATTGTTCT TCTGTAACAT ATCCAGTTTG AATTGCAGAT TCTTTTAAAG 3 3 00 


TTAAACCTTC TTTATGGGCT TTCTTAGCAA TTTGAGCTGC TTTTTCATAA CCAATATGTG 336 0 

GATTTAATGC AGTAACTAAC ATTAATGATT GATTTAAATA ATTATCAATA TTCTCTTCGA 34 2 0 

TTGGTTCAAT GCCCACTGCA CAATTGTTAT TAAATGTTTC CATACCATCA GCTAAAAGAT 34 80 


AAATTGATTG TAGTGTATTA TGCATAATAA CTGGTTTATA AACATTCAAT TCAAAGTTAC 3 54 0 

CTTGTGAACT TGCGAACCAA CAACTGTATC ATTACCCATT ACTTGGACTG CnACCATTGT 36 00 

TAACATTTCA CATTGTGTAG GATTAACTTT ACCAGGCATA ATTGATGAAC CTGGTTCATT 3660

TTCAGGGATA GAAATTTCTG CCAAACCAGC TCGTGGCCCT GAAGCCAACC ATCTCACATC 3 720 

ATTAGCAATT TTCATTAAGT CTCCTGCTAA TGCCTTCAAT GTTCCATGCA ATTGAACAAC 3 78 0 

TTCATCATGC GCTGTAAGTG CGTGGAATTT ATTTTCAGAA GATACAAATG GATAACCCGT 3840

ATTTTCTGAA ATATAATGTG CCACTTTATC ACCAAATTCA GGATGCGCAT TAATACCAGT 3900

ACCAACAGCC GTACCACCGA TGGCAAGATT TAAAATGTGC TTCTTAGATT CAGATAACAT 3 96 0 

TGTTTCACAA CGGTCAAGCA TATAACGCCA GCCACTAATC TCTTGTCCTA GTTTGATCGG 4020

CGTTGCATCT TGTAAATGTG TACGACCAAT TTTAATAATT GAATCAAATT TATCTTCTkT 40 80 

TTCTTTCAAA GTATTTCTTA AAAGTTTTAA TGCAGGTTCT AATTTTGTTT CAACCTCTTG 414 0 


ATATAATGCA ACGTGCATAG CAGTTGGGaA TGTATCaTTC GaACTTTGAG ATTyATTTAC 4200 

ATCATCATTk GGGTGGATAC TTTCATCACT TTGATGATCT TTTAAATACA TATTAGCAAC 4 260 

ATAACTTACT ACTTCGTTCA CATTCATATT ACTTTGTGTA CCGCTTCCTG TTTGCCATAC 4320 


AACTAGTGGG AAGTGTTCAT CTAATTCACC TGATAAAATT TGATCACATG CGTATACAAT 43 80 

GGCATCTTTC TTTGCCTCGC TTAATTTTCC TAAATCAAAA TTAGCTATTG CTGCTGCACG 4440 

CTTTAGTTGT GCAAAACCAT AAACTACTTC GATTGGCATA CGCTCTTTAC CAACTGGGAA 4500 

ATTACGTTTA CTTCTTTCTG TTTGAGCACC CCAATATTTA TCTGCAGGTA CTTCTATTTC 4 560 

TCCAAAAGTA TCATGTTCAA TTCTTACTGA CATTCAATTT CTCCCCTTAT CACTGTTTAT 4620 

TTAACTGTAG TATATCATTA AATAATTTAA TTGAGCAATT TATGATTAAA ACGTTTTCAT 4680

AATTTGAAAT AAAAATACAC TAATCGCACG TGTTCACCCT TTATTACAGT GATACGGTCA 4 74 0 

TACGATTAGT GTGTTATCTA TCATTATTTA GTTATTATTG AACTAAGTTT AATTACGATA 4 8 00 


CTTTGTTTTA GTAGCTTCAA CCGtAGCAAT AGCTGTAAGT ATATATAATA CAGCACTAAC 4 860 

AATTGTCGTA TATGGATTTA GAGCAACAAG CGTaCCTAAA ACTCCTGTTA AACTCGCATA 4 920 
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TACAATACCT GATTGATTAC TTTTAATGAA TGTTTGCGCA TTAACATCAT CAATTAATCC 504 0 

TTTTGATAAA TTGAGTTGTA ATTTTATTAC TTTGAAAATA ACAGGTAAAT ATAATGCCCC 5100 


AATTGCCAAT GGAAAAGCTT TAATTGATAT TAAACTTATA ATAACTGTTG CTATCAATAA 516 0 

TTGAATCCAG TATTTTCCTA ACATAAATAT ATAAATCTCC TCTAATTTCA TTCTTCAATA 522 0 

GCATATCATA ATCTTGGCAT ATTAAGAAAC GCGGTTTAAT GATTTCATTA AAAATATTAC 5280


TGATAGATGA CTTCTTTCAA TTATGTCTGG AGTAATTAAT TATCAATTCC GTTTAAATGG 534 0 

TGTTTTAATA TTTAAAATTG AACTTTTGAT ATATTACTAT GTCTGGTACA CAAATCAATG 5400 

TTTTATGCTT TACAAAGTTA TATTGGCAGT AGTTGACTGC AGTCCACAAC ATAGAGGCTT 54 6 0 

CGGAATGTCA GCTTCTATTT CATGCAAGTT GGTGGAGCTC CAACATAGTG GAATTGGATT 5520 

CCCAATTTCT ACAGACATTG CAAATTGGGG AAACGGGCCA CAAACTCAGA AACTGGTGGA 5580 

AAGTCAGCTT AAAATAACAT GCAAGTTGGC GAGGCCCCAA AATAGTGAGA TCGGATTTCT 5640

AATTTCTACA GACATTGCAA ATTAGGGAAA CGGGCCACAA ACTCAGAAGT TGGTGGAAAG 5700 

TCAGCTTAAA ATAACATGCA AGTTGGCGGG GCCACAACAT AGAAAAATTG GATCCTCAAT 5760 

TTCTACAAAC AATGTAAGTT GGGGAAACAG CCCCAACACT GAAACTAGCA GAAAGTCAGC 5820

TTCTATGAAT ATAATAAAAA AGCTAGGTAA CAAAATGCTA CCTAACTTCA TATTCAAGAT 58 80 

AATCAATCCT ATTTGATATA TGTTCTATAC TATACATTAT TTACATGATA AATAACTGAA 5940 


TATTACACAA TTATAATACT TTACTGACTG TCTTCTTCAG AATTCTTTTC TTGATCATTT 6000 

TGATCAGAGA TTTGTTCCaT TTCTTTACCT AATTCTTTTA AATCTTCAAA ATCCGTTACC 6060 

ATACTGTTTT CTTCTTCATG ATAATTTAAT TTTGGATCTT TGTCTTTAGA CATAATCAAT 6120 


ACCTCACAGA TTTTTAATTA AGCAAAGCGT GACGTAAAGT AAGCTTTAAC ATCTTCAGGT 6180 

AAACCTGCAG CCGCTTCTTT ATCAAGAATA ACATTTACCA TTCTATGTGC TTTTAAATCG 6240 

GCTGGTTCGA AGCTTGTTTT ACCATTTTCT TGATATAATT TTTCAACTAC ATCTCGTTTA 63 00 


TTAGCACCTG TCACTACTAA GAAAATTTCT CTTGCTTCCA TTAGTCCTTG ACGAATACTA 6360 
ACATTTAACT TACCTTGCTC ATCGATAGAA ACAACTTGTA ATGTTAATTT CCCTTTATTT 6420 
TCTTTAGTTT TAATCTTATC AGCGATTAAT TCGATTGCAT CTTTTTCATA AGCAATTGGA 6480
TAAACTTGAC CTGCTGGTAC ACCTAACGCT TCGAAATATG ATTTTTTATC GTCATAATCT 654 0 
AAAATATTTA TTTGGCTAAA ATCAACAGCA TGTTTTTCAA CATTTTTCTT TAATTCATCT 66 00 
AGAACTGGCG CTTGATCTGT ATCTAAATGA AAACCTGCAA TTGTAGTAGG ATTATTGTTA 6660 
-AATAATATC AGCAGCATAT TCTGCTACAA GTTGACTATT GTCAAAGACT 6720 






GATATTAATA ATTATTATAC CCTAACTTTC AATATATCAA ACCATTTAAC TTTAACATGC 6 84 0 

TTATACTCTA AATATAGCAC TTAAGCATCA TTTTTATAAT GAAAATGAGT AAATTTTAAT 6 900 


TCAATCCCGG rAAATCTTGT TGACGTAACG CTTCATAAAT TAACAACGCA GCAGTATTTG 6 96 0 

ATAAATTTAA TGAACGAATA TGTTCACTCA TAGGAATTCT TAACGCTGTG TCTTGATATT 7 02 0 

TCTCTTTCAC CCAGTCTGGT AATCCTGTCG TTTCTTTTCC AAAAATGAAG TAAAAATCTT 7 08 0 


TGTCATGATT TGAAAAATCA AAATCACTAT AAGTCTTTTT ACCAAATTTT GTTAATAAGT 714 0 

AATACTCGCC ATTTGTGACT TCAAAAAATG CTTCAATACT ATCATGATAC GTAATATTCA 7200 

CAAATTCCCA ATAATCTAAA CCGGCTCTTT TTAACATTTT ATCATCAGTT CTAAATCCAA 726 0 


GAGGTTTAAT TAAATGTAAA TGTGTGTTTG TACCTGCACA CGTACGgCAA TGTTACCAGT 7320 

ATTAGCTGGG ATTTCTGGTT GATATAAAAC GATATGATTT GTCATATTAC TATTCTCTCC 73 80 

TTGTGTCTAA TCCTTTTATC ATTTCATTCT GAACTTCTGC ATCCTCTTGA TCATAATTAG 7440

CATTGATAAA ATCTCTTGCT TCTTCCCCAA GAATTTGACC AATGGCCCAA TAAGCAGTTG 750 0 

CTCGAATCAA CGGTCTTTCA TCTGTTGTTG CAACTTTTTT CAATTCTGGA ATTGCATCCA 756 0 

CTTCATTAAA ATGCGCCAAT GCTAAAATAG CATTTCGTTG TATCGGCTTT TTACCACGCC 7620

AAGCACCTGC AAGgTGACCA TATGTTTGTT TGAATTCTTT ATTAGACATA CGTAGtAAAG 76 80 

GTACTAATCT TGGCTTTAAA ATTTCTGGTT CCAAAATGAT GTCATCTTGT TCGGTATTAA 7 74 0 

TACCTCTATT TTTCGGACAA ACTTGTTGAC ACGTATCGCA ACCATATAAT CTATTCCCAA 7 800 

TTTTATAACG ATATTGGTCA GGCATATAGC CTTTTGTTTG CGTTAAAAAA CTAATGCATT 7 860 

TCTGACTATT TAATTGGCCA TTTCCAACTA ATGCACTTGT TGGACAACGA TCAACACAAA 7 920 


TTGTACAATG CACCACAGCT ATCTAATAAT GGATCATCAG GTTCC 7 965 

(2) INFORMATION FOR SEQ ID NO: 322: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1302 base pairs

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 322: 

GCCCTGTTGG AGAAATCACC TTTATACGAT GGTTGAAAAA GCATCATTAG GTACAATCGT 6 0 

GGCAATTATA GTTATTTTGC TGTTTTTAAG AAACATTCGT ACGACGGCAA TTTCTATTAT 120

ATCGATTCCG TTATCACTTC TTATGGCGCT TATTGCTCTG AAATTGAGTG ATGTTTCATT 180 
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TGTAGTTGrr GAAAATATTT ATCGACGCTT AACAGATTCA GAAGAACAAC TAAAAGGTGA 
AAATTTAATT ATCAGTGCGA CAACTGAAGT ATTTAAACCA ATAATGTCAT CGACACTAGT 360
TACTATTATC GTCTTCTTAC CACTTGTGTT TGTATCAGGT TCAGTAGGCG AAATGTTTAG 420
ACCTTTTGCA ITGGCTATrG CAITrAGTTT ATTAGCATCG TTATTAGTOT CAATTACACT 480

COrrCCAGCG ttggcagcta cactatttaa aaaaggcgtt aaacgtcgta ataaacaaca 540

TCAAGAAGGA TTAGGTGTTG TTAGTACAAC TTATAAAAAA GTATTACATr GGTCATTAAA 600
TCATAAGTGG ATTGTAArTA TATTAAGTAC ATTAATTTTG GTTGCAACTA TTGTATrTGG 660
AGGACCGAGA CTAGGCACTA GCmATrTC AGCAGCTGAC GATAAATTIT TAGCTATTAC 720
TTATACACCG AAGCCTGGTG AAACGGAGCA AGCAGTGTI^ AATCATGCGA AAGATGTTGA 780

AAAATATTrA AAACAGAAAA AGCATGTAAA AACAATTCAA TACTCAGTTG GCGGTAGTAG 840
XCCAGTAGAT CCAACGGGTA GTACAAATAG TATGGCAATC ATGGriGAAT ATGATAATGA 900

CACGCCTAAT TTTGATGTAG AAGCGGATAA GGTrATiTW. ^xo^v^^^ - 960

TCCTGGAGAG TGGAAAAATC AAGATTTAGG AACAGGTGCA GGTAATAAAT CTGTAGAGGT 1020
TACTGTAAAA GGTCCATCAA TGGATGCCAT AAAATCAACT GTAAAAGATA TTGAACAGAA 1080
AATX^AAACAG GTTAAAGGAC TAGCCAATGT CAAATCTGAT TTATCGCAAA CATAl^ATCA 1140
GTATGAAATT AAAGTCGATC AAAATAAAGC GGCAGAAAAT GGTATTTCTG CAAGTCAACT 1200
TGCAATGCAC TTGAATGAAA ACTTACCAGA AAAAACAGTT ACGACTGTTA AAGAAAATGG 1260

TAAAACTGTT GATGrTAAAG TCAAACAAAA TAAGCAAACA GC 1302
(2) INFORMATION FOR SEQ ID NO: 323: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1003 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
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(XI) SEQUENCE DESCRIPTION: SEQ ID NO: 323: 






ATATATATTA ATTTAAACGT GTTTCACATG TACCAGTGTT AATGACAGAT AATGCTGCGT 60

TTAAACCACC TTCAACAAGG CTTCATCAGA GAAGAAAGCA ATATGTGGTG 120

TTACTAAAAT TCTTTCATGT TCGATTAACT CTAATAATGT TTTATCGTCA ATGTCTTTAT 180

TAGTCCAGTC ATTTGTGAAG TATGCTGCTT CATTTTCATA AGTATCAATC GCAGCACCTA 240

ACAAAGTACC ATCGTTCACT GCAGCGATTA AATCAGGTGT ATTGATGACT GCACCACGTG 300

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GATAGCTTTC TTTGTTCGCT GGAACATGTA AAGAAATAAT ATCGGCATCT TTAATAGCTT 420 

CTTTAACACT ATCTTTATAA GTTAAAAAGT CTAAATCTTT ATTAGGATAG GCGTCATAAG 4 80 

CTGTAATTGT AGCACCAAAT CCTGCATATA TtTTAGCTGT AGCAGCACCG ATACGACCCG 54 0 

TACCAATAAT TGCAACAGTC ATATTTTTAA CTGGTTTAGA CATGATTTCT GCTTGCCAAG 600 

TAAAATCATG TGCTTGTACA CGGCGTTCAA TATCTGGGAA GCGACGCACT AATTGTAGGG 660 

CGATAGArAC AGAATACTCT GCAATtGTTT CAGGTGAATA ACTAGGAACG TTAGATATCA 720 

CAATATTGTG TTTTTTAGCT AAATCTAAAT CATACATATC AAATCCAGCA GTACGTTGTG 780 

CAATTTGTTT AATACCGTAA GATTCTAATT TAGGATAAAC GTCATTTTCT AACTTACCAA 840

ATTGCATTGT AGTTACGCCA TCGTAATCTT TTAATTGAtC GACTGTAGCA CTTGATAATA 90 0 

GCTCTTTAGA AGTAGTTACT TCGACATTAT TCTtTtTCCC CCAATTTAAT GCCATCTCTT 96 0 

TCTCATAATC ACGCGTaCCA AAGAACATAA TTCTCGTCAT TAT 1003 

(2) INFORMATION FOR SEQ ID NO: 324:

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 5030 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 324:

GGCTTTTATA GTGTTTACGA TTATTATGAT TACTTATATT CACGCTGTTA AAAATTATCC 6 0 

AAATAACCGT ACTGTTCATT ATGGTTATAC AGCTGCATTT ATACTTGTTA TATTACAAGT 12 0 

TATCACAGGT GCATTATCTA TTATGACAAA TGTTAACCTG ATAATCGCAC TTTTCCATGC 180 

ATTATTTATC ACTTATTTAT TTGGTATGAC AACATACTTT ATCATGCTAA TGTTACGATC 24 0 

AGTAAGAAGT GACArGCAAT AACaAAAAAG cmAACCGTAA TTTTAATGGC ACGCCCATTA 3 00 

AAATTACGGT nTTTTATATC AATATTTAAA AtTAAACCtA AGCCATGTAA AAACGAGATT 360 

ACACGTCAAT TGTTGTGTAA TCTCGTTTTA TnTTAATCAT TTTAGTCAGT TGCTTTTTCA 42 0 

ATTTCGATTA ATAAATCGCC TGTCGCTATT GTGTCACCAT TATTTACAGT TACTTGTTTA 4 80 

ATCACACCGT CAAATGGTGC TTGAATTGTT GTTTCCATTT TCATAGCTTC AGTAATTAGC 54 0 

PJiCGQCTGAT TAGCTTTCAC AGTTTCACCT ACACTAACCT TGACTTCAGT TACTGAACCT 600

GGCATTTGAG CACCGATATG ACTTGGATTA CTCTTATCTG CTTTTGGCTT AACGTTCGCA 66 0 

TTTGTATGCA CATTTTCATC TTTAATGTAA ATACGTCTCG CTTGACCATT CATCGCATAG 72 0 
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CGTTTACCTT TATCGATTTC GATTTCTACT GTTTCACCAT TACGCATTCC AAAGAAGAAT 84 0 

GTAGGCGTAT CAAGTAACGA TAAGTTTCCG TATTGATTTC TAGTTTGAAT ATATTGTTCA 900 

TATACTTTTG GATATAGTAC ATAACTAATA ATATCTTGCT CCGTAACAGG ACCTTGTTGC 960 

TCTTCTTCAA GCAACTCACG GACTTTTTCA AAATCAACTG GCTCTAGATA TTCACCTGGA 102 0 

CGAGCTGTTA GTGCTTCTTG GCCTTTTAAA ATAACCGCTT GTAAATCTTT ATTAAAACCA 108 0 

TTTACAGGTT GTCCTATTTC ACCTTTGAAG AACGACACTA CTGATTCTGG GAAATCTAAT 114 0 

TTATAGCCAT CTGTAATCAC GGATTGTTCA TCAAGATCAT TTTGTACCAT ATAAAGTGCC 12 00 

ATATCACCAA CTACTTTAGA CGATGGTGTT ACTTTTACGA TATCACCAAA TAAGAAATTC 1260 

ACTCTGCGAT ACATATCTTT GACTTCATCA AATCTTTCGC CTAAACCTAA ACTTTTAGCT 1320 

TGTTGACTTA AATTCGAATA CTGTCCACCA GGCATTTCAT GTTGATAAAT TTCAGTATTC 13 80 

GGTGATTTGA TATCACTTTc AAAGTCTGAA TAATAAGTAC GTACAGTTGA CCAATAATGA 1440

CTAAGTGACT CCATACCTTC AATATCAGTT CTAAGGTGGC GTGGGAAGCC ATTTAATGCA 1500 

TAATATAACG AATTGGCGCT TGGCTGACTT GTTAAACCAC TCATTGAAGC AACAGCAGTA 1560 

TCAATGATAT CGACACCAGC ATCTATTGCT TGTTTGTATG TTAATAAACC ATTACCACTT 1620 

GTATCATGAG TGTGAAGATG AATTGGTAAA TCTACAGCTG ATTTTAACTC ACCAATCAAT 1680 

TCGTAAGCGG CTTTAGGTTT TAATAAGCCT GCCATATCTT TAATCGCTAA AATATGGAAA 174 0 

CCTTCACGTT CTAACTCTTT AGCTAGTTTG ACATAATACT CTAAAGTATA AATGTTTGAT 1800 

CGCTCAGGAT TTAAAATGTC ACCTGTATAA CAAATAGTAC CTTCTGAGAT TTTGCCCGCT 1860 

TCTTGTACTG CTTCATTGGC AACTTTCATT TGATCTACCC AGTTTAATGA ATCGAAAATT 1920 

CTAAAGACAT CTATGCCTGC TTTAGCACTT TCTTGTACGA ATTTATGAAT AACATTATCA 1980 

GGATAGTTTT TATAACCAAC TGCGTTTGAA GCACGTAACA ACATTTGGAA TAATACATTT 204 0 

GGAATAGCTT TACGTAGACG TTCAAGTCGT TCCCATGGGT TTTCCTTCAA GAAATTATAT 2100 

GCCACATCAA ATGTAGCACC GCCCCACATT TCTAGTGAGA AACCATCTTT AAATACGTCC 216 0 

GCTGTTTTGG ATGCGATATT AATCATATCC TTAGTTCTAA CTCGTGTAGC TAATAATGAT 2220 

TGGTGTGCAT CTCTAAAGGT TGTATCTGTT AGTAAGACAT CATCCTGCTT TTTAACCCAT 2280 

TCAGCTACAC CTTTTGGACC TACTTCATCA AGCAATTGTT TCGTACCACT AAATGAAGCG 234 0 

ATTTTACTTG AAGATACAGT TGGAATTGAT GCTAATTCAT AGTCTGGTTT CGGACGTTTC 24 00 

TCAACATTTG GGAAACCATT AATTGTTACA TTACCTATAT ATTCTAATGT TTTAGTACCT 24 6 0 
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TTAACACCAC GAATACGCAT TTCTCGTAAT GAGCGTACCA TTTTTTCTTC TGCTTGTTTA 264 0 

AATGATATCG CGTGTGTAGA TAATTTTACG AGTAATGAAT CATAATAAGG TGATATCTCA 270 0 

GCACCTTGGA AACCATCTCC AGCATCAAGA CGTACACCAA AGCCCCCGCT TGAACGATAA 2 76 0 

GCAATGATTG TTCCAGTATC CGGCATGAAA TCATTTAACG GATCTTCTGT TGTAATACGA 282 0 

CATTGGATGG CATAGCCTAA TGTTGTAATA TCTTTTTGTT GCGGCATATT AATCTCTTCA 2880 

CCAAATAAAT CGGCACCTGC TGCAACTAAA ATTTGTGTCT TAACAATATC AATTCCTGTT 2 94 0 

ACCATCTCTG TAATTGTATG CTCTACTTGT ACACGAGGGT TAACTTCTAT AAAGAAGAAT 3000 

TCGTCACCAG ATACTAGAAA TTCAACAGTA CCTGCATTGA CATATTTAAT ATTTTCCATC 306 0 

AATTGAATTG CAGCATCACA AATACGTTGA CGTAATGTTG ATGATAATCC AACTGATGGT 3120 

GCAACTTCTA CAACTTTTTG ATGACGACGT TGTACTGAAC AATCACGTTC AAATAAGTGT 3180 

ACGATAITTC CATGTTCGTC ACCTATGACT TGTACTTCAA TATGCTTTGG ATTATCAATG 3240

TATCTTTCTA TGTAAACTTC ACTATTACCA AATGATTTTT CAGCTTCTGA TTTTGCTCTA 3 3 00 

TGGAAAGCAT CTTCTAATTC ACTTTCTTCA CGAACGATTC TCATACCTTT ACCGCCGCCA 33 60 

CCACTTGTGG CTTTAATCAT TAGCGGGAAA CCAGCTTCTT CTGCAAATTC TTTTGCTAAT 34 2 0 

TCATATGATT TAATTGGACC GTCTGTACCA GGAATAACTG GTAAATCTGC CTTGATAGCC 34 8 0 

GTTGTACGAG CTTTAACTTT ATCTCCAAAC ATATCTAAAT GTTCTAAATG AGGACCAATA 3 54 0 

AATTTAATTC CTTCTTCTGC ACAACGACGC GCAAATTGTT CATTTTCACT TAAAAATCCA 3 6 00 

TAGCCAGGAT GAATCGCATC CACATTCGCT TGTTTTGCTA CATCAATGAT ACGCTCAATA 36 6 0 

TTTAAATAAC TTTCAGCAGG ACCTAAATCA CTTCCAACTA AATAGGATTC ATCTGCTTTA 372 0 

TATCTATGTA ATGAACTTTT GTCTTCATTC GAATAAATTG CAACTGTGCT GATGTCTAAT 37 8 0 

TCTGCCGCCG CTCTGAATAT ACGAATTGCA ATTTCTCCAC GGTTAGCAAC AAGTAACTTT 3 84 0 

TTTATTTGTT TCAATAGCGA TACACTCCTC AAACTATTAG AATTTTCTAA CTAATTAGAT 3 900 

AATAAAATTT TATCTTAAAG CGCTCTGTTT TGCTATAGTt mTGTTTCmAA TTTTCAAAaT 3 96 0 

TTaACATyCT tGAGACAATT AAAaCCyCCG CTTCmGaAAT AATAATTTCA AAAATGACTA 4 020 

TGCAACAACA GGTAGTTCCA CGTTTTTGTT GTGAAACATT TTCGATTTCT ACAACTCTAA 4 080 

AAAATTAAAA ATAAAATTGC AAAACATCAA CATTTATTAT CAATAGCGAT AACTTTATCT 414 0 

TATCATCATG ATTCTAATTT CGCCACCACA TTTAGTAATT TTTAGTCATA AAATTTAGTT 4 2 00 

ATAATTATAC GTTGTTTTGT TTATAAAATT TGATAATCak GAGTAATCtC GTAATATCAA 4 2 60 

AACaAAAAGG AAGTTAAGCG TTGTTTGGTT GcCTAACTTC CGTTATTGAA CTCATCcAGT 4 320 
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w 



TCTCGTACTA AATATTGGCT AGTATTTTTT TAATTAAATT GTCTTCTTAT ATCAACTTTT 4440 

TGTTGTTGTT TCTTTCGTTG CTGGTCTACT TTGATTTGTT TACCTACAAT CAGAAGTAAA 4 500 

CCCATAGCAA TACTTAAACT AATCATTGAT GATCCACCAA AGCTGATAAA TGGCAATGGC 4 560 

ACACCAGTTA ATGGAATTGT TGCCGAAATA CCGCCAATGT TTACAAACGT TTGACTTCCA 4 620 

AAGTATGTGG CAATCCCAAC ACACACAAGT TTATAAAAAT ATGATGATGT TTTATTTGaA 4 6 80 

AACTGGAAGG CACGATATAC AATAAAGAAC TCTAAAGTAA TAACTAGCAA TCCTCCGATT 4 74 0 

AAACCTAATT CTTCGCAAAT AATTGCAAAA ATAAAATCTG TATGTGGTTC TGGTAAATAG 4 800 

CCCAATTTCA TTGCACTATT TCCTAATCCT TTTCCAAATA CGCCACCGTT ACCTATCGCA 4 860 

AGCAATGAAT TGGAAATATG GTATCCAGTT CCTGATTCGA ATTGGAATGG ATCTGTTAGC 492 0 

GTACTAAATC TGGCAGTTAA ATAACTTGGT AACCAACCAG CCATTAATGC AATGACAAAT 4 980 

ACTACTAAGA ATCCTAGCAC TGCTGGTATA CCAAATCTTA GGACTTTGTT 5030
(2) INFORMATION FOR SEQ ID NO: 325:
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1389 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 325: 

CTTGTTAATC CGAAAATTAT TAGTCAATCA AATGAAACAA TAACAGACTT nGAAGGTTCA 60 

ATTACATTGC CAGATGTTTA CGGCGAAGTG ACAAGAAGTA AAATGATAGT TGTCGAAAGT 12 0 

TATGACGTCA ATGGGAACAA AGTTGaACTA ACTGCACATG aAGATGTAGC AAGAATGATT 180 

TTGCATATTA TAGATCAAAT GAACGGTaTC CCTTTTACAG AACGTGCGGA CCGTATTTTA 24 0 

ACAGATAAAG AAGTGGAGGC ATATTTTATA AATGACTAAA ATAATATTTA TGGGTACACC 300 

AGACTTTTCA ACAACTGTTT TAGAAATGCT TATTGCAGAA CATGATGTCA TTGCAGTCGT 36 0 

AACGCAACCA GATCGACCTG TTGGACGTAA ACGTGTTATG ACACCACCAC CAGTTAAAAA 42 0 

AGTTGCAATG AAATATGATT TACCTGTATA TCAACCTGAA AAATTAAGTG GATCAGAAGA 4 80 

ATTAGAACAA TTGCTTCAAT TAGATGTAGA TTTAATTGTA ACTGCTGCTT TTGGACAATT 54 0 

ATTACCTGAA TCATTGTTGG CATTACCAAA TCTTGGGGCA ATTAATGTAC ATGCATCATT 6 00 

GTTACCGAAG TATAGAGGTG GTGCACCAAT TCATCAGGCA ATTATCGATG GTGAACAAGA 66 0 
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10 



ATTAGGGGCA GATTTATTAA AAGAAACTTT ACCATCTATT ATAGAGGGCA CAAATGAAAG 840

CGTACCTCAA GATGATACGC AAGCAACATT TGCTTCCAAT ATTCGACGCG AAGATGAGCG 900 

AATTAGCTGG AATAAACCAG GAAGACAAGT GTTTAATCAA ATTCGTGGAT TATCACCATG 96 0 

GCCAGTTGCT TATACAACTA TGGATGACAC TAACTTGAAA ATATACGATG CTGAACTCGT 102 0 

TGAGACTAAT AAGATAAACG AGCCTGGAAC CATTATAGAA ACGACTAAAA AAGCCATTAT 108 0 

TGTTGCTACA AATGATAATG AAGCTGTTGC AATTAAAGAT ATGCAATTAG CTGGGAAAAA 114 0 

GAGAATGTTA GCTGCCAATT ATTTAAGTGG TGCGCAAAAC ACACTAGTAG GGAAGAAACT 1200 

TATATGATAG AAAACGTGAG AAGTCTTGCT TTTGACACGA TTCAAGATAT ATTAAATGAA 1260 

GGTGCGTATA GTAACTTGCG TATCAATGAA GTGTTGTCAG AAAATGAATT AAATGCAATG 1320 

GATAAGGCTT TATTTACAGA AATTGTCTAC GGAACCGTTA AAAGAAAATA TACGTTAGAT 13 8 0 

TTTTATTTA 1389
(2) INFORMATION FOR SEQ ID NO: 326: 
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(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 2746 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 326: 

TTTGCTAATA ACAATAAAGC CAAAGCCGAT TCACACTCTA AACAGCTAGA AATTAATGTT 60 

AAGAGTGACA AAGTACCTCA AAAAGTAAAA GATCTAGCAC AACAACAATT TGCTGGTTAT 120 

GCTAAAGCAT TAGATAAACA AAGTAATGCA AAAACTGGTA AATATGAACT TGGCGAaCTT 180 

TTaAAATTTA TAAATTTAAT GGTGAAGAAG ATAATAGTTA CTATTATCCA GTTATAAAAG 24 0 

ACGGTAAAAT TGTTTATACT TTAACACTTA GTCCTAAAAA TAAAGATGAT TTAAACAAAT 3 00 

CAAAAGAAGA CATGAATTAC AGTGTTAAAA TTTCAAACTT CATCGCTAAA GATTTAGACC 3 60 

AAATTAAAGA TAAAnATTCA AATATCACTG TTCTTACTGA TGrAAAAGGG kTTTATTTTG 420 

aAGAAGATGG cmAAGTTAGA TTAGTAAAAG CTACGCCTCT ACCTGGTAAT GTAAAAGAnA 4 80 

AAGAAAGTGC TAAAACAGTT TCAGCAAAAT TGAaACAAGA GTTaAAAAAT ACAGTAACAC 54 0 

CTACTAAAGT TGAAGAAAAC GAaGCGATrC AAGAAGATCA AGTTCAATAT GAAAATACAT 6 00 

TAAAAAACTT CAAAATTwGA GaACAACAAT TCGATAACTC ATGGTGTGCA GGATTCAGTA 660 

TGGCAGCATT ATTAAATGCa ACTAAAAATa CAGACACTTA TAATGCACAT GATATTATGC 72 0 
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AAATGATTGA ATACGGTAAA TCACAAGGCA GAGATATTCA TTATCAAGAA GGCGTACCAT 84 0 

CATATGAACA AGTTGATCAA CTTACAAAAG ATAATGTAGG AATTATGATC CTTGCACmAA 90 0 

GTGTATCTCA AAACCCTAAT GACCCACATT TAGGACATGC GCTAGCAGTT GTTGGTAATG 96 0 

CTAAAATTAA TGACCAAGAA AAACTTATTT ACTGGAATCC TTGGGATACA GAaTTATCAA 102 0 

TCCAAGATGC AGATTCAAGC CTATTACATT TATCATTCAA TCGTGATTAT AACTGGTATG 10 8 0 

GTTCAATGAT AGGTTACkAA AAAGTAATAT AGATATTGAT TAAAGGCAGG TAAAACTATG 114 0 

TATCAACTAC AATTTATAAA TTTAGTTTAC GACACAACCA AACTCACACA TCTAGAACAA 12 0 0 

ACCAATATCA ATTTATTCAT TGGTAATTGG AGTAATCATC AATTACAAAA ATCAATTTGT 12 6 0 

ATACGTCATG GCGATGATAC AAGTCACAAT CAATATCATA TTCTTTTTAT AGATACGGCA 13 20 

CATCAACGCA TTAAATTTTC ATCTATTGAT AATGAAGAAA TCATTTATAT TCTTGATTAT 13 8 0 

GATGATACAC AGCATATCCT CATGCAAACG TCATCCAAAC AAGGTATTGG CACTTCGCGA 144 0 

CCAATCGTTT ATGAGCGCTT AGTATAACTA ATTTAAATGA TTTCACTTCA TAAAGCGGGT 150 0 

TGGCGAGAAT TCAATTTCTC ACCAGCTCGT TTTTTCATTG TAATAATAAT CTTTAACATT 156 0 

TATTCTTTCT CTATTAATTT TTCTCAAACT ATCTTATCTT TATGATAATT AATTAAAATG 16 2 0 

CCCTTTTAAA TTCTTATAAA ATAAAAAAGC CACCTATCGT CGCTAATAAA CGACGCAAGT 168 0 

GACTTAATAT CATATTCAAA ATAACTTATG GGAATTTAGG GAATTGATCG AAGTCAGGAT 174 0 

CACGTTTTTC TTTAJIlACGCA TCACGGCCTT CTTTCGCTTC ATCAGTTGTG TAATAAAGCA 18 00 

ATGTTGCATC CCCAGCCATT TGTTGTAAAC CAGCTAAACC ATCTGTGTCA GCATTCATAG 186 0 

CTGCTTTAAG GAATCGTAAC GCTGTTGGTG AGTGTTTCAT AATCTCTTTA CACCATTGCA 1920

CAGTTTCATC TTCAACTTTC TCTAAAGGTA CCACTGTATT TACTAGACCC ATATCTAAAG 198 0 

CTTCTTGTGC ATTGTATTGA CGACATAAGT ACCAAATTTC ACGTGCTTTC TTATGTCCAA 2 04 0 

CGATACGTGC TAAATATCCT GAACCATAAC CCGCATCAAA TGAACCTACT TTAGGACCAG 2100 

TTTGTCCAAA AATAGCATTA TCAGCAGCAA TCGTTAAGTC ACAAACAACA TTTAGTACAT 2160 

TACCGCCACC TACAGCATAA CCTTTTACCA TCGCGATAAC CGGTTTTGGA ATAATACGAA 222 0 

TTAAACGCTG TAAATCTAAT ACATTTAAGC GAGGGATTTG GTCTTCACCT ACATAACCAC 22 8 0 

CATGTCCACG TTTCTTCTGG TCACCACCAG AACAGAATGC TAAATCACCT TCACCAGTTA 23 4 0 

ATACGATAAC TGAAACGTTT TGATCATCAC GTGCACGTGA AAATGCGTCA ATCATTTCAG 24 0 0 

CAACTGTTTT AGGTGTAAAC GCATTGCGTA CTTCAGGGCG ATTTATTGTT ACCTTAGCAA 24 6 0 
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TCCACATGAA TTGTATGACC TGTAGCAGAA ATTAATTTAC ATTTACTATT AGGAATTAAA 264 0 

TTTGCCATTT TTTTCGCAAT CTGTACAAAT TTTTCATCAT ATTCTCCAGC TAATATTAAT 270 0 

GTTGGTACTT TAATTTCTnT CAGCGCGGCC ATAAGTTTGG CATTTG 274 6 



(2) INFORMATION FOR SEQ ID NO: 327: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 900 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 




(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 327: 

TAATGTTTAG TTTATTAACA GTAAGTTCGT ATATCAATGT TTAGTGCTCC CCAAAATTGA 60 

AGTTTGAATT TTAAAAGCAT CTTGTAGAAT TTAGTTGTAT TTTTTTCAAA GAAATTCATT 120

TTGATTATTT TTGATAATGA GCATTTTAAT AGTAATACAT GTTTATAGTG TGTAGTATAT 180

GTCTATACTA GTAGTAACTA TATAGAGAAA GTAGGAATAA ACTATGTCAC AAGATGTAAA 24 0 


TGAATTAAGT AAGCAACCAA CGCCAGATAA AGCAGAAGAT AACGCATTTT TCCCATCACC 30 0 

ATATTCCCTT AGTCAATATA CAGCACCTAA AACAGATTTT GATGGTGTTG AACACAAAGG 36 0 

TGCCTATAAA GATGGTAAAT GGAAAGTATT GATGATTGCT GCTGAAGAGC GATATGTATT 42 0 


ATTGGAAAAT GGAAAAATGT TCTCTACGGG TAATCATCCT GTTGAAATGT TATTACCTTT 4 80 

ACATCATTTA ATGGAAGCAG GTTTTGACGT TGATGTTGCG ACATTATCTG GTTATCCAGT 54 0 

TAAATTAGAA TTATGGGCTA TGCCAACTGA AGACGAGGCA GTTATAAGTA CTTATAATAA 600

ATTGAAAGAA AAATTAAAAC AGCCAAAAAA ATTAGCAGAT GTGATTAAAA ATGAATTAGG 66 0 

ACCTGATTCA GACTATTTAT CTGTCTTTAT CCCAGGCGGA CATGCTGCAG TTGTTGGTAT 720 


TTCTGAAAGT GAGGACGTTC AACAAACATT AGATTGGGCA TTAGACAATG ACCGCTTTAT 780 
AGTTACATTA TGTCATGGAC CAGCAGCACT ACTTTCAGCA GGGCTTAACA GAGAAAAATC 84 0 

TCCATTAGAA GGATACTCTG TTTGTGTCTt CCCTGaCTCA TTAGATGAAG GTGCAAATAT 900 


(2) INFORMATION FOR SEQ ID NO: 328: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3642 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
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ATCTGAnTnG AGATAGTGAT AATGTGTCAC CcAATTTTaA AcCtTTATTT TTaAGcGtTT 60

CaTCaGCTAA CaCtTCaTTa TCtTTAGtCG CTTTATGCCC tTCtATTAAA CtCGGaACtA 120

AAAATGaTGa CtTTTCAACa CCGAACaCTA AAACaTTGtC tTTTTGATGG CCaTTAGACA 180

CAATTTCCCC TGTTTGCTTC AAAGTAGCTT GCTTCTTGTA TTTATTTTCA ATATCTTTCT 240

TGTTAAAAAC AGATTGTTGC ACAGTTTGAT TGGCATCTTT ATTTAGAACA ATGGCATCTG 300

CTTGCCACTT ATCAATGCCT TCTTTATTCA TATTGATAAG ACCATTCGCC AATCCAGATA 360

ATAAAAATAG CAAGTAACTA ATCATCGTTA ACACACCAAT AATTAGTCCA AACTTCAATT 420

TGTTGCGCCG TATTTCATTC CAAGCTAAAA ACATGCATTT CTCTCCCTAC TACTATGATT 480

TAAACATTGT TTATATTCTT AGATGCACGT ACGTCGTGTT GCGCTCTGTA ATGTTATACA 540

TACACTTATC CTTCATTATA CCCGAACTTT TTATATTAAA AGCAAATTTA TGGAAAATGC 600

AATTAATTGT CTATTATTTT TGTACGGTAC ATTTAAAATT AAGGATCAAT TTAAAAACGC 660

CTACATATAC CTTTAAGTAC ATGTAGACGT CCAATTCATA TATTATTTAA CrrCGCCTGT 720

TTTAGGATCG AATTGCTTAA TAGCATTTTT ACGTAATTTA TCTTTTGCTT TrTCACTTGC 780

TTTATAGTTA TTGTTGTAAA TCGTAGCTTC CCAACTACCA TACATTGGGT TAGGGAAAAT 840

GATATATTTC TTACCGAAAT CGTCTTTATG TTTTTCAATT AATGCTTCAC GAGATTCAGC 900

TGTAGCTTCT TTTGGATCTG TAAAGTCTAA TAAATTATCT CCAAATAGCA TGACAAGTTT 960

ATGATCCTTT TGA.ACCATTT GTCTGCGTGA TTCTTTACTC TTATCATCTT TACCTTTTAG 1020

TAAAATATGA CTCTTCTTAG CTTGAGGGAT ACCTTGTTGT TTTAAGTTCT TTTGTGTTGC 1080

CTTTAAATCT TTTTCTTTAT CTCTATCAGA AATATAGTAG ATATCGACAC CTTTTTTGTC 1140

AGCATATTTC AAGAATTCTT TTGCGCCATA GACAGGTTTA GCTTTAGCAG CTTGTACCCA 1200

TTCATGCCAA CCTTCTGGGA AAGGTTTATT ATGTATTGAT GCATAGCCTT GATATGGAGA 1260

ATTATCTAAA ACTGTTTCAT CTAAATCCAA AGCAATAGCT AACTTATGTT TACCTTTATT 1320

CTTTTTAATC TCTTTATCTA ACTGTGTCTT TGCACTGTTA TAACCTTGTA AATATAATGC 1380

TTTTGCTTCA GCTGAATTTT GATACCAAGC CACTGCCATA ATATTTTGAT TACCAAGATT 1440

CGCCTTTTGT GATGCTGGTA TAGAAGCTTG TTGCGTTTGT TGAACTTCAG CAGAACTTTT 1500

GGCAAACGCT GTAGAATTTG TCGTTTGTGG TGCTGAAACT GTAACCGCTA CCGATAATGA 1560

TGCTATTGCA ATATACTTTG AAATTTTATT CATCTTATCA CCTCATGATT AATATTTAAA 1620

ATACAGTTAA AATTATAAAT GCATTTATTT AATATTGCTA TACTATGAAA AGATATTTAA 1680

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GTGACTAGGC CTTCCTATCA GACATATTCA CTCATCCACG TATCATTATG TGTACAGTGT i86 0 

GCTATCTCTT ATTTACCTAT TGGAACAACC ATAAACTCAT CCATAGTTTA CCTTTTATAA 1920 

ATAGCAGTCC TCACTCATAC AATTTCTCAT AAAAATCACA ACGCTCCAAC GTATTTCCAA 198 0 

CTTACTTTCA CCTATTTTAA TTCATAAAAA CGACACTTTA ATTGTCATTA TCCAATAATA 2 04 0 

GCAAGACGTT ATTATTGCAA TCTTTTTTAT AAAATAATAG AATCATAGTA TTGTCATTTA 2100 

AAGATAAAGT AAGAACGTTT TTATTTTTCA GATTTTTTAA ATTATTATGA ATATCTAGTT 216 0 

TTAGGAAGGA AATTACATTG AAAAAACAAG TTATTATTTC GGGCCTCATG TTATTTTCAC 2 22 0 

TATTTTTTGG AGCCGGAAAT TTAATATTCC CGCCCATGCT TGGCCATACA GCGGGTCAAA 228 0 

ATATGTGGAT TGGTATGCTA GGCTTTGCCC TTACAGGCAT ATnACTCCCC TTTATTACTG 234 0 

TTATTGTTGT TGCATTTTAT GATGAAGGTG TTGAAAGTGT AGGCAATCGT ATACATCCAT 24 0 0 

GGTTCGGGTT TATTTTTGCT GTCGTGATTT ACATGTCTAT CGGAGCATTT TACGGTATTC 2460 

CACGTGCTGC AAATGTCGCG TACGAAATTG GTACAAGACA CATTTTACCT GTGCATAACC 2520 

AATGGACTTT AATTATATTC GCAGCAATCT TTTTTGCCAT CGTTTACTGG ATTAGTTTAA 258 0 

ATCCATCGAA AATCGTTGAT AATTTAGGTA AATTATTAAC ACCGTTATTA CTATTAATGG 264 0 

TCGCTCTATT AAGTATTGCT GTCATTTTCA ACCCTGAATC TGCACTAAGT GCACCTAAGG 27 00 

ATAAATATAT AACACATCCT TTCT^TTTCAG GAAGTTTGGA AGGCTATTTT ACAATGGATC 2760 

TTGTTGCTGC GTTAGCTTTT TCCGTAGTCA TTGTCAATGG CTATAAGTTT AAAGGCCTCA 2 82 0 

CAGATCGCAT GAAAATTTTA AAATATGTCT GCTTTTCAGG TCTTATTGCA GCCATATTAC 2880 

TTGGAATGAT TTACTTTGCA CTTGCATACG TTGGGGCATC AACAGCTCCA GGAAACTTTA 2940

AAGATGGTAC AGATATATTG ACGTACAACT CATTACGATT ATTTGGTTCG TTCGGTAACC 300 0 

TCGTATTTGG AATGACGGTT ATCCTTGCAT GCCTAACAAC ATGTATAGGA CTCGTcAATG 306 0 

tTGCGCCACA TTTACTAAGA AACACGTACC TAAGTTTTCT TATAAAATAT TCGCACTTAT 312 0 

TTTCtCTATC ATAGGGTTCT TATTTACAAC ACTTGGTTTA GAAATGATTT TAAAAATTGC 3180 

TGTCCCATTA TTGACTTTAA TATATCCCGT GTCGATTGCA CTTGTACTCA TATCATTTGC 324 0 

TAACATGTTT AGCACATTCA GATTCAGTTG GGCCTATCGA CTCGCAACTG TTATTACATT 3 3 00 

GATTATTTCA ATTTTACAAA TACTAAATAG TTTCAACTTA TTACACGGTG TTATTTTGAA 33 6 0 

ATCGTTTATG ATGTTACCTT TAGCAGATAT CGATTTAGCT TGGCTTGTAC CATTCATGCT 3420

CTTTGCTATT ATCGGTTTCA TAATCGATGT ATTTATACGC CGTCCGAAAC AAGCGACAAC 34 8 0 

TTAATAAATG CTCACTGCCT AGTAATGATT GACCCATCGT TACTAgGCTT TTttATATGA 354 0 
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TAPiPiAPiATCC TAGCTGTTAT TCAAAAATAT TAGTTTTTAA AA 3642 

(2) INFORMATION FOR SEQ ID NO: 329: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2187 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 329: 

TTATTGATAT TGAAAATTCA AAAACTGCAA CACATCTTAT CACTAAAAAC CAACACTATC 6 0 

ATCAAACTGA CACTCATTTT GAACAGTATA AGAAATTTAT TTTAGATTCA GGTATATCAT 120 

CAACACAATT TGTATATAAT AACCTGTCTG TAAGCGGATT TAAATATACT AATGATGGTA 180 

AGAATCCAAT TCAATTATCT GACATAGTGT ATCACTTAAT CGCATTATTA CGATATGGCG 24 0 

GTGGCATTAG CTATCAATTA TTAGATGACC ATTCAAATTA TATTTCCTTG TACAACAAAT 3 00 

ATGGTAGCCC CCTACCATTA ATGCATTTAT ATAAAATGTT TAGACCTTTT GTTAATGAAG 3 60 

ATATTGAAAT TACAAATAAT TATGTATTGA GTCGTAAAGA TAATAACTAC CATTTCTTAT 42 0 

TATTCAATAA AATTAATGAT CGATATATGT CAGACGTAAA ACAAGATTTC ATTTTCCATA 480

ATGAATTACC TCAAGACTCT TTGATGATTA TTAAAACATT GAATCATGAA CATGGTTCAA 540

TTCAACATTT GCTTCCAATA AQCGATCAAC TTGTTTATAT AGAAAAAGAA ATTTTAGATG 600

AATTAGACAA AACGAATTAC CCTAAAACGG AGCTTGCAGT TCAAGAAGAA ACTGGTCGTA 660

CATTTGAACT CAAGTTAAAT CACGACGAGG TTAAATATAT TTGCTTTAAA CCAAGCTAAA 720

TACTAACAGT CCTCTTGTGT TTAGTTTCTT ACGTTAAAGG CTATTTATAT CATAAGGAGA 780

TGATTTGTAA TGAGTAACTC ACAAGCAATT CAAGCAATTG AAAACGTGTT AGTAACGTCA 840 

AAAGTTGGTG TATTATCAAC TGCATATAAT AATAAnCCTA ATAGTAGATA TATGGTCTTT 90 0 

TATAATGATG GTCTTACTTT ATATACTAAA ACGAATATCC ATTCTGCTAA GGTCAAAGAA 96 0 

ATTAAAGATA ATCCAGCAGC ATATGTTTTG TTAGGCTATA ACGACACAAC TAATCGTAGT 1020

TTCGTTGAGA TGGAAGCGAC AATCGAAATC GTTACAGAAC AAGAAGTGAT TGATTGGCTA 1080

TGGGAAACAC AAGACAAAAG CTTTTTCAGT TCAAAAGAAG ACCCAGAGTT ATGTGTTTTA 1140

AGAGTAGTTC CGCAATCCAT TAAGCTAATG AATGATAAAT CATTAGATAC ACCTATCAAA 1200

A-T^CGAT-TTAT AACACAAAGT GTATATAGGA AATAACTTTT ATGAATTCTA GATATAACAA 126 0 
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TTTAAATTAA TTTTATGTAA TATAAATACT GCATTTGCAA ACTGTTGCAC TTTTAGGTAT 14 4 0 

AACAGAATTA ACTACATTTA AGGAGATTGA TGAACCATGA AAAAGAAAAA AGGTTTTGGT 15 00 

CTTGGTATTA GTTTAATCGC CATCATGTTA ATTGTATGTA TTGTATTAGT AATCATGATG 15 60 

ATGACTGGCG GAAAGAAAGA TACATACTAT GGAATTATGA AAGATAATAC TACTATTGAA 16 20 

AAAATGATTA GTGAAAAAGA TGAAAGTATT GAAAAAAATG TTAAATTACC TTCAGATTCA 16 80 

GATGTTAAAG TTAAAAAAGG TGATTTTGTA ATTGTTTATA AATTAGCAGA TTCAGATAAA 174 0 

ATTGTTAAAG TTAAAAAAGT TGACCATGAC GATGTACCAC ATGGTTTAAT GATGAAAATT 18 00 

CATGACATGG GCAAAATGCA CATGAAACAC TAATTGTAAT TTAAATTACA AATTTTAGTT 18 60 

GCCATCAAGG TATATACGAG TAAAAGCAGC GGTAAGTTGA TTTCCAATTT GGAATCATTT 1920 

TACTGCTGCT TTTTATATTT GAAATACTTT CATATTGAAT AGCTCCACTT GCCGTTCGCC 198 0 

TgcgCTTTGC GCATGCATAA AAGCCCCTAA CAACCTGAGG TCACTGCGCT CCGGTTCGCC 2040

TGcgCTTTAG CGCATGCATA AAAGCCCCTA ACAACCTGAG GTCACTACGC TTCGGTTCGC 2100

CTGCGCTTTA GCGCATGCAT AAAAGTCCCT AACAACCTGA GGTCACTACG CTTCCGTTCG 2160

CCTG.CGCTTT AnCATGGCCA TAAAAGC 2187

(2) INFORMATION FOR SEQ ID NO: 330:
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ii) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1788 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 330: 

CCnCCTTTTT AAACCTGGnG AAATGtmCAG tTTTGATGGt ATTGGGtTCT TTAGTATTAT 60 

GCTTAAGATA GAGTGTAATA CGCTCCTGTT GTTCTTTATA GTATATTGCT TTTTGTTTTT 12 0 

CTTTTTCGTC ATATTTCACT TTTAAATAAA TGACTGATGC AACTATACAT ATACATAAAA 180 

TGACACCAAT AATTATAAAA ACATGTTTTT TGTTCATTAA AGTAACTCCT AAAATGTGGT 24 0 

GGAAAATGAA AAAATTTTTA TAATCTATAA TTATGAACAT TACAGATTAT AAACCATAAC 3 00 

ACTAACATCG TCGCCTTCAT TAACTTGATT GTTAAAATCA GCAATTACTG AGAATTGTAC 3 60 

AAGGAATAAT TGCCTATTAT GCCCTCATGT AATTATTGCC TTACTAACAT TAACAAAATC 420

GATAGCTATT ACATTAAATG CCTATACCCC AGACCTCAAA CACCTTTTTA TACAGGACGC 480

ACTGTAATTT CATyyACGTT AACGTGCTTT GGTTGTGTTA ATGCATATAA TACTGCTTCT 540

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GTATCTACCA TTCCTGGAGA AATGCTTGTT ACTTTAACGC CTGTCTTTGC CAACTCTTTT 660

TCTAATCCrr GAGTAATAGT GTGAACTGCT GCTTTCGTCG CACTATAAAT CGTACTACTT 720

rrCGTTACTT CAAAGCCAGA AATAGATGcA ATGTTAATAA GATGGCCACT TGATTGTTCT 780

AACATAGTTG GTAATGCAGC CTGTGCCGTA TATAAAGTGC CTTTGATATT CACATCAATC 840

ATACTATCCC ACTCATCTAC TTGATAATCA GTAATCTTAG ACGACAACAT TTGCCCCGCA 900

CTATTGATAA CAATATCCAA ACCACCGAAT GTTTGTTGTG CAATTTTTAT CAATTCATCG 960

ACTTCTTCTT TATTCGTTAC ATCTGTTGGC ACTACCTTCA CACTATCTTG TGACAATTGA 1020

TTCGCTACGT TTTGTAATTT rrCTTTATTT CTACCTGCTA AGACAACTTT TGCCCCTTCT 1080

TCATGTAGTA ATGTTGCAAT TGCTTCTCCA ATACCACTAC CTGCACCTGT AACTACTGCT 1140

ACTTTATCTG TTAATACTGT CATAATGATC GACTCCTTTG ATTCTTTTTA TTTTTTCAGG 1200

^^.^^Tr-^T-L JiaTRCATATT ACTTITAAAA AGCGTATCAC AATTCATATA ACGGTCATAA 1260

TAACTCGCTT CATTTTCATA GATAAATTAC ATTACAAGCC ATTCGAAACA TACAATTAAT 1320

CGTTGCTTAT ATTTTTTATT TTTAAAAATG TTGAAAAATC GTCACTTCTT TATTGTAAAA 1380

ACATTATATT AGTAATAAAG TTAATACTGT GnATTTaTCA TTCGATTGAA TGATTAGAGG 1440

GAGGAATAAA ACgTGACATA TCATGAGCGT GTTTTAGCAT TAAGAGCAGA AAGTAAAAGA 1500

ACCGCATTTG ATTTTCGATT CGAAGATTTA TTTAGCAAAG AAGAATGGcT AAGTATGTCT 1560

CTTGCAGAAA GACAAAAAGC TGAAAAAGCA TTTCGACACG AgTTAAAAAT ATGGACGATG 1620

TAAGAATGCC CTTCTCAAGT GTCCATGACG CCCAAGTAAA ATTATATAAT GTTGTATATT 1680

CTTATAACGG CaTTAAACGT AATTTTAAAC AAGTTGAAAA TGGAAGGATT CTAATATCAT 1740

TTCGT-TTATA TATnGCAGAC CATGQATAGA ATTTTnTATG GTnAATCC 1788

(2) INFORMATION FOR SEQ ID NO: 331:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1341 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 331:

TAAGCCAAAT CCATTGTTAC CAGAAGAAGT TCGCTTCATC TTAAAACAAA TGGGTCTTAA

...^^.^.A-^-^ ATCGATGTTG AACTCGAAGT TGGCGAGCAA GTTCGTATTA AATCAGGTCC

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GCTTTAATTA ACAATTAAAG TTATTAAACT AACCAAAAGA TAAAAAAGAG TATTGATTTT 3 00 

TTAATTAGAA AAGTGTTAAA ATTATGTGGT cGcGCTTTTA GAGCGCCCAT TTCGTcACGA 36 0 

AATGTTAAGA GTGGGAGGGC AAAACTGAGC CCTGTGACCA CATCACGATA TCAAGGAGGT 420 

GCACATCGTG GCTAAAAAAG TAGATAAAGT TGTTAAATTA CAAATTCCTG CAGGTAAAGC 4 80 

GAATCCAGCA CCACCAGTTG GTCCAGCATT AGGTCAAGCA GGTGTGAACA TCATGGGATT 54 0 

CTGTAAAGAG TTCAATGCAC GTACTCAAGA TCAAGGAGGT TTAATTATTC CGGTAGAAAT 600 

CAGTGTTTAT GAAGATCGTT CATTTACATT TATTACaAAA ACTCCACCGG CTCCaGTATT 660 

ACTTAAAAAA GCAGCTGGTA TTGAAAAAGG TTCAGGCGAA CCAAACAAAA CTAAAGTTGC 720 

TACAGTAACT AAAGATCAAG TACGCGAAAT TGCTAACAGC AAAATGCAAG ACTTAAACGC 780

TGCTGACGAA GAAGCAGCTA TGCGTATTAT CGAAGGTACT GCACGTAGTA TGGGTATCGT 84 0 

TGTAGAATAA TTTTACGAAT ATTAAATTTG ATTACATGAT TTAAACGATG AAGCAGATAA 90 0 

CAGAGATAAT AATGATGAAT TATAAATATA ATCTGAATGA CTAGATTAAT GATTGATTTA 960 

TTCATAAGAT TAATTCTTCT GTTGTCTGCy CTTAACTTGC ATATAGCAAG TAATGTGGGA 102 0 

GGAAATTCCG CTAAAACCAC TAAAGGAGGA ACTATAAATG GCTAAAAAAG GTAAAAAGTA 1080

TCAAGAAGCA GCTAGTAAAG TTGACCGTAC TCAGCACTAC AGTGTTGAAG AAGCAATTAA 114 0 

ATTAGCTAAA GAAACAAGCA TTGCTAACTT TGACGCTTCT GTTGAAGTTG CATTCCGTTT 12 0 0 

AGGAATTGAT ACACGTAAAA ATGACCAACA AATCCGTGGT GCAGTTGTAT TACCAAACGG 12 6 0 

AACTGGTAAA TCACAAAGTG TATTAGTATT CGCTAAAGGT GACAAAATTG CTGAAGCTGA 1320 

AGCAGCAGGT GCTGACTATG T 1341

(2) INFORMATION FOR SEQ ID NO: 332:
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5136 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 332: 

CTCTAAATCT TCAATAGGTA ACTTCATTTT AATAATTCCC ATTTGAACAA TTGCTTCTTC 60 

50 ATCATTGACA ATTTCTAAAA CTTCACCCTT TTGACCGTAA GAkAATACTT TkACTTCATC 12 0 

ACCAGCAACA ATTTTATCGT ATTTTTGCTT TTGTACATTT TGCTTTATAG ATTTCGCTTC 180 

ATAATGATCA TCTAATCGTT TCTTCTTATC AATCAATTCA TGTTCTTTAA CATCTGCACC 24 0 
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CTTAATCTTC TGATTTGCTT TkTCTTTCGC TTCCTCTATT AGAGACTTTT CATAATTTTG 360 

GAATTGTTGA TACTGCTTAG ATAAATCATC ATGCACTTGt CCGCkTskTT TACAAGACGG 420 

TCCAGkTCTA ACCTCTGTGT CTCTACACGT TTGTAATTAC GCTCTAATGA TTCAATCATT 4 80 

TCATTTATTT CTTTTTCATC AGTACCAATC ATCGTCTTAG CCTTATTAAT AATATTCAAA 54 0 

CTAAGACCTA ACTTTTTAGA AATGTCAAAA GCATTTGAAC GACCCGGCAC ACCCATTAAT 60 0 

AACTTATACG TTGGACTCAA AGTATCTACA TCAAATTCTA CACTCGCATT CATAACGCCT 66 0 

TCTCGATTAT AACTATATGC TTTAAGTTCA GGATAGTGCG TCGTTGCCAT TACTAGAGAA 720 

CCAATTTTTC TAACATGATC TAAAATGCTC ATTGCTAATG CAGCACCTTC ACTCGGATCT 7 80 

GTACCTGCAC CTAATTCATC AAATAAAACT AAACTATGTT TGTCTGCATG CTTTAAAATT 84 0 

TCAACTATAT TCGTCATATG AGATGAAAAA GTTGATAATG ATTGTTCTAT TGATTGTTCA 900 

20 TCTCCGATAT CGCAATATAC ATTTTTAAAT ACACTTAACT GACTACCATC AAGTGTGGGA 960 

ATCAACAATC CTGATTGAGC CATAACAATA ATTAAACCTA ATGTTTTTAA TGTTACAGTT 102 0 

TTACCACCTG TATTCGGTCC TGTAATAATT ACCGTTTCAA TATCTTCCAT AAATTCGATG 10 8 0 

GTATTAGCTA CAACAGTCTC ACGATTTAAT AATGGATGGT ATGCTTTAGG TAAATATACA 114 0 

GTACGGTCCT CTTTAAATAT CGGCTTTGTT CCTTTAATAC TTCTACTATA TCTCGCTTTT 12 00 

GCGATTAAAA AATCTAACTG ACCCATGACT TGTTCTGCCA CAAGTAGTGC ATCTTTGTCC 12 60 

GCAGCCAC^T AACCAGTTAG TTGCGTTAAA ATGCGTTCTT TTTCAATTGC TTCGTCATGA 13 2 0 

CGTAATCGAC TAATTTGATT ATTCATTTCA ACAACTGATG ATGGCTCAAT ATACAATGTT 13 8 0 

TGTCCTGAAG CAGATTGATC ATGTACAATC CCATTAAAAT CTTGTCGATA TTCAGCTTTG 1440 

ACAGGTATAA CGTTTCTTTC ATTCCTAACT GTTACAATAG CATCTGATAA TTTTTTCTGA 1500 

TTTGCTTGGC TTTTAACAAT ACGGTCCAAA TTTTGTCTAA TACGTTGATT CGTGCTAGAA 156 0 

ATTTTACTTC TAATCCCTTG CAATTCATAA CTCGCATTAT CATATAAATC ATACGTATCG 1620 

CATGTTTCAT TTATTTGTTG AAAAAGATCA GTTAACACAG GTAATTGATT CATCTTGTCA 168 0 

TCTAATATTG GGTATTTAAC ACCTTCATCT TCTTCAACCA ATTGATTATA AAATGTCTTG 174 0 

AATTGATTTT GTACTTGAAT TAATCTTTTT ATCAJVGTTAA GCTCTGATAC ATTTAAAACG 1800 

CCGCCAATAT CAGCGCGATG AATGAATGCT GATACTTTAG ATAAGCCACT CAAGCTTGGT 186 0 

AAACGATGCT TATTATAGAT TTGAGCAATC TCATCCGTTT CTTCCATTTG AAAAACAACC 192 0 

GTTTCAAAAT TAGTAGCTGG CATCATTTGA TTGACCTTTT CCAAGCCTAA GTCACTAATA 1980 
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TCACGCGATA ATGCGTTAAT CACTCTATCT TTTGTtACAA ATCCTTTTTG CGCAGTTGtA 2160

CGCCATAATT CATAAAATCT AAATGATTTG TATGATGCGC ATCAGTGTTA ATAGTTAATT 2220

TCACATTTGG ATATTTACGA ACGATATCAG CGCTCAGATC CAGTCGATGT GGATTGGCAT 2280

TAATTTCTAA TACTGTATTC GTTTCTTCAG CTAATGCCAT TAATTGTTCA ATATTCGGTT 2340

TATAACCATC TCTTCTACCT ATAATACGCC CTGTTGGATG CGCTATATGT CGCACGTATG 2400

GATTGCGACA TGCATTAGCT AATCGTTCCA TAATTTGTTC TTCTGATTGG TTAAAGCTTT 2460

GATGAATAGC TCCAATTACA TAATCAAGTT GTGCTAAAAT TTCATCATCA TAATCCAGCG 2520

AGCCATCAGG TAATATATCC ATTTCTGTAC CTGAATAAAT ATCAATTTCA CTATATTCTT 2580

TATCTAAAGC CTTAATTTCT TCGTTTTGTC TTAAAAGTCT TTCTACTTGT AAGCCATTAG 2640

CAACACGTAA ACTTTGTGAA TGATCAGTAA TTACCATGAA TTTATAACCT TTTGCGATAT 2700

TTGCTTCTAC CATGTCTCGA ATAGAAAACG CACCATC7VCT ATACGTTGTA TGCATATGAA 2760

TATCACCATT AATATCATCT ATTGTAATGA TATTACTTAG ATCTTTATCA AATTCGCTAC 2820

CATCTTCTCG CATAGCAGGT GGTATAAAAT TCACATTAAA ATGTTCATAT ATCTTGGCTT 2880

CACTATCATA TTGAATTAAT GTACCATCAG CTTGTTCAAT TCCATATTCA CTTACTTTTT 2940

CATCACGTGC TTTAGCAAGT TGTCGAATTC TTATATTATG TTCTTTTGAC CCAGTAAAAT 3000

GCTGCAATGT ATGATAAAAA GCACTTGGTT CAATTAATCG AAAATCGACA CCAATCGTTT 3060

CATCATCATA CGCTAATTCT AATGAAACTT TTGTGTTCCC CACTGCAACT TCTTTTACTT 3120

TATTGGGAAT ATTTAATAAT TGCTGCTGCA CTGCTTTTGG GTTATCGGTA CTTATTATGA 3180

AATCTAAATC TTTGCTCATT TCTTTAAAAC GACGGAAGCT TCCTGCAGAT GAATATTGAT 3240

CGATATAATT TAATGTATCT ATATAATCAA TGATTTCTTG ATTAAGTCTT CTCATTTGAT 3300

CAATTGGATA TCTATCTTTC TTAGCACCAA GTTGTTTCAC AGCTTCTAAT ATGTTTTGTT 3360

CCGTTTTCTT AGCAAATCCG CTTAATTCAC TAACTTTTCC ATTTTCACAA GCAACTTGAA 3420

GTGACGCTTT ATCAACAATA TTCAACTCTT TATATAGCTT AGCAATTTTC TTGCTTCCAA 3480

GTCCTTGAAT TTTCAAAAGT GGAATAAGAC CTTCCGGAAC TTCTTCCTGT AATTGCTGTA 3540

AATACTGAGA TTCACCGGTC TCACGGTAAT CATTGATTAC TTCTGCAACA CCTTTACCAA 3600

TGCCTTTtAA CTCCGTtACA TCAGATATTT CATCTAATGO TCGTTCATCT AATTCAAGAC 3660

TTTGAGCTGC TTTTCGaTAC GCTGaTATTT TAAAAGTATT TTCCCCTTTT AATTCCATAT 3720

AAGTAGCAAT TTGTTCTAAT AGTTTGATAA CATCTTTTTT TGTCATAATA ACACTCCATA 3780

AAAAGAAGAC CAGGACGTAT CATTAATATA TACCTTTGTC CTGACCTCTT ATGATAATTT 3840
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TAGATArrrr aagctgatct tctaatgctt cgttaggata taatgccaat agatataacg 

TAAAGTGTAA GACAATTATC GTCATAAACA CACCAACTAT CATTCCCATT GCACGACTGA 
AAATATGAAT GTTTTGATAC GCTATTArrT TATCAAAAGT TACGATAATT AGATATAAAA 
TGAACTTACA AAAC^TGTA ATCATTAAAA AAGCTACAAT CGCTTCAAAT CGATITrGTA 
GATGATTAAA ATGAAACGCA AAAGTIGTAT TAAATGCTGT TGrTTTAGGA TATGGAATAA 
ATACAATTAA TCTTTCTACA ATAGATTTGT AAAATTGACT GGCAATCCAC AATGATACAA 
nCGTTGCACT CAAATGTATC ATAGATAACC AAAAACCTCG TCTOAATCCA ACGATGACAA 
AATACACAAA GAAAATGATT ATGATAAAAT CAATGACCAT rTATTGCTCA CGCTGCTGCA 
ArriGTGAAT rrGTTGTTrC AAACGTCGAT TTTTrrTCTT CTAGTAGTAC rrrTTCATGC 
ATAATATTCA CAGCAGTTAG TATTGCTTrT CTTGAAGTAT CTAAACCTGC TGCTTTATAC 
ccTAA.rrcrrr TTATTTTATC ATCAACTAAA TGTCCTACAT atcgtatgtg ctctgggtta 
TCTTCCCCAA CAAITGTAAA AAGCTGATCA TTAATTGATA CATTTACCTT GTTTTTAAAC 
TGTGTCATTT ATAATTTCTC CTGATCCTTT TTTTAAAATC TAAATTCACG TTATAAAATA 
TGACTGGATA GTirGTCTGA ATTTGATACT AATATTCTTA TATTGCAATT ATGATAAAAC 
AACAACACAA TCTCTATAGA TGACTTAATG TTCmTTAT AATGAAATAA TGTAAAGAAT 
TTTCTATTCA ATACTTTATC ATGTTTAAAT TGTCTCACTA TAACATTTTC ATAAACATTA 
TACATGACCA CTATGTATTT TGTAAGTATC CGCAATTAAT TCTTTACAAC ATACATAAAT 
GTTTCTGACG TTATTATCAT TTATGATATG ATTAnTrTG CTAAAGACAA TGAAATTTTA 
TGAAAGGATT TACACAAATG GCGAATATCG rrTTTAAATr GTCGGATAAA GACATAACGA 
CATTAATGTC ACGCATTTCT TTTGATACTG AGAATTTACC TCAAGGnATG AnAGCACGTG 
CAAAGTATCC AAAATACAAC TGTAAATATT TACCAT 
(2) INFORMATION FOR SEQ ID NO: 333: 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 4239 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 333:
.GCCAAAATT GCACCTCCAC TTTCCTTTGA ACAAATCTAT TTTAAACGCA CATTCCCATT 
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GGTATCTTTC AAAGATAAAA TCTTAATAAT TTCTTAGTAA ACTCTTTTCT CTAGATTTAT 240

CACAATATTA TATAGACCTA TTTTATTTTG ACGTAAGTTG CTAGTATCTT CAAACAPiAPJ^ 300

ACCTTTATAA AAATTCATAC CTTTATGCTA TCGCTGTAGG CTCATTAACT TGTTACATAT 360

AATTCTTAAC TATCCTTTGA TGATTGTTTT ATTAGATTGT TTCGTTGATG GATACTTTCA 420

CGAATTTCTA TAGTTCAATG CTACTAAAAA AACAGCCCAA AACTTTAATT TGTTTTGGAC 480

TGTTTTATAA TTATGCTTGC GATGGTGTTT TAGTTTCTGA AGTTTGTTCA GCAATGTCAT 540

ATTTAAACTC TTTACCATCA TGATCTACTG TAACTTTCTT ACCTTCAATT TGATTACCAT 600

CTAATATTAA TTCACTTAAA TTATCTTCGA TAGTTTTTTG TATCGCTCTA ATTAATGGTC 660

TTGCACCATA TTCTGGATCA TATCCTTCTT CTGCGATTTT GTCTTTCGCT TTATCAGTTA 720

CAATAATATT TATGTTTTGT TCAGATAATC GATTTGTTAA TTTATTAACC ATCATTGTTA 780

CAATTTCTTT TAATTCTTCT TTTGTTAGTT TATGGAATAC AATGATATCA TCTACACGGT 840

TTAAAAATTC TGGACGGAAT GAATTTTTTA ATTCTTTTAA CATCGTTTTT CGAATTGTTT 900

CATAATCTTG TCCATCACTT GAACCACCGA ATCCAGCAAA TCGTTGATCT TGTAATTCTT 960

GTGCCCCAAC GTTTGATGTC ATTATGATAA TTGTATTTCT GAAATCAACT GTACGTCCTT 1020

TTGTATCTGT CAAATGTCCA TCATCTAAAA CTTGTAATAG AATATTAAAT ACATCTGGAT 1080

GAGCTTTTTC AATTTCATCA AATAAAATTA CAGAATATGG TTTACGTCTA ACTTTTTCAG 1140

TTAATTGTCC ACCATCATCA TGACCAACAT ATCCTGGAGG AGCACCAACT AATCGGCTCA 1200

CTGCGTGTTT TTCCATAAAT TCACTCATGT CTACACGGAT CATCGCATCA TCATCGCCAA 1260

ACATTGATTC AGCTAAAGCT CTAGCTAATT CAGTTTTACC AACACCAGTT GGTCCAAGGA 1320

AGATAAAGCT ACCAATTGGT CGTTTAGGAT CTTTTAACCC TGCACGGGCA CGTCTAACCG 1380

CTTTACTGAT TGAATTAACA GCATCTTTTT GCCCAATAAC TCTCTCATGT AATGTATCTT 1440

CTAGACTAAG AAGTTTTTCA GATTCTGTTT CATTGATTTT AGTTAATGGG ATACCTGTCC 1500

ATCCTGCAAT AACTTCAGCA ATATCTTCTT CTGACAATGA AGTTGaCATG CCATTTTGTG 1560

CATTCTTCCA TTCATTTTTA GCTTCTTCAT ATTGCTTTTC AAGTTTTGTT TGTTTATCAC 1620

GCAGgTTAGC AGCATTTTCA AACTCTTGAG CATGTACTGC GGCATCTTTT TCATTTTTAA 1680

CTTTTTCAAT TTCTTGTTCA ATTTCTTTTA AATTATTAGG TGTCGTATGA CTCTTAAGTC 1740

TTACTTTAGA ACTTGCTTCA TCAATTAAAT CAATTGCTTT ATCTGGTAAG AAACGATCTG 1800

AAACGTATCT GTTACTTAAT TTAACAGCTG CTTCAATAGC TTCGTCTGAA ATATTAATAC 1860

GATGGTGTGC TTCGTAACGA TCTCTTAATC CTTTTAAAAT AGCAACTGTA TCTACTACTG 1920

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TTTTGCGATA TTCATCTAAT GTAGTAGCAC CAATACATTG TAATTCACCA CGTGCTAATG 204 0 

CCGGCTTCAA AATATTCGAA GCATCGATAG CACCTTCAGC ACCACCAGCA CCAACTAAAG 2100 

TATGCAACTC ATCAATAAAT AGGATGACAT TACCTGCTTG TTGGATTTCT TCCATAACCT 216 0 

TTTTCAGACG CTCTTCAAAT TCACCACGAT ATTTAGTACC TGCAACTACT GTTCCCATAT 2 220 

CTAAAGACAT AACACGCTTA TCTTTTAATG TCTCTGGTAC CTCATTATTC ACTAtGGCtT 228 0 

GCGCTAAACC TTCAGCAATA GCAGTTTTAC CAACACCTGG CTCTCCAATA AGCACAGGAT 234 0 

TGTTTTTCGT ACGTCTACTT AATACTTCAA TTACACGTGT AATTTCTTTA TCACGTCCTA 24 0 0 

TAACAGGATC TAATGTACCG TCTTTGGCAA TGACTGTTAA GTCACGAGCT AAACTATCTA 24 6 0 

15 

AAGTTGGAGT ATTATTTGAC TTACTAGCTT GTGCATTTTT ATTACTCATT TCAGGGTTTC 2 52 0 

CTAAAGCTTT CACAACTTGT GCACGTGCTT TAGTAATATT TAAATCTAGA TTTGCAAAAA 25 80 

20 CTCTTGCTGC AACACCTTCA TTTTCACGAA TCAAGCCTAA TAAAATATGT TCCGTTCCAA 2 64 0 

CAAAATTGTG ATGTAATTTT CTAGCTTCAT CCATCGATAA TTCAATGACT TTTTTAGCTC 270 0 

TAGGTGTATA ATGCAATGTA CCAACATGAT CTTGACCATG TCCGATTAAT TTTTCAACTT 276 0 

CTTCAATTAC TTTATCTTCA GTGATATTAA AACTTTCTAA TACTTTTGCA GCAATTCCTT 2 820 

CAGGTTCTTT CATTAACCCC AATAATAGGT GTTCTGTTCC TATATTTGAA TGATTTAAAC 28 80 

GAATTGCTTC TTCTTGGGCA TGTGCTAATA CGCGCTGTGC ACGCTCAGTT AATCTACCAA 2 94 0 

ATAACATAAA TAATGACCTC CTACTTTATA TGTTCTCTTA GTATATCTGC TCGTTTTTCT 3 00 0 

TTTACAGATT TGTCATCTTC TTCATCTAAT AAAAATGGTG ACTGTATAGC TACCATCAAT 3 06 0 

TCATTAAATT TAAAGTTTTG TAATTCAATG TAATTTAAAT CTATACCAAG TTTAACTcGC 3120 

TTAATCTATA AGAAGCCTCT TCCATAGTTA TCATTCTACA GTTTTGTAAA ATACCTAGCG 3180 

AGCGAAAAAC ACGGTCTTGT GTTTCTAATT GATTATAAGT GTCTAACTTT TGTCGTATTT 3 24 0 

40 GTTTTTCTTC ATGAATGATT TGATTAACAA CTTCTGTTAA TGTTTCTATG ATTTCTAACT 3300 

CAGATTTACC AAGTGTAAGT TGGTTGGATA CTTGATAAGT ATGTCCATAA ACTTGCGAAC 336 0 

CTTCACCGTA AATACCTCTG ATTGTATATC CAAAACGATT AATGGTTTGA GCAATCCGTG 34 2 0 

TC^TTCTTTT CATAATAGAT AGACCTGGCA AATGTAGCAT CACGCTTGCT CTCATACCAG 34 8 0 

TACCTATATT GGTAGGACAT GTAGTTAAAT AACCAAGTTG TTCATCATAA CTTATATCAA 3 54 0 

GGCTTCGATC TAATTCATCA TCAATTGATG AAGCTTGATT ATATAAAGCC TGTAATGTCG 360 0 


TGTCAGTTCC CATAGCTTGA ATACGAATAT GGTCCTCTTC ATTTATCATG ACACTTAAAG 3 66 0 



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CAAAGTTTGG CAAGGCATCT TGTACCTCAT TTATAACTCT AAATCCcATC ATTTTCAGTA 3 84 0 

GCATACATTA GTGGATGCAC ATGATTTTCT AAAtTACGCG CTAACCGAAT TCTAGAAGAC 3 900 

5 

ATAACAATTG GTGTTTCTTC ATTACTTTTC ATCCATTGGC TGATATTATC ATGAATATTA 3 960 

TGCGTCATCA TGTTGCACCT CACTCTCAGC TTTTAGTGCT TTAATTTCAT CTCTAACAAT 4 02 0 

GGCTGCTTCC TCAAAATCTT GGATTTCAAT AAGTTTTTTC AAATATTCAT TCTTTTCTTC 4 080 

10 

GATTTTTCGC TTTAAAGCTA TCTTTTTATG TGAAGAATGT GGTGTCTTTC CAACGTGCTC 414 0 

AAATTGTCCA CCTTGAACTC TGCGGACGAT ATCAATGATG TCATCTTTAA ATGTTGcATA 4200 

ACAATTAGCA CACCCAAATT TACCAACATG TGCAATATC 4239

(2) INFORMATION FOR SEQ ID NO: 334: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1245 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 334: 

CAAAACTTTG ATTATGTCGT GTCAGTTATT AATGCATACA GTAATGCATC GTTGTTATTT 60 

TTTATTTATT TACTATTTAT TATGTACCAA ATCAGCCTTt CAGTAAAATA GGTCAAATTA 12 0 

30 

CTGATTTTCT AAATATAAAA TGCCTCCTAA TAACATACTA TTAGTACATC ATTAAGAGGC 180 

TCTTGTGTTA TTTGCATACT AAGCGCTCAA ATTTAAATTT AAGATGAAGA TTCTTGCAAT 24 0 

AATATTTCTA TATTTGTCGA TATATCCAAT GGATCTTCCA TTGGCAAGAA TCGATTTACA 30 0 

ACATTTCCTT GTCGATCGAT TATAAATTTT GTGAAATTCC ATTTTATTGG GGaCCCAAAG 36 0 

ATTCCTGGTT GTTCaTTCTT TAAATGCGtA AATAACGGAT GTTCATCGTT CCCGTTCACA 4 20 

GATATTTTAG CTAACACTGG AAATGTTACA CCAAATTTCT CACGACTAAT TTTCAAGATT 4 80 

TCTTCATTAG AACCTGGTTG TCGATTGTCA AAATTATTAT TCGGAAAACT CAACACTACA 54 0 

AACCCACGAT CCTTATATTT TTGAAATAAA GTCTCTAGTT TTTTCAATTG TTCGCTATAT 6 00 

45 

ATACATTCTG TTGCAGTATT AACAACTAAA ATCACTTTAC CTCTAAATGC TTCTAATTTA 66 0 

TAAGTTAAGC CTTTATAATC ACTTACTTCG ATATCATACA CATTTCTATT ATTCATAAGA 72 0 

CACCCCTACA CAGCCTTTTT TATATTGAAT ATGTTCTTTT TAGAATGTTC TGATAAAATA 78 0 

AGTGCGCGTT TACACCGTGA ACACACATTA TATAGCGTGA TACATTTTTC GAGCACACGA 840

TAAATAATGT TCGAGTTTAT GTTGTTGCTC AACCTATCCG ATTTACCGTC TTTTTTCACC 900

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15 



CCGTTATAAC CCCTCATTAT AATCATCCTT ArTTTCTATT rTTAAAAAGA CAATTAGACC 1020

GCTCTTTAAA CTATAGaTTA ATACTTAAGJc TAAACTCATA CATACTGATA CCATACGT^A 1080

GATCTAACAA TTTAAAATrC GTTATAACTA TGGATTAAAG AGCTGCCCAA CTCATATAAT 1140

CCTTAAAAAC ITCACATCTG ATTGTkTATT AAGCCCTCCT TTATCnTATT AAATATCCTT 1200

ATAACCCTTT TAAAATTAAA CTGACACACT CATACATTGT TACAC 1245

(2) INFORMATION FOR SEQ ID NO: 335:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1399 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 335: 
CTTGTAAAGG TAGrTGrTCA TTTAAAITAA AACAGTATGC TAAGTTnGTG CTTATAITTT 
GCAACTTCAA TTCGACAGGC TCATCTTGAT ACATGAATGC CTCAATTTCA TCATGTGATA 
ACTTTTTCAA AATATCAATT AAATGrATAC TAAAAACGAT AAATAACATA TCCCAATTTG 
AATICAGCCC TAGCGATTTT AATTTGriTA AAATCTTATC TTTTTGAAAA ATTCGATTCC 
TAATGTCTTG TATATCGTTA TCAGTCAAAG TITCCCAATC TATATGTGAA TGAAGACCTA 
AATAACACTT ATCCATTAAT AATTCATATA CCGTTAATGC AGAGACATCG AAACAACGTT 
CTTCACTTAr AAAAACGCCA TCAACATCAA ATAAAATrTT CTTCACAATC CCCACTCCAT 420
TTCTGaAAAT TCAGaTATAA ATCATTCTAC TATTrGACTA AAAAAAGCGC AAACCCTATT 480
GAAGTaGATT TGCGCTTTAG CTGTTt^T TTTATAAATG TnTTTCAATT TCATCAGCAA 540
CCTGCTGTAC GTGTGTACCG ACAATAACTT GAGTTGAATG rTTGCCATTA ACAGTAACAC 600
CAACTGCACC GGCGTrTTTA ATCTTCTGn TATCAATAAT AGATGTGTCT TTPAACTCTA
GACGCAACCT TGITGCACAA TTGGTTAAAT TAACAATATT CTCTTGACCG CCTAAACCTT
CTAATATTTG TATAGCATGT TGATGATATT TACTTTGrTT AATATCATTT TCACCAGGAG
CAATATTATC TTTTACAACT GTTGGGTCAA CTAArTCATr TTCACCTCTA CCAATCGTAT
TCAAGTTAAA TACTIGGArr ACTACACGGA AAATCACATA GTATAAGATG AAAAATACAA
CACCTTGAAC AAGCAACATC AATGGATGAT TTGATACTGG ATTAATTAGT GATAACACAT
AATCTATCAA ACCTGCACTA AATGAAAATC CAGCTGTCCA ATGGAATGTA GCTGCGATAA

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ACCAACCGTA AACTTGTTTT TTCTGAGTAG TTTTAGCTGT ATGATACATT GcTAACGCAG 12 0 0 

CCGCTGGAAT ACCGAACATC ATGATTGGGA AGAATCCCGC TTGATAGCGT CCTGTAATAC 126 0 

CTTTTATAGC ATCTTTGCCA CTTTGGAATT TACCAATATC ATTAATACCA ATCGTATCAA 13 2 0 

ACCAGAACAC ACTATTCAGT GCATGATGTA ATCCTGTAGG AATTAATAAT CTATTGGCAA 13 8 0 

CACCATATAT GAAAGCTCC 13 9 9 



(2) INFORMATION FOR SEQ ID NO: 336:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1329 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 336:

TATAGTTnTA TTATTTAGCG AAGCATTAAT ACTACCACCG GTTATAACAA ATGTATTTTG 60 

CGTATTAAAT TGAATGGTAG GACCAATCAA AGTATATTCA ATCGCTGGAC CATCATTGTT 12 0 

AATTAATGAC TGCGCAACCT TAAAACTAAA TTGATCCATG GCACCTGCGC CTGAAAATCC 18 0 

AATATGTTCA TAACCTATTC TTCCTAGATC TTGTACCGTT GAAAAGAGAC CTGGTTGTAA 24 0 

AATCTTAATT GACATTTTCA ATCACCACCC AGTCATCAAC ATTAAAGTTG CCATCTGATA 300 

30 

TATCTCTTTC GATTTGTATA AATTTCTGTT CATCTATTGC ATAAAATTGT ATCCATTCTC 36 0 

CTGCTTCGTA CATTGACATT GGTTCACGCT CGCTGCTAAA TACTTTTAAc GGTGTGCGTC 420 

CAATAATTTG CCATCCGCCA GGAGAATCTG ATGGATATAG TCCTGTTTGA TTATTCGCAA 480 

35 

TACCTACAGA ACCTGCATGA ATTTTTAACC TTGGCTGATT ACGTCTAGGT GTATGTAGTT 54 0 

GTTCATCAAG TcCGCCTAAG TATGGAAATC CTGGCATAAA TCCTAGCATA TATATTAAAT 600 

40 AAGGTTTACT TGTATGTTTT TCAATAACTT GCTCAACAGT TATTCGATTA TGCTTTGCTA 66 0 

CTTCTTCAAT ATCTGGTCCA TATGTACCAC CATATTGAAC AGGTATTTTA ATAATACGAT 72 0 

TGGTTTGATT CACAGCATGA ACATTTTTTT CATTAAATTT GTTAAGTTCT AAATTTTCAA 780 

TTAATTTAGA AGATGTTATA GCTTGTTCAT CAAAATATAT TAGAACTGCT CGATACGAAG 84 0 

GGACAATATC TTGAATTTCT AATATTTCTT TTTCTCGTAT CCACCGTACC ATTGCTGTGA 900 

CATTACGATA TGTCTCTTCG GATATTTTAT TTTCAAAATA AATCATAATT GTCTGCTCGT 96 0 

SO 

TAhTPiPiATCr TACATCCACT TTAAATCCCC CTTTGTATTG CAATAAACCA GTATTGAATA 1020 

CCTTTTCATT GTATCATTGA GAAGCACAAG TTGTTTAATA AGTAATTCAA ATCGCATATA 1080 



55 
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20 
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TTAATArrGT AACTCTTACA CTAATTTAGG TTCIXSCTATC ATTCGGTCTG ATGGAAAATT 1200

rrrAcrrrrc atctgtccga TrrrTTGArr ttgaatataa aaaagcacga ccgaagtatc 1260

ATTAACACAC TTCAATCGCG CAATTAAATA ATCTATITGA TCATTTATrG GATATTAACA 1320

atttttacg 1329

(2) INFORMATION FOR SEQ ID NO: 337:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 3421 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 337:
GAATAGTGCC ATTTGGAAAG ACATAAAGAA TCCAGAAGCA CCTTTAGCGA ATATGCCGTG
TTCATATAGA TTTGTGAAAC TGGCATGCCC GAATTGAGTT TTAAATGCAA ATAGAATCAT
GACGAAACCA ACTACTATTA AACCAATAAT TGTCGCTATT TTAATGATAG AGAACCAAAA
TTCTAATTCT CCGAAAAGTC TTGCGCTAAG TAGGTK5AAT GACATTAATA ACAGTACACA
AAATAGTGCA CTTATCCAGT TTGGAATTTC TGGGAACCAA AAGCTAACAT ArTTTGCCAC
;,GCCGTTACT TCAGCCATAC CTGTAATAAT CCAACAGAAC CAGTATGTCC ATCCGGTAAC
AAATCCTGCA AAAGGCCCAA TATAl^TATT GGTTACATCT GCGAAAGATT TAAATTCAGT
;,TTCTGTATA ATGATTTCTC CTAAACCTCG CATAAACATA AATAACATAA ATCCTATAAT
GATGTATGTT AATAGAATTG AAGGGCCGGT TAATGCAATC GTrTGACCAG CACCTAAGAA
TAAGCCTGTA CCAATTGCAC CGCCAATTGC AATTAATTGT ATGTGGCGAT TGCTCAGTTC
CCrrTGTAAT TTTTCAGCCA TAATACATCT CCCTTAAATA TAGATATGTT TATTATOCAC
rrATATTGAG ATATATACAA TTATTrTCGG TAAAAATGTG TAAAATTCCA TGTTAATATA
CrrTGGTTTT TATAATCATA TATAATAACC AATTGAAAAT TTAATTCTAT l^TAAAATTC
ATGGATTATT CACATCTTGA AAAAGCTTTA ATGGTGCTAT TTGTGGCTAT TCTGTGACAT
TTACATAGAT TTACAAAAAA ATTGrrGCAC ATATAATGCC AGTtrTTATA TTTCACAAAC
GAAATGCGTT TAcTATAATA TTAGrTGAAA GCCATTTCAT AAAGAAACAG TAAAGGGGAA
ATTCATCaTA GCn^^aATTAC AAAGAGGTTT AGAAGGGGTT ATCGCmGCGG AGACTAAAAT
;;;.aTTCAATT ATTGAAAGTC AATTGACTTA 1.3CCGGCTAT GATATTGATG ATCTAGCTGA
. T.^^T.-rnnAAC TATAGATTGC CAAACGAAGA



TACACATTTT GAGGAGTATG TTACAGATCA 
ATTATCATAT ATTGCACATT TCGATCCTGA 
5 TGAAAGAGCA ATGCGTATAC AGGCTAAAGT 

AAGACAAGAT AAAGAACCAC TTAAGCCTAA 
ATATATGTTA CGTGGGGAAT TACCAACAGA 

10 

TATTTTACAC GCTGATCATG AGTTGAACGC 
ATCATTGTCA GATATGTACT CAGGTATTGT 
CATGGTGGTG CAAACGAACA AGTTATGACG 

75 

GTTGATGCTT ACTTAGATGA AAAATTTGCT 
CGTGTATATA AAGATGGTGm tCCTAGaGCG 
20 mCGAAAGACG CTGGTCGTGA AGAATTATTT 

GCAGAAGAAA AAGGATTAAT TCCTAATGTT 
ATGGAAATAC CTCATGACTT ATTCACGCCA 
ATTGCTCATA TTTTAGAACA ATATAAAGAT 
ATTGGCGAAA CGAATCGTAA GTATATCCCG 
AAAAATGAAG ATGTAAAATT TGGAGGTAAA 

30 

GGAACTGAAG GATTAAACGT ACCTAATGAA 
ATTGGACCGG ATATTTGGAA GGCAGCAAGC 
TATAATGGCG AAAAACGCaT TGAATGGAAA 

35 

ACAACTGGTG AATGGTTACC TCAAGAAACA 
GTTAAAGGAC CTTTAACAAC ACCAATTGGT 

40 CGCCPlAGPJKT tagatttatt tacttgctta 
TCACCTGTTA AACGTCCACA AGATGTTGAT 
ATTTATGCTG GTATTGAATT TAAAGAAGGT 

'^^ TTACAAAACG AAATGGGTGC GACAAACATT 

AAACCAGTTT CTAAAGAAGG AACTGAGCGA 
GATAATAACC GTAAATCAGT TACTTTAGTT 

50 

GGCTCATTTA AGCAGTGGGG TTACGATTTA 
ACTTGGCAAC AATATGACGA AATTGTTGAA 

55 
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CGTGCATCCA ATGACAGCAT TACGTACGTC 12 60 

TGCTGAAAAT GAATCAGATG AAAATCGTTA 132 0 

AGCATCATTA GTTACAGCGT TTGCTCGAGT 13 80 

TCCTGACTTA AGTTATGCGG CAAACTTCCT 1440 

TATAGAAGTA GAAGCCTTCA ATAAAGCACT 1500 

ATCTGCATTT ACGGCACGTT GTGCGGTATC 1560 

AGCAGCCGTA GtTCTCTGAA AGGGCCATTA 1620 

ATGTTATCTG aGATTGGGTC AaTTGAAAAT 16 80 

AATAAAGrTA AAGTAATGGG CTTCGGTCAT 174 0 

AAaTATTTaA GaGAAaTGAG CCGTCAAaTT 1800 

GAAaTGTCAG TGAAAaTGGA AAAmCGTATG 1860 

GATTTTTATA GTGCGAGTGT TTATCACTGT 1920 

ATCTTTGCTG TAAGTCGTTC TGCAGGATGG 1980 

AATAGAATTA TGCGTCCTAG AGCGAAATAT 2 04 0 

CTTGrAGaAA GAAAmTAATC AATACAAATT 2100 

ATAACTATGA CTGCAGAAAA AATTACTCAA 216 0 

CCAATTATCC CATTTATTAT CGGTGATGGA 222 0 

CGAGTTATAG ATGCTGCtGT TGAGAAAsCC 22 80 

GAAGTGCTAG CTGGCCAAAA AGCATTTGAT 2 34 0 

CTTGATACAA TTAAAGAATA TTTAATTGCT 24 00 

GGTGGTATTA GATCATTAAA TGTGGCTTTA 2460 

AGACCGGTAC GTTGGTTTAA AGGAGTACCA 2 520 

ATGGTTATTT TCCGTGAAAA TACTGAAGAC 2580 

ACAACAGAAG TTAAAAAGGT AATTGACTTC 264 0 

CGATTCCCAG AAACTTCAGG TATTGGTATT 27 00 

TTAGTTAGAG CAGCTATACA ATATGCTATC 2 76 0 

CATAAAGGTA ATATTATGAA ATTTACAGAA 2 82 0 

GCATTATCTG AATTTGGTGA TCAAGTATTC 2880

AATGAAGGCA GAGATGCTGC TAATGCTGCT 2 94 0 
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TTACAACAAA 


TTTTAACTCG 


TCCAGCTGAG 


CATGATGTTG 


TAGCAACTAT 


GAAv-pi 


3060 


GGTGACTATA 


TTTCAGATGC 


TTTAGCTGCA 


CAAGTTGGTG 


GTATTGGTAT 




3120 


GCAAACATTA ATTATGAAAC 


AGGTCATGCT ATTTTTGAAG 


CAACACATGG 


TACA^J^ ^ ^ 


3180 


AAATATGCAG 


GTTTAAATAA 


AGTGAATCCA 


TCTTCAGTAA 


TTTTAAGTTC 




3240 


TTAGAACATT 


TAGGATGGCA 


AGAAGCGGCA 


GATAAGATTA 


CAGATTCAAT 


TGA^G A 1 Av-A 


3300 


ATTGCTTCAA 


AAGTTGTTAC 


TTATGACTTT 


GCCCGTTTAA 


TGGaTGGtGC 


TGAAGAAGTT 


3360 


TCTACATCAG 


CATTTGCAGA 


TGAATTGATT 


GnAAATTTAA 


AATAAGCAGA 


ATAGAATTAG 


3420 












3421 



G 

(2) INFORMATION FOR SEQ ID NO: 338:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3173 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 338:
CCCI^GATAC CCAAACTGCC CTAATACCAC TGGCTAAACC TAATGGATAG TACCATTTAT 
rrrCCAATAA ATAAhCCAAC TGCAAItlGCr AT.ACTCTAA ATATAATAGA GATAATCmcA 
rrAAnCGAT TAAtACGCCA AATACTAGTA ATAATAGGCT AGATAATAAT CCACCTAAAA 
;,GTACTTTTT AATTCCAAAG AAAGCTAATA TCAATAATGC TGCCGGTGCA GATAATTGAA 
AATCTAATCC Tt5GTATAATG GACGGTATTT TCAAAACTGC CAAAATGGTT AAAATCGCAG 
CAATGACACT AATTTGAGTA ATATCTrTTG ATGTCATACT AAAACCCCTA TACCGTTTCA 
TAAACAACTT GCTICGGTGT GCTTTCTAAA AATGATATGT AATGATTTAA ATCAATACAA 
TCGTCCACAA ATATrATTCT GCCTCCATAT CTCGTATTAA CTGGTTrAAT ATCAAATAAT 
CGATGGTAAC CAATTTTAGC AGCGGCAAAA TAACCTGTCG TATACGTTAA GTCATCGGAC 
ACGCAAAGTT CTCCTTTGAC ATACGGATGC GCATTGATAC AACTAGCAAT TGCTAAGGCA 
TCAGTCACTC mCArTAAG ATCACCTTTA TrATTTATAT CTTCAAACGA AAAATGTGTT 
GCCCTAATCC CCCTTTGTCC AAATGAATCT AAACGTTTAC CAGATATAGC AGATAGAATA 
ATAGCTCCTG TATAAACCGT rTCArrTTTA ATATATGTCA TCCCTTGATT TAGCGCTTGT 



^^^^n^z^TPT TOGAGATTTG CTTTATCATC CTCAATAATT 84 0 

TCAGTGACAC CACATTCTTG TGTTAAATGT TGGAGAXl ru Cii 



,_^^^„^^-T-r rzTT^T'^'^'rrA'^ A'^THAGAAAA 
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ATTTTGTCAA ACTCACAAAT CGTTTCAGCA CCACTAATAT GAACATCTTG ATTGCTAGAA 102 0 

CGCATTTTTA TACTATACAT GACGATCACC TcAATCTTCT TGaTGCAAAA TTTCAAACAA 10 80 

CCTATCTATA TCTTGTTCAG TATGAAAATA CGACAATGAT ATTCTTAACA TTGGCTTAGT 114 0 

CACAGTtGGA TACCTTAAAT AACTTGTAAA CACATGATGC TTTAATAATG TTTGATGAAT 12 00 

GTTCTCAGCC GCTTCTATGT CATCAAACTC AATAAACTTA ATCGGCGAGT TTGaACTATT 1260 

ATAATkAACA TTGAGTGCTT TTAACTTTTG GTTAAAATAT TTACTCAAAC TATTTAATTT 132 0 

AGTGCGTCTA TCATCAGCAT TTATTAACTT TTCAATGTTT CTTTTTATAA AATACAAATT 13 8 0 

ATAAATTGGC AAACTACTTG AGTAGATGAG TGGTCTACCG TGATTAATTA aCATATCCtT 144 0 

CaCATCaTTT GaACTkaAAA TcACACCCCC GTATGCACCA CATGCTTTAG ATAAACTAGA 15 00 

AGTGAGTATA TCTACACCTT GATAATTCGA GTAAtTCTCT ATtCCAAAAC TATGTGAAAC 156 0 

ATCGAGTATC AGTGTTGCGT TAnATTTATG CTTTAATGAG ACTAATTGAC CAATATCCAC 16 20 

AACGTCGCCA TTCGTTGAAA ATACACTATC AGATATGATT ATTTTTGGTA TATTTTGATT 16 80 

AGGGTATTTC TCTAACCTTT TTTCTAAATC AGCAATATCT AAATGCTTAT ATATCACTTT 174 0 

TTCTAAACCA CTTAACTTAA TACCGTCAAT AATACTCGCA TGATTTTCTT GATCTGAAAA 1800

CACGACACAA TTTGTATTTT TGAAAATATT AAATAACGCC AAATTAGCAT CATAACCACT 186 0 

ATTTAAGATA GTACATGcAC TATATCCGAG CCAACCTGCT AACATTGTTT CAATTTCTTC 19 20 

ATAAGCTGTC GAACTTCCAC TAATTAATCT TGAACTTGAT AAGTGATAAC TATACTTCCG 19 80 

CATAAATCTT TCGAAATCAT CCTTATCAAA CGCTATTTGA CCTAATCCTA AATAATCATT 2 04 0 

AGATGTATAG TTCGTACATC TCTTATTTTC TACTTCAATA TACTGTCTAT CTATATACCC 2100 

TACCGATTTA AGCGACCGAT ATAACCCTTT CTGTTGTAAT AAATCAATTT GCTCTTGAAA 2160 

CTTCATTCTT GTTTTCCTTA TTTTCACAAG TGTCATAATC AATTTCAAAG CCTAAATCAT 222 0 

TAATCATATC GTAGTCTAAT TGGTTCGGTT GCCCACCAGT AATTAGATAA TnCACCGACA 22 80 

AATATTGAAT TCGCCGCTTT TAATGCTAAT GGCTGTAACG AACGTAAGTT GACCTCTCTT 234 0 

CCTCCAGCAA TACGAATTTC TTTCGTAGGA TTGATTAATC GGAATAATGC TACGATTCTT 24 00 

AAACATTTCA TTGGTGTTAA ATCATCCATG CTTCCAAACT TTGTGCCTTT GATTGGATGC 2460 

AAAAAATTAA TCGGAATACT GTCGGCATCC ATTTCTTTTA AAGCAAATGC CATATCAACA 2 520 

ATATCTTGAT TAGATTCTCC CATACCACAA ATCACGCCAG AACATGGTGA TATATTATTC 2 58 0 

GCTTTCATTA GTTCTATCGT ATCTGTTCTA TCTTTATAAC TATGCGTTGT CACGACGTTA 2 64 0 

TCATGGTAAT TTTCACTTGT ATTAATATTG TGGTTATATC TGTCTACACC AGCTGACTTA 2 7 00 
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TGTTGAGATT TAATCGTTCT TACAGTATTA CTAATATGAT CAACTTCTTT ATCGCTCGGT 282 0 

CCTCTACCAC TCATAACAAT ACAATATGTT CCAATATGAT TATCATGTGC CACCTTTGCT 28 80 

CCATCGATAA TTTGTTCCTC TGGAATTAAA GCATATCGCT GTTTTTGTTT AATATCTCGT 2 94 0 

GATTGTCCAC AGTACCCACA ATTTTCAGGA CATATACCAC TTTTAGCATT TAAAATCATG 300 0 

TTTAATTTTA CTTTTTTACC AAAATAATGT TTTCTTAAAA TGTACGCCTC ATTTAATAAA 3 06 0 

TCTAAGGTAT CAATATTAGT ATCCTCATAA ATtTTCAATA CAGTCTCTTT TGTtAATTGT 3120 

tCCCCTTGTA ATATGCGTTT AGCCAAATTC ATATTAACAC TTCCTATCTA AAA 3173 
(2) INFORMATION FOR SEQ ID NO: 339:



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1694 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 339:

CGATTATCCA TTAATACAAC CCTAAGTAAA TGTATAAAAA TTATCTTCCA CAAACTTCAA 6 0 

CAAAAGCCTA AATAAATTAC AGCAATTTAT CAAATATTGC TTACTTTGAT TTTATGAAAT 120 

nACTTAATTC TAACACATAC TAAATCATCA TATACTAATT CGAAATCAAA TGCATTTAGA 180 

GATA_ATCaAA ATGCGGAAAC ATCTCCaATA ATCAATAATC TATTCCCAAT AAATATGAAT 24 0 

GTTCTCAACA ATACATTATT TATATCTCTT TACACTGTCA TCGACAAAAA CTAAATCTTT 3 00 

CACTTTCAAT TTCGAACGTG GTTCTACGAC ATTTGCTGCT ATATCATTTA ATGGGATTAA 3 60 

AACAAATGCA CGTTCATTCA TTCTCGGATG TGGCACCGAC AGTTTTGGTA AATCTATCAT 4 20 

TTCTTCTCCA TACAACAAAA TATCCACATC TAAAGTTCTA GGACCCCATC GTTCCTTTCT 4 80 

AATACGGTGT AAACATTCTT CTGTCkTCAA ACAACATTCC AACAGTTGTA ATACTGTGAG 540

TGTTGTTTsA ATTTCAACAC ACAAATTTAA AAAGTTAGGT TGCTCAGTAT ACCCAACTGG 6 00 

TGCTGTTTCA TAAATCGGAG AAATAtTAGA TACGTTAATA CCATCATATT CATTCaAAAT 66 0 

CTT>'ATAGCA TCGTTTAACt GGCTTTCTCT ATCACCAaTa TTACTACCTA AyCCTAAGTA 72 0 

TGCTTGAATC ATyTATTCTC CCTCACTATT TCGATACCTA CTCCATCATA ATGACCCGGA 78 0 

ATCGGTGGGT TTTCTTTAGT GATTCTCACT TTCGTTTCCA TTACACGATT ATATTGTGAA 84 0 

TTTATACGAT TTGCAATACG TTCAGCTAGA TGCTCAAGTA AATTAACGGC CTTACCTTCC 90 0 
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ATTTCATTTT CAGCTGATAA AGCACCATGA TATCCATAAA AGCGCATACC TTTAAGAAAG 10 80 

ATTGTGTCTT GCATTTTCAT TCTCCTTTAA AAAATCTATA CCTTTAGCTA ATTTAGCATT 114 0 

CAACTCGACA TTATGAACGC GTACTGCTCT AACGCCTTTC ATAATACCAT ATGCAGTCGT 1200

AGCTGCAGTT ACTTCATCTC TTTCAACCGG TGTTGTATCA TAACCCATCA TCTCTTTAGT 126 0 

GAAACGTTTC CGGCTTGTCG CTAATAAAAC TGGATATTCT GTTGCAACAA GTTCATCCAG 1320 

TCTTGCCATA ACTTCGGCTT CTTCATTTCT AGTTTTAGCG AAACCTATAC CTGGATCTAG 13 80 

CCAAATTTTA TTTGAAGGTA TACCAGCTAT TTTAGCTTGA TGTGCTTGTG CTAACAAAGA 144 0 

TGTTAACATT TCTTCGACAA CCGGTTCATC ACGATTACCA TTTCCATTAT GCATTAAAAT 1500 

AATTTCCGCG TCATAtTTAG CTACAATTTG GAACATACGA TGATCATACA GACCgCCcAT 1560 

tGATCATTAA TCATATCAAC GGCTAATTTT AAACATGCTT CAGCAACCTC ACTTCGAAAT 162 0 

GTATCGACTG AAATTTTTTA CATCAnAACC GACAATAGCT TCAnCAACAG TAATACTCTG 168 0 

TTCCATCTCT TCTG 1694 
(2) INFORMATION FOR SEQ ID NO: 340:
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1358 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 340: 
AGCATTTCTT TTCTATAAAC ATTTAATTGA ACATTATTAA GTACACTATT ACTATAGTCA 60 

CTATATTGAA CaCATACCTC ATTTAATTCT AATAGCGGTT CAGATTTGTA CTTATTATCA 120 
TTATTTGCAG ATGTTTCATC TATCCATTTT TTCACTTTAA ATTTAACATG TTCACTCATA 180 
CAAACGTCAC GTAAATTCGC TAAGTTATCA ATGGATTCGA CATCTACTTC TGCATATTTA 24 0 

AGCGCTGTAC AGTATAATGG TTCACGTATG CCTGCTTCTT TAAGCTTAGA TGATTTTAGC 3 00 

AAATCACTAG GCGTTGTATT AGCGATGATT TTTCCATCTT TAAAAAGAAG AACTCTATCA 36 0 

AACGTATCAT CTAATGATTC TTCTAATCGA TGTTCGACAA TAATCATCGT TGACTTTGTT 42 0 

TCTTCATGAA TATTGTTTAA CAATCTCAGC GTTTCATGTC CTGTCGCAGG ATCTAAATTG 48 0 

GCCAGTGGCT CATCCAATAT TAAAATAGGC GTACGATGGA TTAATATACC ACCTAATGAA 54 0 

ACGCGTTGTT TTTGACCTCC AGATAAATCT TGCGGTCGGT GATTTAAATG TTCTATCATG 60 0 

CCAACTTTTT CAGCCCAATA ACTTACATTT TTCTTCATAT CATCTTGTTC AACACAATTA 66 0 
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TCTTGTAAAA CTGTACCAAC AACATTAGAT CTATCATGTA 
TGATTATTTA TATATAGTTC CCCAGTTATG rrACCTITAG 
TTTATGCAAT TTGCAAAAGT CGATTTACCA CTACCCGAAG 
TCTCCTGGAT AAATATCAAC ATTTATATTC TGTAATGTAG 
TGAAAACTAA AGTCTTTGAA CGAGATAATT GGTTCAGTCA 
ATATTCATTT ACATATCTGA TTCAACAAAA TAACTATTCC 
AAriTGAGAT GAaGCATATG CTTTTAATAA TATTGTCCCA 
ATTTAATACT GCAGAGATAA CACCTTGTGT ATAAACCTTG 
CAAAATATCT AATGTTGGTG CAATAAGTGC CCAGCAAATA 
GATATTAAAA TAAACCATCG ATTTCCTAGA AAATCGGCCT 
TCCAATCCAG CCATATAAAC AGCCTATAAT TCCCGAGC 
(2) INFORMATION FOR SEQ ID NO: 341: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4557 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



AACAACTAAC 
TTTTAAATGG 
CACCAACTAC 
GTGTTGCTTG 
TGATATATCA 
TTACGTAAAC 
ATAATGCCAA 
TTAGCTGGTT 
ATATTCGCAA 
GAAGAAAGAT 



GGTTGCATCT 
AATTAATCCG 
TAATACTTTT 
ACTATGATAT 
TTACCTTTCT 
TACCTTTTTT 
CTGAAATAAT 
CGTTATAAAT 
TAATTTGACC 
TTAATTTTAG 



780
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1358 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 341: 
TAGAAGAATT GGAGAAAATG CTAATTCAAT TGTCAACATT CCATAGriAT 
AGTTTCTArr TGTGACACGT GAAGATGAAG TTGAAACATT GAAATGGGCA 
CACATATGAC ATTGAGAnGG CAAAACATTA GAGGArTTGT TTACAATCAA 
ACCAAATTTT AACGTCAArT TATAGCATGA TTAAAGAACG TATCCAAGCT 
CAGCAGAAGT AATGAGCAAA rTATTTTCAC ACCGCAATTA GTGTrTGTCA 
GTCATTAATT ATTGATCATG TCATTTTAGA ATATGTAAAC CAAGATTTAT 
TATrrCATTA ATCTTTGTTG AAGATCtGAT TGaAAGTTTG CCAGAGCATG 
TATTGATATC AAGTCTCGTA CTGAAGGCGA ACTGATTACG AAAGAAAAAG 
ATTGAAATTT ACACCTGAAA ATATTGrTAA CGTCGATAAA GAATATATCG 
GGCGAATTTG ATACACGTCG AACATTTGAA AAAIX^CAATT CCTGATAGTA 
AGAGATGTAT AACGTGAAAG AAGTAGATCA GCTTGATGTG GTTAATCGAT 
,^^,^^,,-rr:r, rpr^-r^rCTTT AGGTGTAAGA GGTAAAGATG 



CATGATTTAG 
CGTTGGTTGC 
CGAACGCGTG 
GTGCtGaACG 
TTACAGATAT 
CAGAATATGG 
TAGATACCAT 
AATTAGTTCA 
CGCGACGTTT 
TTACATTTTT 
GGAGACAAAA 
ATATTTTATC 



60
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
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AGGGAAATCT GAGATTATCC AATCATACAT TTTATCTTTA GCTATTAATT TTCACCCTCA 840 

TGAAGTTGCA TTCCTATTGA TTGACTATAA AGGTGGGGGT ATGGCGAACT TATTTAAAGA 9 00 

TTTAGTCCAT TTAGTTGGTA CGATTACAAA CTTAGATGGC GATGAAGCGA TGCGTGCCTT 96 0 

AACATCAATC AAAGCCGAAT TGAGAAAACG TCAACGTTTA TTCGGAGAGC ATGATGTTAA 102 0 

CCATATTAAT CAATACCATA AGTTATTTAA AGAAGGTATT GCGACAGAAC CAATGCCACA 1080 

TTTATTCATT ATTTCCGATG AGTTTGCCGA ATTAAAATCA GAACAACCTG ATTTTATGAA 114 0 

AGAACTTGTA TCAACGGCAC GTATTGGACG TTCGTTAGGT ATTCATTTAA TACTTGCGAC 12 0 0 

ACAAAAACCA TCGGGTGTTG TTGaTGACCA AATTTGGTCT AACTCTAAAT TTAAGTTGGC 126 0 

ATTAAAAGTA CAAGATAGAC AAGACAGTAA TGAAATTTTA AAAACACCAG ATGCAGCAGA 132 0 

CATTACmTTA CCaGgTCGTG CGTATTTACA AGTTGGTAaT AATGAmATTT ATGAATTATt 13 8 0 

CCAATCTGCA TGGAGTGGTG CAACATATGA CATCGAAGGC GATAAATTAG AAGTTGAAGA 14 4 0 

TAAGACGATT TACATGATTA ATGACTATGG TCAACTTCAA GCAATCAACA AAGACTTGAG 15 0 0 

TGGACTTGAA GATGAAGAAA CGAAAGAAAA TCAAACTGAG TTAGAAGCGG TCATAGATCA 156 0 

TATCGAATCT ATTACAACAC GATTAGAAAT CGAAGAAGTT AAGCGTCCAT GGCTACCACC 1620

ATTGCCAGAA AATGTATATC ArGAAGATTT AGTAGAAACa GATTTCAGAA AATTATGGTC 1680

AGATGATGCA AAAGAAGTGG AATTAACATT AGGACTTAAA GACGTACCAG AAGAACAATA 174 0 

TCAAGGACCG ATGGTATTGC AATTGAAAAA AGCTGGGCAC ATCGCGTTAA TCGGAAGTCC 18 0 0 

AGGATATGGT AGAACAACGT TCTTACACAA CATTATTTTC GATGTTGCAA GACACCATCG 186 0 

TCCTGATCAA GCACACATGT ACTTGTTCGA TTTCGGTACC AATGGTTTGA TGCCAGTTAC 192 0 

AGACATACCA CATGTCGCTG ATTACTTTAC AGTAGATCAA GAAGACAAGA TTGCTAAGGC 1980 

GATACGTATA TTTAATGATG AAATTGATCG TCGTAAGAAG ATTTTAAGTC AGTATCGTGT 204 0 

CACTAGTATT TCTGAATATC GAAAATTAAC TGGTGAAACA ATTCCGCATG TCTTTATTCT 2100 

TATTGATAAC TTTGACGCAG TAAAAGATTC ACCTTTCCAA GAAGTTTTTG AAAATATGAT 216 0 

GATTAAAATG ACGCGTGAAG GGCTAGCATT AGACATGCAA GTAACCTTAA CTGCTTCAAG 222 0 

AGCTAACGCT ATGAAAACAC CAATGTACAT TAATATGAAA ACGCGTATCG CAATGTTTTT 2280

KTKTQKTPJ\A TCAGAGGTGT CGAACGTAGT AGGACAGCAA AAATTTGCGG TTAAAGATGT 234 0 

TGTGGGTCGA GCATTGTTAA GTAGTGATGA CAACGTATCA TTCCATATTG GCCAACCATT 24 0 0 

TAAACATGAT GAGACCAAAT CATATAATGA TCAAATTAAT GATGAAGTAT CGGCGATGAC 24 6 0 

AGAATTTTAT AAAGGTGAAc ACCAAATGAT ATtCCTATGA TGCCAGATGA AATTAAATAT 2 52 0 
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GGATTAGATT 
TCATCAGAAA 
ATATTAAATG 
AGGCATCAAG 
ATGATTGAAG 
ATTATCAATG 
AAGCTTATTA 
GAATTAAtAG 
AGTATAGGTA 
CCTGTTATTA 
TGGTTTAAAT 
AAACAATGGT 
AAATTGAAGT 
ATTTGAGTTT 
CTAGTGCTAA 
AAAAAAGAAA 
AAGAATCCCT 
AAGCAATGAA 
GTGGCAAGGT 
GTCAAAAGCA 
TCGCCAATAA 
AACCGATGGC 
ATACATTATC 
AATAGATTAT 
AAATTTTGAA 
GGAATTAGTA 
CAGAGAGTAT 
TAGCTATGAA 



ATGAAGGTGT 
ATCCGAGAGA 
AAAAATATGC 
TGGCTAACTT 
ACTTAAAGCA 
ATTTTAAAAC 
CAAAAGGACC 
ATGCTTATGA 
TTCGTATTTC 
AAGAAAATGA 
AGCAATGAAT 
TAAGTCGAAA 
TCGTTCTGGA 
TGCACTAGAT 
TAAGGACAGT 
TTTGATGAAA 
CCAAAAAGCT 
TTAATATTGA 
TGATCAAGCG 
AACGCAACAG 
GGTGAAAGAT 
TGACCATCAA 
AATTAAGCAA 
TTTTCTTACG 
TTGTTTGATC 
ACAGAGAATG 
ATTAGCGCCA 
AAAGATGAAT 
AATAAAGACA 



TACACTACAA 
AATTGCGCAT 
GATTTGTATC 
TGCCGAAGAA 
AAGAGAAATG 
ATTTATTGAT 
AGAACTTGGC 
TAAACAGATT 
AGACCAACAA 
AGCATATATG 
TAAATAGGAG 
TTTAAAGATA 
TATTTAAGAA 
GATATTGAAA 
GTAACTGTTA 
CAAAAGCTTG 
AAGGAAATTG 
GGTGAAGATA 
AAACAATTAG 
CTCGCTGAGT 
GTGTTACTCA 
AAAGCAATTG 
GGGCTTGATA 
AAGAATTAAC 
CATCAAATTT 
ACAAGTTAAC 
TTGTAAATAT 
ATATTTTGTT 
TTGCATGGTG 



AAAATTAAAT 
ATTGCTGAAA 
GCAGACTCAA 
AGAGAAGACA 
GACGGCCCAT 
TGCACGTATA 
TTGAACATTT 
GATGTTGCAC 
TTCTTTAAAT 
GTCGCAAACC 
GGAGGTATGT 
TTAAAAAGCA 
AAGCTGAACA 
GCACAGCAAA 
AGGGAAAGGC 
AAATGTTGGG 
CTGGCGAAAA 
TGGGTGGATA 
CGGCAAAAAC 
ATATCGAAGG 
TTATGGCAAA 
ATAACTTAAG 
GGGTGAACCC 
AATTTTAGGT 
TGAAGAAGCT 
TGATGCAGGT 
TCGAATTAAT 
AAGCCGGTTT 
GTCGATTGTA 



TAACTGAACC 
TTATGATGAA 
GTGGAGAGTT 
TTAAAGCGAT 
TTGAAAAAaA 
TTCCGGAAGA 
TATTTGTCGG 
GTAAAATGAT 
TTAGATTTAT 
AAGCTTATCA 
TATGAATTTT 
TGCTGAAGAG 
ATATAAGCGA 
GGACGTACAA 
GCCCAATACG 
TGAAGATATC 
GGCAAGTGAA 
TAAAGGTATT 
AGCTAAAGAT 
TAGTGATTGG 
GTTTCAAGAA 
TCAAAATCTA 
ATGATGAAAG 
GGTAGTAAAT 
AAAGCTGCTT 
TTTAAAGTGG 
GATATGTATT 
AAAAATAATG 
CAATCATATC 



AGCAATGATT 
AGAAATTGAC 
TAAAGCTTAT 
TCATCAACTA 
TTCACTTTAT 
TGATGTTAAA 
CATTCATAAA 
TAACCAATTT 
TCAACGAGAA 
AAAGATTAGA 



AATGATATTG 

TTAGAATTTA 
ACTGCAAAAT 
TTATATATTG 
GATAAAAATA 
TATTTTAATA 
AAAGCAGATG 
ATTGAAGCAT 
GAAGGACAGT 
GAATTAGTAC 
GCGAAATACG 
ATGTTAAGCG 
TGCCTCTCGT 
TAATTGAAAA 
CTACATTAGT 
TTGCACCATT 
GGTTTCAAAT 
CTTTATTGAT 



2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 

3720 

3780 

3840 

3900 

3960 

4020 

4080 

4140 

4200 

4260 

4320 
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CTTAAATAAT GAAAGTATCG ATACGATTGG GCGTGTTTTA GAAATTGAAA TATACAATCA 444 0 

TCAAGGTGAC CCTCAACAAA GTTTATATAA CATTTATGAA CAAAATGATT TGTTATTCAT 4 500 

TCGATACCCA TTAAAAGATA AAGTGCTGAA TGTTCATATT GGTGTCATTA ATACATT 4 557 
(2) INFORMATION FOR SEQ ID NO: 342: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3931 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 342: 

TTGAGTGACT TTATTGAAGC GCGTGTAGAA GAAATATTCT TCGAAGTATT TGATGTTTTA 60 

CAAGATTTAG GATTAACAAA AGTAAATGGT GGGTTTATTG TAACTGGTGG ATCTGCAAAC 120

TTACTTGGCG TAAAAGAATT ATTATCAGAT ATGGTAAGTG AAAAAGTTAG AATTCACACG 180 

CCATCACAAA TGGGAATTAG AAAACCTGAA TTTTCTTCAG CAATTTCTAC AATTTCTAGT 24 0 

AGTATCGCTT TTGATGAGTT ATTAGATTAT GTTACAATTA ATTATCATGa TAATGAAGAA 3 00 

ACTGAAGAAG ATGTTATTGA TGTGAAAGAC AAAGATAACG AATCTAAATT AGGCGGaTTT 3 60 

GaTTGGTTTA AACGTAAAAC AAACAAAAAA GATACTCATG aAAATGAAGT AGAGTCAACA 42 0 

GATGAAGAAA TTTATCAATC AGAAGATAAT CATCAGGAAC ATAAACAGAA TCATGaACAT 48 0 

GTTCAAGACA AAGATAAAGA TAAAGAAGAA AGTAAATTCA AAAAACTAAT GAAATCTCTA 54 0 

TTTGAATGAT TATTGGCCAA TAAAACTAGG AGGAAATTTA AATGTTAGAA TTTGAACAAG 6 00 

GATTTAATCA TTTAGCGACT TTAAAGGTCA TTGGTGTAGG TGGTGGCGGT AACAACGCCG 66 0 

TAAACCGAAT GATTGACCAC GGAATGAATA ATGTTGAATT TATCGCTATC AACACAGACG 72 0 

GTCAAGCTTT AAACTTATCT AAAGCTGAAT CTAAAATCCA AATCGGTGAA AAATTAACAC 780

GTGGTTTAGG AGCAGGAGCT AATCCTGAAA TCGGTAAAAA AGCTGCAGAG GAATCTCGTG 84 0 

AACAAATTGA AGATGCAATC CAAGGTGCAG ACATGGTATT TGTTACTTCT GGTATGGGTG 90 0 

GCGGAACTGG TACTGGTGCA GCACCAGTCG TTGCTT^AAAT TGCAAAAGAA ATGGGCGCAT 96 0 

TAACTGTTGG TGTTGTAACT CGTCCATTTA GTTTTGAAGG ACGTAAACGT CAAACTCAAG 102 0 

CTGCTGCTGG AGTAGAAGCT ATGAAAGCTG CAGTAGATAC ATTAATCGTT ATACCAAATG 10 8 0 

ACCGTTTATT AGATATCGTT GACAAATCTA CGCCAATGAT GGAAGCATTT AAAGAAGCTG 114 0 

ACAACGTGTT ACGCCAAGGT GTACAAGGTA TCTCAGACTT AATCGCTGTT TCTGGTGAAG 12 00 
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GTATTGGTGT TTCTTCTGGT GAAAATAGAG CGGTAGAAGC TGCTAAAAAA GCAATCTCTT 132C 

CTCCATTACT TGAAACATCT ATCGTTGGTG CACAAGGTGT GCTTATGAAT ATTACTGGTG 13 80 

GCGAGTCATT GTCATTATTT GAAGCACAAG AGGCTGCTGA TATTGTCCAA GATGCTGCAG 144 0 

ATGAAGACGT TAATATGATT TTCGGTACAG TTATTAATCC TGAATTACAA GATGAGATTG 150 0 

TTGTAACAGT TATTGCAACT GGTTTTGATG ACAAACCAAC ATCACATGGT CGTAAATCTG 156 0 

GTAGCACTGG ATTCGGAACA AGCGTAAATA CTTCTAGCAA TGCAACTTCT AAAGATGAAT 1620 

CATTCACTTC AAATTCATCA AATGCACAAG CAACTGATAG TGTAAGTGAA AGAACACATA 1680 

CAACTAAAGA AGATGATATT CCTAGCTTCA TTAGAAATAG AGAAGAAAGA CGTTCAAGAA 174 0 

GAACAAGACG TTAATCGGTT AATATATATA CACAAATAAT TCAACACAAA TCATCAGATA 1800 

ACATATCTGA TGATTTTTTT ACTAATTTTT AGaACATGTA GAAGGACATT TAAGTTTTTC 1860 

aAAGTTATTA AAAGTGTTTA AGTATCGTGT GAAAATTAAG TCaAAAATTA TTTGCGCAAC 1920 

ATTTTAACTT TAAACATAAA TGTTATATTA TATAATTATT AACTTI'GTaC AGTTAGACGA 1980

AGATAATTTA AATGAAATGA TGGTGACGAT CGAGTGAATG ATAATTTTAA AAAGCAACCG 204 0 

CATCATTTAA TATATGAAGA GTTATTACAA CAAGGTATTA CTCTAGGTAT TACAACTAGA 2100

GGAGATGGTT TAAGTGACTA TCCTAAAAAT GCTTTTAATA TGGCGAGATA TATTGATGAT 2160 

CGCCCATATA ATATTACTCA ACATCAATTG CAATTAGCTG AAGAAATTGC GTTTGATAGA 222 0 

AAAAATTGGG TGTTTCCCAT TCAAACACAT GAAAATAAAG TCGCTTGTAT TACAAAGGAT 228 0 

GATATAGGCA CAAATATAGA CACTTTAACT GATGCGCTTC ATGGTATTGA TGCGATGTAC 234 0 

ACATATGATA GTAATGTCTT ATTAACGATG TGTTATGCAG ACTGTGTACC AGTATATTTT 2400 

TATAGTACAA AACATCATTT TATTGCATTG GCGCATGCAG GTTGGCGTGG TACCTATACT 24 60 

GAAATTGTAA AAGAAGTGCT AAAACATGTG AACTTTGATT TGAAAGACTT ACATGTCGTT 2520 

ATTGGACCAT CTACATCATC AAGTTATGAA ATTAATGATG ATATTAAAAA TAAATTTGAA 2580 

ACATTGCCAA TTGATAGTGC CAACTATATT GAAACTAGAG GACGAGATCG TCATGGTATT 2 640 

GATTTGAAAA AAGCCAATGC TGCATTATTA ATTTATTATG GTGTTCCTAA AGAAAATATT 27 00 

TATACGACAG CGTATGCTAC ATCTGAACAT TTAGAATTAT TTTTCTCTTA TCGATTAGAA 2760

AAAGGTCAAA CAGGACGCAT GTTAGCATTC ATTGGTCAAC AGTAAACAAG GAGGAGATAT 2 820 

GTTTGCGTGT GAAAGATAAT TTACAACAAA TCTCAACACA AATTAATGAC AAAAGTGAAA 2880 

AAAATAATTT TTCAACAAAA CCAAACGTGA TTGCAGTTAC AAAATATGTT ACAATAGAGC 2 94 0 

-:^r-T^\AA^A Af^'^riTATCAG GCTGGAATAA GACATTTTGG TGAGAATAGA TTGGAAGGCT 3000 
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AATCTCGAAA AGTTAAGGAC GTTATAAACG ACGTAGATTA TTTCCATGCT TTAGATCGAT 3120 

TGAGCTTAGC CAAAGAAATT AACAAACGTG CAGAACATAA AATTAAATGT TTCTTGCAAG 3180 

TGAACGTTTC GGGAGAAGCT TCTAAACATG GTATTGCTTT AGAAGATGTT GATCAGTTTA 3240

TAGATGATCT TAAAAAATAT GACAAAATCG AAATTGTAGG TTTAATGACG ATGGCACCAT 33 0 0 

TGACAGATGA TGAAGCATAT ATTAGATCGT TATTTAAACA GTTACGTTTG AAAAAAGAAG 336 0 

AAATACAACG ACTCAATTTA GAATATGCGC CTTGTGATGA ATTATCAATG GGAATGAGTA 34 20 

ATGACTATCT TATTGCAGTT GAAGAAGGTG CGACGTTTGT TAGAATTGGG ACTAAACTTG 34 80 

TAGGAGAAGA GGAGTGAGCC ACTTGGCTTT AAAAGATTTA TTTAGTGGAT TTTTTGTAAT 354 0 

AGATGATGAA GAGGAAGTAG AAGTACCTGA CAAACAACAA CAGGTAAATG AAGCGCCAGC 3 6 00 

AAAAGAGCAG TCACAACAAA CAACAAAACA AAACGCAATC AAATCAGTCC CTCAAAAATC 3 660 

TGCATCAAGA TATACAACAA CGTCAGAAGA AAGGAATAAC CGTATGTCTA ATTATTCAAA 3720

AAATAATTCA CGTAATGTTG TAACTATGAA CAATGCTACA CCAAACAATG CATCACAAGA 3 780 

AAGTTCAAAA ATGTGTTTAT TCGAACCACG TGTTTTTTCA GATACACAAG ATATTGCTGA 3 84 0 

TGAGCTTAAA AACCGCCGTG CGACACTTGT CAATTTACAA CGTaTTGATA AAGTATCAGC 3900

GAAAAGAATT ATTGATTTTT TAAGCGGTAC T 3 931 
(2) INFORMATION FOR SEQ ID NO: 343: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 3150 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 343: 

AATTGTCGGG GGACTCTTAG GTTTTGTCAT GCAAAGAACA AGATTTTGTT TAACAGGTGG 60 

CTTTCGAGAT ATGTATGTGC AAAAGAATAA TAAGATGTTC TATGCATTAT TAATCGCTAT 120 

TACTATTCAA AGTATAGGAT TATTGATTTT GACGGCAACA GATATTTTAC AAATTCCTGC 180 

ACATAGTTTT CCAATATTGG GAACAATTAT AGGTTCTTTT ATTTTTGGAA TTGGAATAGT 240

ATTGGCTGGA GGATGTGCAA CAGGtACTTG GTATCGCGCT GGTGAAGGGC TAATTGGTAG 3 00 

TTGGATTGCA TTAGTATTAT ATGCTGTTAC TGCAGCAATC ACTAAAACAG GGATTTTAAA 360 

GCCAGTAATG GATAAAATTA ATCAACCAAC GAATGTAAAT AGTGATATGT CTCAAACAAC 42 0 

TGGCATTCCG TTTTGGGGAT TAGTCGTTAT ATTAACTATA ATCACCATTT TTCTAGTTGT 4 80 



EP 0 786 519 A2



AGGTATTAGA 
TGGACTTATC 
AGGTATAACA 
TATTGATTGG 
AGGATCAAGA 
TGGTGGCATA 
TTTGGTTGAA 
TTGGTGTATG 
AATCTGCAAA 
TGATCAAATA 
AGGAATGGTA 
TGGAGATGAA 
GGCTGCAGAA 
AATTACAATT 
TGGCTACTTC 
CATTTTTATA 
TGCTTTTTAT 
CGATACGATA 
TTTATTTTAT 
GGCAGCAATT 
GAATGGGAAT 
GACTCAAGAA 
GTTTTGTACG 
TCCTTATTTA 
GGTTTGGGGC 
TGTTCATATT 
CAAACATTCT 
GTGGATTGGT 



TATTACCTTT 
GCACTCTTAG 
ACGCCTTCAG 
GGTGTCTTTr 
GAATTTAAAT 
TGTATGGGAT 
ACGGCAACGA 
GACAATGAGT 
GGTTAAACAG 
AAAGTGATTG 
TGTCCATTTC 
TTAAAAATTG 
AATGGTTATC 
CAAAAAGTTT 
CGCTAATTTA 
TAGTGTGGCA 
AAATTGTCAG 
TTTATGTTTT 
GGTGGGTTAG 
GTGCTTGTTA 
TTATGGTTCG 
GATATTAGCC 
ATTGCAATTT 
TTATTCGTAG 
GGCGGTTGGA 
ACATCAGGTG 
GAATCAACAC 
TGGTATGGAT 



TCGAAAAACG 
CTTGGCCAAT 
CAAATTTAGT 
TAGTTCTAGG 
GGCGArrOCC 
TTGGTGCGTC 
TGACTTGGCA 
CATTTTATCT 
CAAACGCAAA 
GAAAAGGAGA 
CGTTAATTGA 
ATTTTGATTG 
CTGTAACAAA 
AACGTTATCA 
AAAGTGAGTA 
CATTTTATTC 
ATTATTATGA 
TGTGTACATT 
TTCAATCTAA 
CATTTGTATG 
GAAATTGGGA 
CACATATTCC 
CTATTTTATC 
TAATATGGAC 
TTAACAAACT 
TTTCTGGTTT 
CACATAATCT 
TTAATGTAGG 



ATACCATCCA 
GAGTGCATCA 
ACACTTTTTG 
AATTTTCATT 
AGACAAGATT 
AGTTGCTGGT 
AGGATGGATT 
TTGTTCGTCC 
TAGTATAGAA 
AATAATTATG 
AGCGCAAAAG 
CACGCAAGCG 
CTATGAACAA 
TTTTAACAAT 
AGTAGTCTTT 
CAAAAGATGT 
AAAAAAGGGA 
ATTAGTTTGG 
AAATGCGCTT 
GATAACAGTT 
ATATACTTTT 
TTTCGCTTTG 
TGGTTCAATC 
TGCTCTTGTA 
CGGTGTATTA 
AGTATTAGCT 
TATCATTACG 
TAGTGCmT 



TTTATTGCAG 
ACTGGAAGAA 
ATTACAGGTG 
GGTTCATATA 
ACAATACGAA 
GGTTGTTCTA 
GCGCTAgCAT 
AATGAAAAAA 
GATTATTATG 
ATACACGAAT 
AAAATGGCAA 
ACGGAAGCCA 
ATTGATAATG 
AAAATAGATA 
TTTTTTTrAG 
AATAAAACTT 
GTGGTAAGTA 
TTAATGACAC 
AATACTGTCA 
GGTTTTACAA 
CTTAATCATG 
TTTATGTTAT 
GCTGAGAAAA 
TACAGTCCAG 
GATTTCGCTG 
ATTATGATTG 
TTGATTGGCG 
ACATTTGATA 
nCTATAGGTT 



CAATTGTAAT 
ATGACGGTTT 
AAACTAAATT 
TTGCAGCTAG 
ACAGTGCCAT 
TCGGTAACGG 
gCGATGATAG 
GTACACCAAC 
CAAATGATGT 
TAGGTACAGT 
CATTGCAATC 
TTCCAAATTG 
CTTCATGGAC 
TTAGATTCTA 
TTCATGAAAT 
AACGCATTTT 
TGAATCTTAA 
CAGGATTAAG 
TGCAAAGTAT 
TTAGTTTTGG 
TAGGTTTTGC 
TTCAAATGAT 
TGAAGTTTAT 
TAGCACATTG 
GAGGTACGGT 
GAAAAGGAAA 
GTATATTCGT 
ATATTGCGAT 
GGTTAATTTT 



600 
660 
720 
780 

840

900 

960 
1020 
1080 
1140 
1200 

1260

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 
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ATTAGTTGTC ATTACTCCTG CAGCAGGATA TGTAACATAT CTTAGTGCAA CAATAATGGC 24 0 0 

TTTAATAGGA GGTATCTGTT GTTATATTGT CATTAATTAC ATCAAGGTAA AACTAAAATA 246 0 

TCATGATGCA TTAGATGCAT TTGGTATTCA TGGTGTTGGT GGTATTATTG GTGCTGTTTT 2 52 0 

AACAGCAGTT TTCCAAAGTA AAAAAGCCAA TCCTGACATT GAGAATGGCT TTATTTATAC 2 58 0 

TGGTGACATA CATATTATAC TTGTACAAAT ATTATGTGTA ACAGCAGTTG TAATTTTTAG 264 0 

TATAGTCATG ACGTTTATTA TTGCGAAAGT AATTAAATTa ATTACACCAT TATCTGTTAC 2700 

GGAACAAGAA ACGAATATAG GATTAGACAA GATTGTTCaC GGTGAACATG CTTACTTTGA 276 0 

AGGTGAGCTA AATAGATTCA ATAAACATAT TCGATATTAG AATATATTTA CATAGAATAT 2820

TCATTGTCCT GACATTTAAC TAAAGGTTGA TGTTGGGACA TTTTGTTATA CAAAAGTTTT 2 8 80 

ATTTTGAAAT CTTTTTATGA AAGAAGCAGA AATATTATTT AAAGCGGTTA CACATATGCT 294 0 

AAAATAAGGC TAAGTGTCAC AAATAATGAT AGGTGAATAA GTATGAAAAA TATATCTGAT 3 000 

ATTGCCAAAT TGGCAGGCGT TTCAAAAAGT ACAGTATCTA GATTTTTAAA TAATGGATCT 3060 

GTCAGTAAAA AAACAAGTGA AAAATTAACA AGAATTATAG CAGAACATGA CTATCAACCG 3120 

AATCAATTTG CTCAAAGTTT AAGAGCGAGA 3150 
(2) INFORMATION FOR SEQ ID NO: 344: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3719 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 344: 

GTTATAGTGA AATTGACTCA TCACATTTCA CAGACCGTGA CAAACGCGTT ATTAGACGTG 60 

ATCATGTTAA AGAAGCACAA AGCTTAGTAG AGAACTATAA AGATACACAA AGTGCTGATG 120 

CTAGGATGAA AGCCAAACAA AAAGTTAACA CATTAAGCAA ACCGCATCAA AACTATTTCA 18 0 

ATAAACAAAT TGATAAGGTT TATAATGGAT TACAACGCTA ATCCAAAGTA AATTATAAGT 24 0 

TATACATCTC GTTTTTAAAT GACAATTTAT CCCCGTAAAT ATTATAAATA ATCTTTTCAA 300 

ATTCCACATA GATATAGAGA CACTAATAAA CCTCTTTGTC TCGATATGAT AGTCTGCAAC 36 0 

GATTCATGTT GTAGGCTTTT TAATTTTACA AATAAGGCTA AATATATAAG TTCTGGCACC 42 0 

TAAAATATAG AAAATACATA AAAGTAAGTA TAGTTATTTT ATTATAATTA TTAAATTTTT 48 0 

ATTAATTAAT TGTAAAAATG TATAATTATA ATTAATTAAC GTTTAATATT AAAATTAACT 54 0 
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ATCcrrrcAA tattacttat agggatggct atcagtaatg tttcgaaagg gcaatacgca 

AAGAGGriTT TCTATTTCGC TACTAGrTGT TTAGTGrTAa cmAGTTGT AGTTTCAAGT 
CTAAGTAGCT CAGCAAATGC ATCACAAACA GATAATGGCG TAAATAGAAG TGGTTCTGAA 
GATCCAACAG TATATAGTGC AcTTnCAACT AAAAAATTAC ATAAAGAACC TGCGACATTA 
ATTAAAGCGA TTGATGGTGA TACGGTTAAA TTAATGTACA AAGGTCAACC AATGACATTC 
AGACTATTAT TGGTTGATAC ACCTGAAACA AAGCATCCTA AAAAAGGTGT AGAGAAATAT 
GGTCCTGAAG CAAGTGCATT TACGAAAAAA ATGGTAGAAA ATGCAAAGAA AATKSAAGTC 
GAGTTTGACA AAGGTCAAAG AACTGATAAA TA-TCGACGTG GCTTAGCGTA TArTTATGCT 
GATGGAAAAA TGGTAAACGA AGCTTTAGTT CGTCAAGGCT TGGCTAAAGT TGCTTATGTr 
TATAAACCTA ACAATACACA TGAACAACTT ITAAGAAAAA GTGAAGCACA AGCAAAAAAA 
GAC^TTAA ATATTTCGAG CGAAGACAAC GCTGATTCAG GTCAATAATG CTCATTGTAA 
AAGTGTCACr GCTGCTAGTG GCACnTTAT AAnTTTAGA TCACGATATG ATTTATTATC 
AATTCAGAAT TAAAAAAGTA AATAGTATCA AAAGTAAGTG TATTTAATAT TAGAAAATAA 
AAATTTTAAA TTTAGTATTA AAATGGAATG TTACTATATA GTTCAATCTG TATTATCACA 
GAAAATAAAA TAAT^^CTTTA CrTCTATArT TAAAAGTGTA TAATGAAAGT TAAGTAATAA 
AGAGCGIK3AA GAAAAATGTG AGTTATTrAT ATAGAATATT CTCCrTTTCA TTTATGAATT 
TGTTACAAAA TATTTAGTGC AAAAGCACGA cGGAGGTATT CAATATCaAT AACGGTACAG 
TTAAATGGTT TAATGCAGAA AAAGGTrTTG GTTTCATCGa AAGAGAAGAT GGTAGCGACG 
TATTCgTACA CTTCtCAGCA ATCGCTGAAG ATGGATACAA ATCATTAGAA GAAGGCCAAA 
AAGTTGAATT CGACATCGTT GAAGGCGACC GTGGCGAGCA AGCTGCAAAC GTAGTTAAAA 
TGTAArrrXA ACrrATICAA ACAGTCCTTA CTATAGGGCT GTrrmTAT GCTTTAAATC 
GATAACAGTT GGTGTGGTAA AAGCACTAGC CGTTATTTrT TTCTCCAATA AATTTAGTTG 
GAGATTTAAC AATATATAAT GGTrCTAAAA TAAATCGAAC OXJAl^SGAAAA GXTrrrTACT 
TTTCATCTGT CCGACTTTTG ATnTGAATA TAAAAAAGCG CCAATACAGA ACTTTAATAA 
TGACGAGAAT TAAAGTCTGT ATATGGCGAT AACAAGAAGT AATGTTAAAC ACTCAAAATG 
TTTAACAATA ATAGGATACC ACATCGCATA ATATCTTACT ACTTAATTAA TAATTTAACT 
AATCAACrrX TTGTTAATTT TTTATTAAGA CTGATTAATT ATTGAGAATA TTTATTGTrT 
riAAAATCTC ATAATAATTC AGTAATCTTG TmCATTTA AAAGGCGAAA CATTAAAATA 

.^..^^^jvAA-^A -"Ar-TTATTTr TAATGCTTTG 
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ATTTCTGTGA GTATTTGGAA GCTACCATTA GGCAACGGTT TAACAATAGA CAATTGCTTT 24 6 0 

TCCGCTTGTT GTATTAAAAA AGGTTTTGTA GATTGATTAT TAATATGCCA TTCACTCATG 2 52 0 

TATGTTTTTC ACTCCTGCTT TAAAATAGGG TTAGAAAGTT TATAGTTGAG ACATTCATGT 2580

TCAACCAAAA TTTTGTTCGA ATTCAATAAA TGTCTTGTTT AAAATAGAAA TATTGTAAAT 264 0 

GTTATCGTCC AAAACTTCAC CAGTTAAGTA TTTGTTTTGA ATTAAAATTT GGCAGTTAGT 270 0 

TAAGAAGTCT TGATAATCAC GATCGCAAAA ATAGTTTTCA CGTGCATCTT TAGCATCGCC 2 76 0 

AAAAAAGTTA GCGACTGTTT CTGTTTCTCC TTTATTCGAA CGTTCAATAT ATAATTTGTA 282 0 

AAATTTAGCT ATTGTATACT TTTGTTCTTT AGTTAGTTCA TTCAAAATAT TGGGCCTCCT 28 8 0 

GAAATATCAT TTGTAATCTA TACCCAATTT ATTGCAAAAC AAAAACTAAT TTAACTATTT 294 0 

GATGAAACTG TGTTAATAAg CTTTAACAAG CCTTAGTrTG TATGGATCTA TAAAATTATC 3000 

TTTAATTGCA TAGGGTGAAA TAATATGTAG TCCATAACTT TTAACTGATT TTTCACTTAC 3060 

ACCAAATTTA TAAGCTTGGT AGATAATTTT AGTACAATAC GTAAATTTTT TGCTGTTCAA 312 0 

ATTTAATGTA ACTAGATAAC GATGATTTGT ATTCTCATAG TTTTTCTTAA CCCATTCAGC 3180 

CGCTTTTTTA CCTGCACCAG GATAGCTGCA ACGATAAACT TTCATCCAAT CATTTTTGCC 3240

ACTTGCATAA TTATATTTAA AAGATTCGAA GGATTGTGTA GTTGGTTTGT CGCCAGGCCC 33 0 0 

CTCAATTTGT AAAATCGTTT TATCATCAAT CGCGATACTA CAATGACCAA AAAATCscCA 3360 

CATGACAGGG CCTTTTGTAA CAATAATATC ACCAGGTTGT AATTGGAATT TGTCATCTTG 34 2 0 

AATTTCTGAA TACTTATTAT CTGCAATTGT TTTTGGTGAG TTTATTGOGG ATACGACAAC 34 80 

GAATAATATA AGTAAAATTA TCGTTCGTTT AATATAGTTC ACTTAAAAGC TCCTTGTTGA 3 54 0 

AGAAATATAT GTAAATAGTC TTAAATTAGA ATTGTAATCT TTAATAAGCT TGtAAGACTA 3600 

AAACATATCT TAAATATTAA AGTATGAGAG TGTGAAATGT CTATTAAGAA TnAAAAACAG 3660 

TCTGAAACAT CATTGAGACG TTCCAGACTG GATATAAAAT GAATTTCATT TATAGCACA 3719 

(2) INFORMATION FOR SEQ ID NO: 345:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1676 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 345:
TTGCGTTGCC GCACCAAGAT ATTGAATGCC TAGCGATTCC GAGTATGCAA ACTGAACGCA 60 
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CTCTTTAATA CGCGTTATCG CTTTTTGTAA ATCTGCATCA 
ATAGCCACCT GCTGTCGGAA GATCATCTCC CGGCTTAATr 
CTTAATCTCG riTTCGAATT GCTCAATTTT TACAACTTTT 
ATTrGTCCAT TCAGGTATTC TTGCTTTATT ATTTAAAGCA 
TGCATAATAT TGCTGATTCA AGATTGTTTC ATCATGAATA 
ATTrrCCTTA TGTGCCAATT GTrTGATCAA GTnTCGTAA 
ATAAATTGAG TTCGGtACTT CCTTACCAAT AACTTGAAAT 
CGCACTAGCT TCGTGAACAA TAACAGGTAA TTGATTTGCT 
TAAAAAAITA GATTGATGTT CGTCCGGTrT CAACCATGGA 
TGACGTATAG ACAACATCTT TGTCATATAA ATCACTTAAC 
ATTArrrGTC ATTACTTCCC ATTCCCTTTC AAATGCGCAT 
TAAACGTCTT GArTTGTAAT TAACTCTAAC CCCATCAACG 
ATTAATGCTT CATCACCATG TACACTCGCA GCCGCTTCTC 
ACTAAATTAC GTGATCCTAT TTTAATATGA GGATGTATTG 
TTCCCTGTAT CCGTAGAGCC ATAACCAAAA TCATCATCAA 
TCAGCATATT TAGCAAATAA ATCATCTAAT TTCGGCGTTT 
TnTGAATTC a^rCAA-ATTC ATAATCACAA CCAGTCTGTA 
TGAriTACTT nTCTGTTAA TATATCCAAT TCTTTACGCG 
CGAGCATGTG TATAGTCTGG AATAATATTA GCTGCTTTCC 
TGCACACGTT GATCTTTTTT AATATGTTGT CGTAGTTGTG 
ATCATAGCGT CTAATGCATT TAACGCTTCA TCTGCArrTT 
CCGTAAAATT TAACATCTAA AACATCCACT GCCAAAGTAT 
TTTCCCGGAT GAATCATTAA GGCAATGTCT ATTTGATCAA 
GAAGCTTTAG CGCTACCATT TTCCCCACCT TCTTCAGCrG 
TTACCACCAA TTTGGTCAAT CACXTGCTTC AAACCAATTG 
CCAATGATAT TATGACCACA AGCATGACCC AATCCTGGCA 
(2) INFORMATION FOR SEQ ID NO: 346: 
(i) SEQUENCE CHARACTERISTICS:



TGATAACAAA 
ACTAACGGGA 
CTTTTTGGTA 
ACAAATAACG 
TATTGAAAAT 
GATTGCTGAT 
AGCTGATGCA 
ATTAATAACT 
TTCGATATAT 
GTTAAGTTCG 
GCTCTTCAAT 
CCATTATTTT 
TAAATCTATG 
TTGGCACAaC 
TAACTGCTTC 
TAATGAATTC 
TCGCAGCTCC 
TCATTGCTCT 
CGCCATCTAA 
CTACACCATT 
CAGAGGCATG 
CAATCGTTTT 
TCACACCAGC 
GACATCCAAG 
CACCAAGAAC 
AAGCATCGTA 



TCATAACGCC 
ATTCCCAATT 
AAAACTTCCC 
TTTTATCTAA 
AAATCTTTTT 
TGTTAAATGT 
ATTTGTCTGT 
CCCTACCAGT 
ACGAAGGTCT 
GCTCATTACC 
AATGTCTTGA 
AGCGCCTTTA 
CGTATGTCCT 
ATgaCTTACG 
ACCAACTTCT 
ATTCACACCG 
ACGTGCGATT 
AGTATAAAAA 
AATCACACCA 
AAAATAACTA 
AGCACTTTTT 
ATAAGTTTCA 
CTTGACATAA 
AACGACTACT 
ACTTGCAGTT 
TTCTGC 



180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1676 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 346: 

TACAGTAGGA ATCATAAAAC CTAATACAAC AAATACAAAA CCATTTAAGG CATAACTAAA 60

TGTGTTCCAA ATTTGATGGT AATTCATTTG TAGTTCCGTT TGTGCTCTAA TTAAACGGTC 120

GCGTTCTAAA CCATGGATTA GACCTGCGAT TACAACTGCA ATGATACCTG AAGCATGAAC 180

TTCTTCTGCT AAAAAGTATA CGACAAAAGG AGTTAATAAT TGAATAAAAG TTAAGGTATT 240

GTTATCTTTT AAACCTTTAT TAGCGGTTAA GTCTATACGT ATTCTAACGA CAACGAATCC 300

AATAATTGCA CCAATAAGTA CACCTAGTAT TGTTGAAATG ATAAATTGTT CAACAGCTTG 360

GAATAATGAA AAGGTACCAG TTACTAATGC AGTAACAGCA ATTTTAAATG AAATGATACC 420

TGCTGCATCA TTGAGTAAAG ATTCACCTTC TAAAATCGTC ATAGAACCTT TAGGTAATAA 480

TTTTCCGCGT GTAATAGCAG ATACTGCTAC TGCATCAGTA GGACATAAAA TTGCTGCTAT 540

TGCAAAAGCG GCTGGCATTG GTAAGGCAGG CCAAATCCAA TGTATAAAAT AGCCAACACC 600

GACTACAGTT GCAAACACTA GTGCCATTGA CATTAATAGT ATAGGTTTAC GATATTCTAA 660

TAATTTTGTT CGAGAGACGT GGGTACCTTC CACAAAAAGT AGTGGCGCGA TAACGGCAAA 720

CATAAATACT TCAGAATTGA ATTGGAAATC AACTTGTATT GGAATAATGA AAATAACGAC 780

ACCTAATGCA ATTTGAATAA AGGCAGTAGG AATTTGTGGG AATCGATTAT TGATAACCGA 840

ACTAATAATC ACAGCAAAAA TAAAAATTAA AAATGCTTCT AATAGTGCCA TACAATACTC 900

CTCAAAATTT TAATAGTTAA TATTTTATCA CTTTTAAGGC ATAATGACAT AGATATATTG 960

ATAAAATGAA GTTATTTTCA AAAAAACTCT AGTATCGGTT GAACTGATAC TAGAGCGAGA 1020

TGTTTAAATT ATTGATTGTc ATATCTGAAA TGACCGCTGT CATTTTGTCG TTGTTCATAC 1080

GCGAGCTTTT CAGCATTCGT TTTGTATTTT TtATAAAAGa AAAATAeAAA TATnAACCaG 1140

AATGGCGAAA TATAAATAGC TGCTCTTGtT TCGTCACTAA AGAATAATAA AATGAATACA 1200

AAGAAGAAGA ACGCTAGAAT AATGTAAGCA ATAGGCTTAC CACCAATCAA CTTAAATTTA 1260

CTGTTTTTAT GTGCCTCAGG ATGCTTTTTC AAAT 1294
(2) INFORMATION FOR SEQ ID NO: 347: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1935 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
ACATGATAAT GATGACGCTA TTAAAACACG TTTTTTATTT rrCATTGTTA TAACCTTCTT 60

TCGTATGATT GATATTTGTT GATATGTATC GACATGTGAA TAATATCACA AAAACAGAGA 120

ATATATATTT AACTATTTAT TAAATGATTT TGTTAATATT ATTAAATACT TTATCCTCTT 180

TAAAAATAAT GTGTGTACAA AGTCATTAAT TTAGCAAATA TTTTTATTTA GTAGTTAATA 240

ACCATCGATT TGAAATTTAT ATATAATTAT TAGCTAAATA ATATCCTGCA TCTTTCTCAT 300

ACAATTTACT ATAAAtTAgC ATATCCGATA TCAGCGTTAA TAAGATCGTT GATACTAGmC 360

AGTTAATTTC ATAGAACGAA ATCAAATAAC ACACTACTTT CTGCATTTTA AATTATGTTT 420

AAGAATCAnA ATTATGTTTA nATAAATATA TATACTACTT TGAAAGGTGT GAGCTTAATG 480

ACAACTTTTA GTGAAAAAGA AAAAATTCAA TTACTAGCAG ATATTGTTGA ACTACAAACT 540

GAAAATAATA ATGAAATAGA CGTTTGTAAT TATTTAACAG ATTTATTCGA CAAGTACGAT 600

ATTAAATCTG AAATTTTGAA AGTTAATGAA CACCGCGCCA ATATCGTTGC AGAAATCGGT 660

AACGGCTCAC CTATACTCGC ATTGAGTGGT CATATGGATG TTGTTGATGC AGGAAATCAA 720

GATAATTGGT CATATCCCCC TTTTCAACTG ACAGAAAAAG ATGGCAAATT ATATGGCCGA 780

GGCACTACAG ATATGAAAGG CGGTTTAATG GCTTTGGTCG TATCTCTAAT CGAATTAAAA 840

GAACAAAATG AATTGCCTCA TGGAACGATT AGATTACTGG CTACTGCTGG CGAAGAGAAA 900

GAACAAGAAG GTGCCAAATT ATTAGCTGAT AAAGGCTATT TAGACGATGT CGATGGCTTA 960

ATTATTGCTG AACCAACTGG ATCTGGAATT TATTATGCAC ATAAGGGGTC TATGTCATGT 1020

AAAGTAACTG CAACTGGTAA AGCTGTCCAT AGCTCAGTTC CATTTATTGG TGACAATGCA 1080

ATTGATACAC TGCTTGAATT TTATAATCTA TTTAAAGAAA AATATTCAGA GCTTAAACAA 1140

CAAGATACTA AACATGAATT AGATGTTGCG CCTATGTTCA AATCATTGAT TGGAAAAGAA 1200

ATTTCTGAAG AGGATGCAAA TTATGCATCT GGTCTTACAG CTGTATGTTC GATTATAAAT 1260

GGCGGCAAAC AATTTAACTC TGTACCAGAT GAAGCTTCAC TTGAATTTAA CGTAAGACCA 1320

GTTCCTGAGT ATGATAACGA CTTTATAGAA TCGTTTTTCC AAAATATCAT TAATGATGTG 1380

GATAGCAATA AGCTTTCACT CGATATTCCA AGCAATCACC GACCTGTAAC AAGCGATAAA 1440

AATAGCAAAT TAATTACTAC GATTAAAGAT GTAGCTTCTA GTTATGTAGA ACAAGACGAA 1500

ATATTTGTTT CAGCGCTTGT AGGCGCAACA GATGCCTCTA GTTTCTTAGG AGATAATAAG 1560

GACAATGTTG ATTTAGCCAT TTTTGGACCA GGTAATCCAT TAATGGCACA TCAAATCGAT 1620

r.r^TATATTG AAAAAGATAT GTATCTGAAA TATATTGATA TTTTTAAAGA GGCTTCCATT 1680

TCAAATATCA ACAAGCACAT TTTCATTGAT TAAGTGATGT AAAACTGAAA TTATTGTGCT 1860

GATTTGTCAT ACATATATTG ACTAATGGGC ATATAAAAAG ATAGCCTCTA ATAGTnACAT 1920

AAACTCGTAA AAnCC 1935

(2) INFORMATION FOR SEQ ID NO: 348:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1351 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 348:

CCTTTnCCTA AACAATTTTT AGATTTAGAC AACAAACCGA TTTTAATCCA TACATTAGAA 60

AnATTTATTT TAATTAATGA TTTTGAAAAA ATTATTATCG CGACGCCACn ACAATGGATG 120

ACGCATACGA AAGATACACT TAGAAAATTC AAAATTTCTG ATGAAAGAAT TGAAGTCATT 180

CAAGGTGGTA GCGATCGTAA CGATACAATT ATGAATATCG TTAAACATAT TGAATCAACA 240

AATGGTATTA ACGATGACGA TGTCATTGTG ACACATGATG CAGTTAGACC ATTTTTAACG 300

CATCGTATTA TTAAAGAAAA TATTCAAGCT GCTTTAGAGT ACGGTGCAGT AGATACAGTG 360

ATTGATGCTA TAGATACGAT TGTTACATCT AAAGATAATC AAACGATTGA TGCAATTCCA 420

GtGCGTAATG AAATGTACCA AGGTCAAACA CCTCAATCGT TTAATATTaA TTTATTAAAA 480

GAaAGCTATG CACAGTTGAG TGATGAGCAA AAGAGTATTT TATCTGATGC TTGTAAGATT 540

ATTGTAGAAA CAAACAAACC GGTTCGACTT GTAAAAGGTG AGTTATATAA CATTAAAGTA 600

ACAACACCTT ACGATTTAAA AGTAGCGAAT GCTATTATTC GAGGTGGTAT TGCCGATGAT 660

TAATCAAGTA TATCAATTAG TTGCACCTAG ACAATTTGAA GTTACGTATA ACAACGTAGA 720

TATTTACAGT GACTATGTCA TTGTACGTCC TTTATATATG TCAATTTGTG CTGCCGATCA 780

AAGATATTAT ACTGGTAGCC GTGATGAGAA TGTCTTATCT CAGAAATTGC CAATGTCTTT 840

AATTCATGAA GGTGTTGGTG AGGTCGTATT TGACAGTAAA GGTGTGTTTA ATAAAGGTAC 900

AAAAGTAGTT ATGGTACCGA ATACGCCGAC AGAAAAAGAC GATGTCATTG CTGAAAACTA 960

TTTAAAATCG AGCTACTTCA GATCAAGTGG ACATGATGGG TTTATGCAAG ATTTTGTGTT 1020

GCTAAATCAT GATAGAGCTG TACCACTACC TGATGATATT GATTTAAGTA TTATTTCATA 1080

TACAGAGCTT GTAACAGTAA GTTTGCATGC TATTCGTCGT TTTGAAAAGA AATCTATTTC 1140

AAATAaAAAT ACATTTGGTA TTTGGGGTGA TGGTAACTTA ggTTACATTA CAGCCATTTT 1200
CAGTCACrrC TCATTl^TTG ATGA^TCTT CrrrATTAAT AAAATACCTG AAGGCTTAAC 1320

AriTGATCAT GCATTTGAGT GTGTGGGTGG T 1351

(2) INFORMATION FOR SEQ ID NO: 349:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 411 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 349:

TCATCAAGTC TACGATAAAT TAAGTCCATA TCTAAAGGCT CGGGGTCGAC AGTTTGTAAA 60

GTATAACCAA CTGCACAGTG GCTACAACGC ATATTACAAA GATTTGTAGT TGTAAATTCG 120

AAGTTAATTG GCCATGTTCT TTAACATCGT TATATGCTTC CCATGGGTCG 180

TTTTGAATAC TTATTTTAGG CTTGTTATTA CGCATTTTAT AAACTCCTTA ATTGTTATTT 240

GATACCAATT TGATACCGTT TAATCAAATA TGCTCATAGC TTGATGTTTT TTATCAGTAT 300

ATAAATGAGA GTACGTTTGA ATTGTTTCTG TAATGTTAGA ATGCCTCATT AATTCCATTA 360

ATAAATACAT ATCTACACCA TTATTAATTA AaTAGCTAGC GTACGAGTGT G 411

(2) INFORMATION FOR SEQ ID NO: 350:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1639 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 350:

TCATTTTCAT AGGTTATTAC GCAGATCAGC ATAATAATCC ATTCCATATG AGTcCTTATT 60

TTGGTTATGC AGCACGTCTA TTGGCAACAA GTGGCATTGA CTATACGTAT GTAAGAATGG 120

CAATGTACAT GGATCCACTT AAACCATATT TACCAGAATT GATGnATATG CATAAACTGA 180

TTTATCCnGC TGGCGATGGT CGTATTAATT ATATTACTAG AAATGATATT GCTAGAGGTG 240

TCATTGCTAT TATTAAAAAT CCAGATACTT GGGGCAAACG CTACTTATTA TCAGGCTACA 300

GTTATGATAT GAAAGAACTT GCTGCAATTT TATCTGAGGC ATCaGGCACA GAAATTAAAT 360

ACATTTGCAG AAATGTATGA TGAACCTAAA GGCTTTGGTG 420

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TTAATAATAA AGGAGCGTTA TAGTGAATAT CATCTCAACA ATTtTAATCA TATTTGTGGC 6 00 

ATTAGAGTTT TTCTATATTA TGTACCTTGA AACGATTGCT ACAACTTCCA AAAAGACTAG 6 60 

CGAGACATTT AATATAAGCG TCGATAAATT GAAAGACAAA AATATTAACC TACTTTTGAA 720 

GAACCAAGGC GTATATAACG GTTTAATCGG AGTTTTGCTA ATATACGGTT TGTTTATCAG 7 80 

CAGTAATCCA AAAGAAATAT GCGCAGCTAT TTTAGTGTAT ATCATTGGCG TTGCTATTTA 84 0 

TGGTGGCCTT TCAAGCAATA TTAGTATCTT TTTCAAACAA GGCACATTGC CAGTATTGGC 900 

ACTCATATCA ATGCTTTGGT AAGTATTGGT GTTTGGGGGG GTGGAGATGT AGTCGGAGGT 960 

TTGGAGGATT TGAGCGAATT GTGTGTGGAC TTTAGACTCA GAGTATTTCA TCCTAATTAT 1020 

TTCAAGCAGA GGTGACAGTA GCGTTGCCTC TGTTTCCTTA TAAAAAAATT ATTTAATGAA 1080 

GAAAACCCAT ATCTGATTTA ATTTTCAGCT GATAAATACT CCATATATTA GAATGGCTAC 114 0 

TTTATCTATT GCATCAATCC TTTAAAACAA AAAACCCATG ATTTCGAAAT TCCCGTATGA 120 0 

TGGGGTTCCT ACTCTCATGG ATCAGTTAAA TAAATATTAT CACTATCAGT TTATTATTTC 126 0 

AATATTATTA ACAATATATG TAGTCGTAAA AGGAAAGAGG ACATGAGAAC TTCGGTGTTG 13 2 0 

ATTGGCATTA CATAACGCTT CCAAACATAT TATTTGGTAA CAATAAGAAA CTATTTACAC 13 80 

AATATATTTT GTATAGTAAA ATTATTTTAT AATATTTAAA TCCAATTGCA CAAGGAGTGA 144 0 

TTATCaTGGT ACCAGAAGAA AAAGGTTCTA TTACTTTGTC AAAAGAAGCA GCTATCATAT 150 0 

TTGCAATCGC AAAATTCAAA CCATTTAAGA ACAGAATTAA AAATAACCCA CAAAAAACAA 156 0 

ATCCATTTCT TAAATTACAT GAAAACaAAA AATCTTAATC ACTTTtATTT ATAGcATTTC 162 0 

^5 TAATCTCAGA AATGCTATA 163 9 
(2) INFORMATION FOR SEQ ID NO: 351: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1816 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 351: 

AAAATCGCAT ATAGTAATAT GAATAACCAG ATTGTATCTA CAAAAAAGTA TATTGAAAAA 60

CCPJKQCGCAC CCATTAATAA TGCGAGAATA ATAATAATTT TTCTATTAAA GTGATGCGTA 120

TCCGAAAATC TAGCAATAAT TGAATTTACT GTAAACTGGC TAATCGCTGC AGATGCTAGA 180

AGTAATCCAT ACTGATTTGT TGTCATACCT AAATCTTTAG TTGCAAAAAG AACAAGATAT 240
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TTCT^ATTT Ot;.T«.CGC TOC«.C.T» TCCTMCCC a=CTTCrrAa AGccccrrrr 
„rmAT,>AA rlAGGOGCTC TTATOCAOrt OOTOCATTAG C^CCAACIO TArrCCTTTG 
XCCCCTTTTA ATTIATraAT TA=«=OCTCT T™<rrOTrO GTOCATTAGC AACCAACTAC 
OTTCAATTTA ACCGAATACT TTAAAATTAA ATAOAAAOT TAAATTAGTC TAAAACTACG 
CCrrreOTTG TTCAACAAAG CTCGCCATGA GATTTACAAA AGAATCAA.. TGTGGCAATT 
.CAACATGCT CGGATCATAA CrCATAAA,.= TCGAACGAAT CAGCOGTTCA TTATCAATTr 
CTACTTTTTC AAACTCAAAT TGTTCTrtGC TGATATTTTT CATCATAATT TCl^CAAGA 
riGTAACACC TACACCAGTA ATCAACATTT omGCAAGT TGGTAGTTGA TCCACTGTAA 
TAGTTGCATG GTAATCITGT TGTAAATTAT GGTTATACCA rrCnTTATT TGATTTATAT 
AAATCGGATC AGCTTGAAAC TCTATAAATG GTAACmGT AACATGA«:T CGTCTATTTT 
™-...AAAAT AAAATAATGA TCATCArTAA ATAAA^^T GTTAGCTAAA TTCATTACCT 
TATTTCCACG AGTTATCATA ACA^TAAT CTCTATGATT IGCITTAATr XGTTCAGTTG 
AACCAACTIG CACTTGTATT TCAACATTAG GAAArrGGGC ATTATATAGG CTCAAAACrT 
CAGGAAGTAA GGTTTGTCCA ATCAAAGAAG AACACCCGAT TGATA^TT CCATTCACTT 
CACCAATATG TGCCTGCATT TTOTCAAAAA ATAATCGCTC TCTTTTCAAC ATGTGA03AG 
CATGCTCAAT AATCATTGTt CC^CAGTTG T^AATGAA TTGTTTrm OTTG^ATAA 
AAATATGTAC TCCAAAAGCA TTTTGAATAG CTITTAGTCr TOTGTAAGA GCAGGTTGAG 
ATATATATAA AATTTCAGCC GCTTTACGTA ACGTTTTCGT ^GTCTAAT GTTATTAGTA 
AACGA.AGTC TTCAATCTTC ATAATTTCGC CCCAT.AATT A^AT ^AACTTTGA 
TGGCTACAAG CATTCATGAG TTCATTACTA AC^AATAATT TCACGAATTT TATTGOTATG 
GCTGCAGCTT GAArTACTTA GrrrTTCm TG^TTGGT GAT^AGT TTGATTATAt 
tGCTTAGGCT TTATTTGTTT GCTT^TC* "ATTAGTT. TATTTTGTGG CTTT.GATGA 
„^AG CCT^GCATT AATTTTA™ AAGCAGTACA TGATmCTT TTGGAATCCT 
„AAAAT=AT TTTCTAACTC TGCCATAATT TGATC^CAA XCATATATGC TTCATGAAAT 

xGcrrrr™ taatt^^ctc acttxctaat ggaaacatta aatcatcttc atctaccaac 

TCATaTcACC ACTTGG 1816

(2) INFORMATION FOR SEQ ID NO: 352:

(i) SEQUENCE CHARACTERISTICS:

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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 352: 



GCGTTGTCGT CGATGATTAA TAAAGTATGG GTATACCATT AAGAATAACJG CTACCCAAAT 60

gAkTGCTAGT GACGTGCCGC CAATGACATC TGAAAAGTAA TGTGCATGAA AATAAAGGCG 120

ACAAAATAAT ATGCTAAGCC ATAATATTCC CATAACCAAC GCACTCAACA CTTTTGTTAT 180

TGTCTTAGCA GCAAGTGAAA TAATAATGAT CATTAAGGCG AAATATAATA ATGTGCTGGC 240

GTTGGAATGT CCACTCGGAA ATGAAAAGCC TGTATCAACG GCTAAATGAT TATATGGTCT 300

TGGACGTAAT ACAGTATCTT TAATTAATTT GTTCATGATG ACACCTGAAA CCAAATATGT 360

CACAAACCAA ACCGCTAAAT G CCT CTGTTT AATAAACAGT ATGATTGTGA CGATAATGGA 420

AATCAACACG ACACCTTTGA CATCTCCAAT TTCCGCACTA AACGTCATAT AGTAATTAAA 480

CAAATTGTTA ACATACTGAC GTTGTGGCTC ACCGAAATAA TCTGTAAACC ATGTTAATGA 540

TCCCATATCT ATATTTTTAA C^C C*ATTCT*TG ATTTGTCACT ACACTGTAAA ACATACCTAT 600

AAATACAATC AGCGCGATTA CAPTGTcATT TTCGGTGATG X X X SJ* * X TTAATTTTTT 660

ATCTATCATC TTACAATCTC PTCGTATCAT CATTTTCATT TTACAAATGT TATCCATAAT 720

ATCAATGTGC CACAAATTTC ACTTTACCGA CAATATCAAA ATTATAAAGT TCATATTGTT 780

ATGTATATTG CAAATAAAAC ATTGTATAAT TGAAATAACA ATATTTTGCT ATTTTCAATT 840

TAGTACGATT TATATTTATT ATACAGAGGG GGTAAGGCGT ATCAATAGAG TTATTTTTGT 900

CTATATAGCG TTAATCATTA CATTAGTTAG ATTCTTTACC CCTATTCATC CATCATTTTC 960

AAATTTGATT TACTGGATAT TTGTATTATA TTTTATTCCT ATTATACTAT GCGTTATCGG 1020

TTTCAAGGCC GAAAAACTTA TTGCAACAAT GGTCATTATA CCTAATTTTT TAGGAATACT 1080

TTATCGATTA TATGCCTACX3 TCACACATAT ACTCTTTATG TAAAAGGATA GTGGGCATGT 1140

CTCGCAACAA ATATGCTTGC GTCGACTTGT CACCATTCGC AAATTTTATG ACATACGCCT 1200

TTCACGGGCT ATATTCAGAC CCACGCATTC ATCCACGTAA TAAACACATC ATGTAATAGA 1260

AAAACAGCAC ACCCAAATAT ATGGCGTTGC GCTGTTTAAC CAAGCATACT TCTATAGCTT 1320

TAATAAGCCA GCAGAAGCAT ACCTAACCTT CTTAAATATG CTTTTCCAAA TTATCCTCAA 1380

GTTTGAATAC GATAATACGT TCACCTGTAA CTGTACTTAA ATCACTATGG AAGCTCATCA 1440

CTTTGATACC TGTAATTTTA AAAATGATAT CATTCAAATC TTGCTCACCG GATTCAACTA 1500

ATTCAGAACG TGTTCGTTTA ATATTTAATA ATCCTTCATT CGTACTACAT ACACGATATT 1560

CAGCTGGCGT TAAGATACCT TGTAAACTAA TAATCACCAT ATCTCTTAAA ATGTCTGATT 1620
TTTCAGCTTC GATTTCACCT TTCGTTCTTT TCATATCACT AACTCCAATA ATATTAAAAT 1740

TGATTACTTC ATCTTTGTAT CGTTATCACG ACATCAACTA TTACATTAAG TTTATCATTT 1800

TTAGTATATT TTAAGAAGCT AGAACATTGT AGATATGATG ATATATTAGT TACTTAGCAT 1860

CGCAACATAT CATCGTTAAA TCCAACTTTT AAAACGCCCT TCCTCATTAA CGCTCATTAA 1920

ACGCAGCCAA TGATTAGACA CCTTCCTAGC GAAATGCTCA TTATTCGCGA GTAGTCTTGC 1980

TACAACATAG TCGGGTGCCT GAATAACGAC AAGTAAACGA ATTGGCGAAT GATACATCGT 2040

CCGATCAGCA GCCATAACAG ATTGCCATGA TAAGCCATAC ATCAGATCAC TCGCATTACC 2100

TTGCATGACA CCAACACCTG ACGTGACGGT TTGTGTCGCT TTATTCCCAC TTCCGTAAAA 2160

ATGCGGCGCA ACTGTCGACG CATAATATTG TAAATTAATC CATTGTGCCA CAAGTGCCGG 2220

ACCAGAAATG ATGGTATTTA ATAATGTGCC ATCTTTATCT TTACGCCAAT CATAATTGTG 2280

TAAAAATGTC CGCCCTTCTA AATCAATGCC TTTTGTTAAT TGGCGTCGTC CAATTATAAA 2340

TGATGCATTT TTAGCCAATC CCCATTCTGG ACGTACCTCA CTCCAATCAC TCGCAAACCG 2400

CTGCGCTTCT TCCACTGGAT GATTCACACG ACCAATCGTT GGCAGTTTGT CCAAACGTTC 2460

GCGATTCGcg TGtTCAGAAA TCATCGGCAT CGCGTCATTC AATGATTCAT ATGCATCTAA 2520

AGCAATAGAA GATAATGTGT CTGGCACATA TACCCATGCC AACGTATCAG TAGACGTATG 2580

ATGTTCTGCT ACCGCAAAAA CAGTTGTCTC TGGAATATAC ACACCTGATT GTTTTAATCC 2640

TTGTCTGACA TTTGGACGAT TACATATCAT CGCTAATAAC TTAGCATTAA AACCGCTTGA 2700

TGCGCCACCA CAAGCCCCAC ATTCAAGTGA TGCATGATGT GGATTATTGT GAGAATGACT 2760

AGCATGACCT GCTAACACAA CGAACGGCGC AAATGCTTCG GTTAAATCCA TCAATTTCAA 2820

CGCTTGTAAC GCGAAATCAA TTTGCTCTTG CTCAGTAAAT CCAACAGGTA AGTCTGATGT 2880

TCGGTCAAAC TCACGATCAA TCGTCAACTT TGTTTCAGGC TTTTTCAACC ACTTTTGTTT 2940

TATTTTTTGT AAAGACGCGC GACTTTTTCT AGGCATAATC GAATTGACAA TGGTACTTAA 3000

GCTTAAAAAT GGCCCACTTA ATTCAGGCAA TAACAGACTA GGCATGACAT TATTTTTCAT 3060

CAATTTAAAT GTGTAAAACA TCGATGACAT TGTCTGTTGC TGTTGTCGAT AAACATTCAT 3120

ATCGTAGCGG TCTGCAAATT CTTTAATGCG ATATGCCGGC GGTACCATGA CAGGTAATGA 3180

ATCATGTTTG AATTGTTCGT CTACGGCATC XTTTTGAATA GGTAATCCAA AGAAGCCTGC 3240

AATACCAATC GTTTCAAAGG GCCCTGCTGC TTCGATATGT CTACGAAATG GTTCTGAACG 3300

AACATCTATA CAAAATGCAA TTTGCGCTTT CGTTGATGTG CCCATCTGAT TTAGCTCGCT 3360

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TGCTTTAATT TTTTGTTTTA ACTGAGATTC GTATGTCATT TCCCAGGCAA TTAGCCATAA 3 54 0 

ATTTTTAAAT ACATTTTTAT TCATAGTTGC TGCAAAATGA ATAAACGTTT GAATTTCATT 360 0 

GACGTCATGT TGTAGTAATA CATCGCTAGG CATATCACTG TAGTAACACC ATGATGCAAC 366 0 

AGTTTGCTTA AACCAATTTT CCGATCTACT TTCACAATCT TTAGCGACTG ACTTAAACTC 372 0 

ATCACCAACT AGCAATTGTT CGACAACTAA CCGAATTGCC AAATAATCCG TTAACAAATG 3 78 0 

TTGTTCAAAG TGATGCTGTT GTGAACGGTA ATACAACATA CCTGCCCAAC CCGGTAACGC 3 84 0 

CAAAAGATGT CCTTCAACAT AAGCTTGGTA GTCTTCCTGA TCTATTGAAA AATGAGTTAA 3 900 

TACTGACTCT ATCGTCATTT CAGGATCATT GGGTAAGCCT TTAATCACTT GGCGCTGTGC 3 9 60 

TTTAGTAAAA CTATGGTCAT GTTGCGCTAA ATGCAACCAT GCATGGTAAA AACTTTGcTC 402 0 

ACGCTTCGGC ATTGTCCAAC TCGATAGAAA TTGATCGATA TAAAGTTTCG TCCATTTAAT 4 080 

20 CATTTGACGA TTCACTTGTT CGCTAAGTGG CTCACCTTGT TCATCTATTA TTGCATCACT 414 0 

CATCGGACGT ACATCATAGT GATGATATGA TTCAGCCATA TCACGTTTTG ATTTTTCTAA 42 0 0 

TAGTAGATCA GCAACAACAT CAACATTTGA ATGATTCATA TATGATGCAG GTACGTCTTT 42 6 0 

TAATGTTTTA ATGTTATCAA TATAAAGATT GATGTAGTGT TGCGGGATAT TGTAGTGATG 43 2 0 

TTCAAGTAAC ATATCAGTAA CAAGTTGATT AAAGACACTT TCATCTAATT CACCACGTGC 43 80 

CACAGCGCTT TCTATTAATG CTTTATTTGG GAAAATATCC ACATCTCGAA CATCACGTAA 44 4 0 

CCATTTTGCG ACATCTTCAA ACGTATCCGC TTCTAATCCT TCCCATGGAT TTCGTGCTGC 4 50 0 

AAAAATCGAA ATTGGTGATA ATGGTGTAAT AACACGTTTC GCATTTTCAA TGACTGAATT 4 56 0 

35 GATATTTAAC TGTGTTGTCA TACCTTTCAC CTCCTATAAA TACTTCTTCA AATAATTCGG 4 62 0 

ATGACTTTCT ATCGCTTTCG AGCGTGCTTC ACCTAGATTA ACTAACCACA CGTACAATAC 4 6 80 

CGCAAAAGCC TTAGAGTATC GATGCCGCGC CACCCAAATA CTTAATAAAC TGCCAAAGAT 474 0 

TAAAATAACA ACACTAATGA TGACACTCAC TGTAGGCGGC GTTGTCGCAT GTGTTGTTAT 4 8 00 

ATTTTGTAAT ACAGCGTAAA AATAATTATG TGTGATGACG TAGATAAATG TCACGATTGC 4 86 0 

AATCAAAATC ATACCAACAA GACGTGCCAT GCGTCCTTTA CTAAAGGCTA CCATTTGATT 4 92 0 

CCAAGATACA AGTAATGACC ATCCTAGAAT GAGTGCACTT AACACTTCAT ATGCACTTCT 4 980 

GTCACTACTC ATCCAAAATA GAAATGCCAC GATAATAGCT AATACACGTC CCATGACAAT 504 0 

CCAGCCATAA GCGTCTTTAG CAGATGCTTG TTTTGGAATA TTGAATCGCT TCACGATAGA 5100 

ACCTGATTGT AAAAATAATG TTGCTTTAAA AATACCGTGC AATATTAAAT GAATAATCGC 5160 

TGCTGAATAT ACACCCAATG CACATTGAAC TAACATAAAG CCCATTTGAC TCATCGTAGA 5220 
AGAAATACTA GAAAGGATAA GTAATAATGA TAACGCAAAT CCATTATCAA ATATCGGCGC 5340

AAAACGAGTT AGAATAACAC CACCTGCATT CACAATTCCT GCATGCATAA TTGCCGATAC 5400

TGGCGTTGGT GCCGTTACAG ATTCAATCAA CCATCGATGA AAAGGAAATT GTGCTGCCGG 5460

TATCATGACA GCTAATACAA GTAGTACATT CGTCAACAAT GACCATGTCG GATGAACTAT 5520

ATGTTGTGGT ACCCGCCACT CGCCAGTCGC AATATAAATA GTTACAATTG CTCCAACGAA 5580

TGCAAGCCAA CCACATAAAA ATGTCATGCT TGATAATTTC GCAGACTCAC GTGGCACTTT 5640

CCAAAAACGA TTAACGTTCA TCAGCAATGT TAAACATAAT AATGTAATAC CCCAGCAGAG 5700

TGCCATCAGT CTTAAGTCTT CAGACATCCA TGCTAAAGAT GCAAACGACG TAATCGCAGT 5760

GAACAATGGA AAGTAATGTC TATAATGATG ATCACCTAGT AAATATCGCA TTGAAAACTT 5820

TTGAATAATA AAGCCAAGCG CCATTACAAA GCCAGCTAAT AACCAAGATA AACGATCTAT 5880

TTTAAATGGA CCTAAGACAT GTTGACCATG AATACCGAAA AAGCCAATGA CTGCAAATAA 5940

TACTGGCATG ACTAGTATGT ATAAATGTAA TTTAATATAT CTCATTGGCA TAACTGGTGC 6000

TAAAAACAAC AAGCCACTTA TCAATGCAAT GATAAGCGCA ATAACAAACA GTGAAAATAG 6060

CAATTGAAAA CTTAACACTG CATAACCTCC TTATTTCTAA TCTCTCGCAT AATTGCTTAT 6120

GTATAAAAAT AAAAACCTAC AATAGTAGAT TCTGTACATA ATGGCAGAAA ATTTACTATT 6180

GCAGGTTTCA GTTTAACTAG ACACTGCATC ACGGTACGTT GATATACCTT GTTGCAGTGT 6240

TCTCTTTAAG CGTGCTCCCA TGCACATATG TATATAAAAT GTTACTTCTG TCTGTTCAAT 6300

TCATCTTCAT AAATATGCTT TGCCTAGACG AGACCTAACG TGTTATTCGT TTTAAACTTA 6360

TAACATAAAA TATAATTAAA TTTCTGCTTC ATGTCAAATT CATGAGCTTA ACCTCTATTA 6420

AACCAATGAT TGTAAAGATT TTGTAAATGC ACCTGTACAG TTAGGCAGTA TTTCCCGTCC 6480

TTTTAAAATA AAAAATTCGC AGTTATGATC ATAACAATTC AAGTTAGGAA AAAAATCAAT 6540

TACGCACAAG ATAACTATGT ACAATGAAGT TAACTCATAA GCAAAGGAGG TAATCTTAAT 6600

GGGTATCATC GCTGGCATCA TTAAAGTTAT CAAAAGCTTA ATCGAACAAT TCACTGGTAA 6660

ATAAGATTTC ATAACAAACA AAGGAGGTCT TTCACATGGG TATCATTGCA GGAATCATTA 6720

AATTCATTAA AGGATTAATT GAGAAATTCA CTGGTAAGTA AGTTATAAAA ATCTCATAGA 6780

TATGAACATC TTATTTGAAG GGGGCCATTC ACATGGAATT CGTAGCAAAA TTATTCAAAT 6840

TCTTTAAAGA TTTACTTGGT AAATTTTTAG GTAACAACTA ATCTCAAACA TTAACGATCA 6900

ACAACTCATC ACTATGTTAA ATCAACATAC AGGAGGACAA AACGATGGCT ATTGTAGGTA 6960

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TTATTGATGT GAGGTGAGTC TTGTTAGTTT GTTGCAAATA AATGGTCTTG GTGTTTTTTG 714 0 

TATAGGACGT TCTTAGTGGG ACATACGGAA TATTCGTGAT CTTTGTAGTC TGACGCGTTA 7200 

TATTTTTGTG GCGTGTTTTA TGTTTGATAC TCGAGTTCTG AGACATTCAT GATTTGGCAT 726 0 

GCGAAATCTT AATGATTTTC ATGATCTAGC GCAAGATATA TTGGCCACGT GCGGAATTGC 732 0 

GTTGCACGTT TAGACTGAAA CACTCGTGTG ACCGTAAGTG TTAATAGTAC ATTGATAGCT 73 8 0 

GCATTTACTT CACTCATTTT TATGACTGTT AAACAATGAT TGTACCTTCA ATTAACAGTT 744 0 

GGTACGATGG TTTTGCCATT TTTCATCAAC GTAAATATAA AAAGGACTAA GACACATACA 7500 

15 TGTCCTAGCC CTATGGATAA AATGCAAATT TCTGCTTTAT CAAAACTATC ACACTTTAGA 756 0 

TAGATTGAAA ACAAAAAGAT CCTAAGAACA CCTTAACTTT TTATTAATTG TCATAAATTG 762 0 

CAAACAATTA AGCCACAATT CAAAAATGAT TATACTTCAT TCAACTTATC GTGCTGGTCT 76 8 0 

AATTTGCCAT TGATATGGAT CTTCAAATTG TTGCCAATCT GCATCAATTT CTTGCGCATT 774 0 

GACTAAGCAT GCGTCGAGTT CTTTTGTTAA TTTTTCTTCA TCTAATTCTG TACCAATAAT 7 8 00 

GACAAATTGT GTATGACGAT CGCCATATTC TGGATCCCAT TCAGCTGCGA CATCTTGACG 7860 

TTCTGCTAAT ATTTGTGTTT GTTGCGCTTC AGACATACTA GCCACCCAAT ATGTAACTGG 7 92 0 

ATGAATATTG CAAGATGACC CTGCTTGAGA TAATAAACAT GCTACGTGAT TGTATTGTGC 7 98 0 

TAGCCATACG ATACCTTTTG ATCGAACGAC ATTATTTGGC ATGCTTTCTA ACCAATCATT 804 0 

GAACCTTTTA GCATGGAAAG GTAGACGACG TTTATATACA AACGATGATA TACCATATTC 8100 

TTCTGTTTCA GGTGTATGCG ATGCATGCCC ACCAGACTCA AGTTCTTTGA TCCATCCTGC 816 0 

^5 TGACTCGCTC GCTTTTTCAA AATCAAAACG CTGCGTATTC AAGACTTCTT TTAAATCTAC 822 0 

TTCAGAATTT GTTGTCTTAA TAATTTTAGC AGTCGGTTGC AATGCGCTTA ACATTTTTTC 82 80 

TAACTTCGCT AGTTCTTCTT CACTAATTAA ATCAATTTTA TTAATAATCA ATACATCACA 834 0 

AAATTCAACT TGGTCAATTA ATAAATCAGC AATCGAACGC TCATCTGTTT CGTCAACGCT 84 0 0 

TTGATCACGA TCCATCAATA AATCTTCTGA GTTGATGTCA TGTACGAAGC GGTTAGCATC 84 60 

CACAACTGTA ACCATTGTAT CTAAACGGCA AATCGCTGTA AGATCAATGC CAAGTTCATC 8 520 

ATCAATATAT GAGAAAGTTT GTGCAACAGG TACTGGCTCT GAAATCCCTG TTGACTCAAT 8 580 

AACAATTTGA TCGATGCCAC CTTTTTTCAC TAAACGCTCA ACTTCTTTTA ATAAATCGTC 864 0 

50 TCTAAGTGTA CAACAGATAC AACCATTAGA AAGTTCGACT AATTTTTCAT CTGTACGCGA 8 70 0 

TAGTCCCCCA CCATCTGCGA CAAGATCTTT ATCGATATTT ACTTCACTCA TATCATTTAC 8 76 0 

AATTACCGCG ATACGTCGAC CTTCTCGATT TTGTAAAATA TGATTTAACA ACGTTGTCTT 8 820 
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ACTTCAATTT 


ATTTGTAAAT 


AGGAATAATT 


CTGTTTTACA 


TTATATAWjA 




R O 

O 7^ U 




TTTCGCAATC 


TTCGATAATA 


AAAAAATAGT 


ATACTTAATT 


AAA! lAi. loA 


VjV_OV— X X li^^ X 


9000 


5 


TTATAATGGA 


GACAAAGATA 


TATCTCACGA 


AAGAGAATCG 


ACyCj 1 vj i A 1 AM, 


Zk P ATdTT ATT 


9060 




TGTCATTTTA 


GTTTTATATG 


TTACTGGTAT 


TGCATTTATT 




TTTTTGGTTC 

X X X X X VJVJ A A. 


9120 


10 


AAAGACTGAA 


GGATTATCTA 


CGAAACATAC 


TTTATATACC 


Al HjvL-AoiVj 


rr* ATT ATAA 

X^ X xrt X *^ k<w 


9180 


GATTGCTATT 


TTCATTTCAA 


TTGGCTATGC 


CATTCAA 1 AC 




PnPTTTATGG 

V.,0 V« XXX X WW 


9240 




TTTGTAAGGT 


GAAGGTGATG 


AGTAACGGGT 


AGTT CliCiij Avj 




VJVvV9 X XVJAX X A 


9300 


15 


TGATAAAGTG 


ATCATAGCTT 


TTAGTACTTG 


AGGAi 1 1 1 lA 




T A Pn A A T(^Tn 


9360 




GTCATGTTTA 


ATGCGGGACA 


GTAATTTAAG 


TTGTTTTTTT 


IV ^ TV TV T^TV^ ftp JV 

ACAAi 1 VjAVjA 


PT/^TYi A T ATT 


942 0 




TCGATTCGGT 


TCGAATTACT 


TTACATGGGA 


ATAATATAAA 


1 IAAAAAvjAA 


npfinppTAnT 


94 80 


20 


CTCAGTTGTG 


AATATACTGA 


ACATTGGTCG 


CTTTATTTAG 


i Av» 1 A i Vj A i A 


TP T APTTT AP 


9540 




CTATTAATTT 


TTTTCAGGTC 


ATCCTTAATG 


CTGTCTATCT 


CAUACA 1 VjkjC 


APTTTTAAPP 


9600 




CAATCTCCTT 


GAGCTGCACC 


TTTAAAATTA GCTTTAAAAg 


C t TCGCAA lAj 


TTY*' PP P P 21 TT 


7 O u U 


25 


TGTTCAATTA 


ATACTTTTTC 


TTCACCTTTT 


AATCCGTTTT 


CAAIAIUI 1 i 


PTATTTATnP 

1/\1 1 1M1\JV* 


9720 




TTATGTTCAG 


GTGCAATAAC 


TGTGCGAATA 


TTTTCTTTTT 


QyCtiL I i UL-Ai 


TTTAPATATP 


97 80 


30 


AGATTAAGTG 


TTTCTACTGT 


AGTACTTATA 


TCTGGCATTC 




ATCTGGTTCT 


9840 




ATTAGAGTCA 


TTTAATCTCC 


TCCAAATTAT 


CAGTCACTTA 


GCTTATCTAA 


CTGCTTTTCA 


9900 




TAAGACTTTT 


TTAAGTCrrC 


TTTATATTCT 


TCTAATTTCC 


CATTCTTGCT 


TTCTGA 


9956 


35 


(2) INFORMATION FOR SEQ ID NO: 353:









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2411 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 353: 
TTTCTTTTAA CAGaTTTATC cCATTAATTG TTCTACAAGC CCACGaTGAG CAATATCATT

TTTAGCAACC ATTAATAAAC CAGAAGTATC CATATCTATA CGGTGAACAA TACCTGGACG

AATTTCTCCA TTAATACCTG ACAAATTTTT AATTTGATAC ATTAAACCAT TAACTAATGT

ATTGGTATAA TGCCCTGGTG ATGGATGAAC TACCATGCCT TTCGGTTTAT ATACAACTGC

AACAACTTTA TCGTTTGCAA CGACTAAACC TGCTTTAATC CAATCTTGTA TCTGGTTACG 420

AGACCAATCA TTATTTAATT CAGGCAGCAA CTTATCTACA CGCATACCTG TTTGTTCTTT 480

ATCTGTAATG TTAAATTCAT AAGTCTCCAT TACTTAACCT CCTTCTCCTT TTTATTGGAA 540

GTATCCTTTA ATAAGGCAAT AATAATTAAT ATTACACCAA TTGTTAAACT TGAATCTGCG 600

ATATTAAATA TTGGAAAATC ATAACCAAAA ATATTTGTAT CAATAAAGTC AACAACTTCT 660

CCTGTTAAAA TTCTATCAAT AAAGTTTCCA AGTGCACCTG CAAAAAGTAA ACTAATAGCA 720

ACTTGCATAA ACAAATTATA TTGAGCATCT TTAATAAAGA AATATACTAA GGCTATTAAT 780

ATAATAATGG TAATAATAAA GAAAAATGTC ATTTTTCCAC TCAATATTCC CCATGcAGCA 840

CCATTATTTC GATGTGaTGT TATGTTTAAA AAGTGCGGTA TCACTTCAAA TGAATCTCCA 900

ATTTTCATTG TAGTAGCTAT AATATATTTA GTAACTTGGT CAAATATAAC GACAAATACT 960

GCTATTAAAA TGGAAGTGCC AATAAAATAT TTTTTGTGCA TTTTCGTTCC TCCAATCAAT 1020

CGTCCATGAG ACAACTCTTT ATATTATAGC TTACACCTGC TAATAAAAAA AGTAAGCATA 1080

TTACATTAAA TCTAATGTTA CTAACTCAAT ACTTGATAAA CTACTATGTT TTGACATTAA 1140

ATATGAACTT AATTATTCAT TTATCATATT TAAGATGACA TTAAAAATTA GGAAAGCAGG 1200

CTGGAACATA AATCCCTAAA AAGACAGTAG TAAGATATTT TCTAATTAAA AATTATCTTA 1260

CTGCTGTTCT CTATTTATAC AATACTTCGT ATTGAATGGC TTCGCTATGC CCATCTGGCA 1320

CATTACTGTA AAATTCTATA AATAGAATTT TTGATGATGG GTCCCTTCCT AGGGTGCCGT 1380

CTCAGCCTCG GtCTTCGACT GGCACTGCTC CCTCAGGAGT CTCGCCATTA ATACTACGTA 1440

TTAACATGTA ATTTTACTTT TAAATACTTT AAAAAAATAA GACATGAATC GTCTACACTT 1500

AATTGGACAA ATTCTATGAG AATAGATATT GTTAATTTAA GAAAGTAGGC TATTTTGAGT 1560

TtCACTCGAA TGTCAGTTCG AGGAATAAAT AAAGTTAAAC GAGAGCTAGG TTTTGTATTA 1620

ATGGCAATTA ATATAAGGAA AATAGCAGCT CAACGAGCTG TACATTATAA AATACATATC 1680

AAAAAAGCTG ATTTCTATCA AATAATTAAT AGAAATCAGC TTTTTTACAT TGCCTAAGAA 1740

CTTAATGTCC CAAGCCCTAA AACTTGTTGT TATTTATTTG ATTTAGCAGC GATACGTTTA 1800

TATCTTAAGT ACATAAATGC TAAAAGTATA AACCAAATCG GAATAAAATA AATTGCACGT 1860

CTTGTATCAA CATTAATAAA TAATAACCCG AACACAAAAA TGAAGAATAC AAATATTAAG 1920

TAGCCCATAT ATTTGCCACC TAATAGTTTG TACGTAGCAT TTTTATGTAG ATCTGGGTTT 1980

TTACGACTAT AATTGATATA TGCAATGATA ATCAGACCCC ATACAACTAA AAATAACACT 2040

GTAGAGATGG TAGTCACATA CGTAAATACT TTTGTCGCAT CTGGGAAAAT ATAGTTTAGT 2100

rrATTCGTCT TAGAAAAGTT CGGAGGTGCT TGTyGTTGAC TTGATAAACC GaAAAGCATA 2220

CGGCTATTTG AGAATATACC ACTGTTACAT GATGAAGCAG CAGCGGTTAA TACTACAAAA 2280

TTAATCAAGC CCGCAGCAAA CGGAATTCCG ATCAATGCGA ATrATTrmC GAATGGACTG 2340

riATCAGGAT CAACTTCCTG CCAAGGGGTA ATAGACATGA TAACCGCTAA CGCCCCAACG 2400

TTnnATATTA A 2411

(2) INFORMATION FOR SEQ ID NO: 354:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 605 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 354:

GnGAATTATT TTTAATAATG AAAGGATTAC TTnCATGGGT TTTTACTAGG AnTACCCAGA

AGGTCAAAAT ATTrTTCATG CGCTAAGTCA ATATGAAGTT AAGCGACGCG GCGATATGGA

AGAGGATCCA TCATATAAAC AACTCATTTC TTATTCTTTA CTTGAAAATG AGCATGGCGA

GATATTAGTG TATGAACGAT TATCTGGCGG TGGAGAAGCT CGATTGCATG GACAATCTTC

AATAGGTGTA GGCGGTCATA TGAATGATGT TCCAGGAGCA GAATCTATIA ACGAAGTATT

GAGAGTTAAT GCACAGAGAG AATTAGAAGA AGAAGTAGGT riAAGTGAGC AAGATTCACA

AAATATGGAA TATATCGGTT TTATTAATGA CGATAATAAT GAAGT^CA AGGTACATAT

TGGTGTTGTA TTTAAAATCA CTGTAAGTAC GAATGATGTA GAAGCTAAAG AAACAGATAC

TTTACGAATA AAATGGGTTG AAAAAGGCAA CATAGAGTCA TATGATGATT TCGAAACGTG

GAGTGCATTA ATCCTTCAAG ATTTATAATC AAACGAGGT^ ACATATATGT CAGATATTAT

TCCAG

(2) INFORMATION FOR SEQ ID NO: 355:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 668 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 355:

— A,A.Af"^A.A^ TAOGAT^TAA

ACAATTAGAC ACTGAACATT TAGCTGATTT AAATCCAGAG CGTATGATCA TTATGACAGA 180

TCATGCTAAA AAAGATTCTG CTGAATTCAA GAAGTTACAA GAAGATGCAA CATGGAAAAA 240

GTTGAATGCA GTTAAAAATA ATCGCGTGGA TATTGTTGAC CGTGATGTTT GGGCAAGATC 300

TCGTGGCTTA ATTTCTTCTG AAGAAATGGC TAAAGAACTT GTTGAATTAT CAAAAAAAGA 360

ACAAAAGTAA GGTGGAAGTA AATGGCTATA AAAGAAATAA GTAGCCAATC TGCCATAGAT 420

CATPAPiAGAA AAAGACGCAC AACACTCACG TATATAGTGA GTTTGTGCTT TCTTTTTATT 480

TGTATATATT TAAATATGGC GATTGGTTCT TCGAAAATTA ATTTTAGCGA TATCATTCAC 540

TATGTTACTG GTCATACAGA TACGAAAGCA ACGTTTTTAT TGCATAATGT ACGTATGCCA 600

AGGATGATTG CAGGGTTATT TATTGGCGGT GCATTAGCGG TATCTGGTTT GTTAATGCAA 660

GCAATGAC 668

(2) INFORMATION FOR SEQ ID NO: 356:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 787 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 356:

ATACAAAAAA ACATATCGAA AATPiPAGCTJK AAAGAAACTA TCAAGTTCCA TATTCAATTA 60

ATTTAAATGG TACATCTACA AACATTTTAT CGAATCTTTC ATTTTCAAAT AAACCTTGGA 120

CAAATTACAA AAATTTAACT AGTCAAATAA AATCAGTACT GAAGCATGAT AGAGGTATTA 180

GTGAACAAGA TTTAAAATAT GCTAAGAAAG CTTATTATAC TGTTTATTTT AAAAATGGTG 240

GTAAAAGAAT CTTACAGTTG AATTCAAAAA ATTACACAGC AAACTTAGTT CATGCGAAAG 300

ATGTTAAGAG AATTGAAATT ACTGTTAAAA CAGGAACTAA AGCGAAAGCA GACAGATATG 360

TACCATACAC AATTGCAGTA AATGGCACAT CAACACCAAT TTTATCAAAA CTTAAAATTT 420

CGAATAAACA ATTAATTAGT TACAAATATT TAAATGACAA AGTGAAATCT GTATTAAAAA 480

GTGAAAGAGG CATCAGTGAT CTTGACTTAA AATTTGCGAA ACAAGCAAAA TATACAGTAT 540

ATTTCAAAAA TGGAAAGAAA CAAGTAGTGA ATTTAAAATC AGACATCTTT ACACCTAATT 600

TATTTAGTGC CAAAGATATT AAAAAGATTG ATATTGATGT AAAACAATAC ACTAAATCAA 660

AAAAAAATAA ATAAATCTAA TAATGTGAAA TTCCCAGTAA CAATAAATAA ATTTGAAAAC 720

ATAGTTTCAA ATGAATTTGT GTTCTATAAT GCAAGCAAAA TTACmATTAA TGaTTTAAGT 780
(2) INFORMATION FOR SEQ ID NO: 357:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 534 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 357:

^.UVGTAAAAA TAAATCTCCC rTTTTAACTr TCGTTTCTGC CATAGCCATT GCTTCnCTG

XOATAGTTGC TACAATATCT TTrCTTTCAC OGrTAAAA... TTCAACTTGT TCTGCTAAAA

;,I^CAGCTTC TTCTTCGACG TCAGTCATCA ACAATTCGCa AGCTAATGAT GCGTCATCTA

AACGACCTAC AGCATTAAGT CTAGGTCCAA TAATAAAACC AATrGTTTCT TCATCAATAT 240

TGTCATTGTA TCCCGCTTCT TTTAGCAATG CTTTAACAGA GGTCGGACAT TGATCAriTA 300

AGACTTTTAA TCCTTGTtTC ACTAATGATC GArTTTCATC AGTTAAGGAT A..AAATCCG 360

aU^TGGTACC TATCGCAACT AATGCriTAA AATAATCAGG TACATTTtCA ATCAATGCTT 420

GTGCTAArrr gtatgcaaca cctgovccac acaattgttg gaacggataa ttaaacgatg 480

GATGCATTGG ATGTACGATT GCATATGCTT CTGGTAATGT ACTACCAATT TCAT 534

(2) INFORMATION FOR SEQ ID NO: 358:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 3621 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 358:

C^GTGAGTCAA A^TAAATGAA TCTAATAAGT CATAACTATC TArTTGTAAT GTGCAACGCT

XAACCCATAT ACAAAATGAA TGTGCTGATA ATGATTTACr CAAATTAAAA GGTGAITTTT

.TTCAATGAT GAATCAAAGT TGCCrrrTTA mrrGGTAA AAGTTAATGC GTCAGTGAAT

TOTGTAAGTT TTTCAAAAAG TAAAAAGAAA TAATAAAGGT GAATTATTAG AATTCCArAA

ATAATTCATT ACATTCATAA AGCATTTTAC AAATGGTAAG AAAATGAGTG TTACAAATCT

^XATTGCA AAAGAAGCTG ATTTAGTCAC AAAAAATGTC CTATGTAATA ATTCGAGAAA

OATGCACTAT ATACGGTCTT CrrACTATTC AAAI.TAAAA GrTGCrTATr TGCGTGGCrr

_ _ TAATA-ATCT TGTAGAGAAC AATGAAATGA

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ATAGCAAACT GTATTACTTT GATACAAAAA TGGTTGTAAT AAATATTTAT CGATATGACG 6 00 

ACTTGAATAT GATAAAGTGA CATATTTATG TATATGACTA TTTCGCAAAA TGTAATCGAG 66 0 

GTAGAATTTC TTGACAATTC TGTCAGTTTA TAAGATGTTA TAAATATGTA GTGTATAAGG 72 0 

AGGCAAACAA GATGACTGAA GAATTCAATG AATCAATGAT TAACGATATT AAAGAAGGTG 7 80 

ACAAAGTCAC TGGCGAGGTA CAACAAGTTG AAGACAAGCA AGTTGTTGTT CATATCAACG 84 0 

GTGGTAAATT TAATGGGATT ATTCCTATTA GTCAACTATC TACGCATCAT ATTGATAGCC 90 0 

CAAGTGAAGT TGTAAAAGAG GGCGACGAAG TTGAAGCATA TGTCACTAAA GTTGAGTTTG 96 0 

ATGAAGAAAA TGAAACTGGA GCTTACATCT TATCTAGAAG ACAACTTGAA ACTGAGAAGT 10 2 0 

CTTATAGTTA TTTACAAGAA AAATTAGATA ATAATGAAAT CATCGAAGCG AAAGTAACAG 10 80 

AAGTAGTTAA AGGTGGTTTG GTTGTTGATG TAGGACAAAG AGGTTTTGTT CCGGCTTCAC 114 0 

TAATTTCAAC AGACTTCATT GAGGATTTCT CTGTGTTTGA TGGACAAACA ATTCGTATTA 1200 

AAGTTGAAGA ATTGGATCCT GAAAATAATA GAGTCATTTT AAGCCGTAAA GCAGTTGAAC 12 60 

AAGAAGAAAA CGATGCTAAA AAAGATCAAT TATTACAATC TTTAAATGAA GGCGATGTTA 13 2 0 

TTGATGGTAA AGTAGCGCGT TTAACTCAAT TTGGTGCATT TATAGACATT GGCGGTGTTG 13 80 

ATGGTTTAGT GCATGTATCT GAACTTTCTC ACGAACATGT TCAAACACCA GAAGAAGTAG 144 0 

TTTCAATTGG TCAAGATGTT AAAGTTAAAA TTAAATCTAT TGATAGAGAT ACAGAACGTA 1500 

TTTCATTATC AATCAAAGAT ACGTTACCAA CACCTTTCGA AAATATTAAA GGTCAATTCC 156 0 

ACGAAAATGA TGTCATTGAA GGTGTCGTAG TAAGATTGGC AAACTTTGGT GCATTTGTTG 162 0 

AAATTGCACC AGGTGTACAA GGACTTGTAC ATATTTCTGA AATTGCACAC AAACACATTG 16 8 0 

GTACGCCAGG TGAAGTGTTA GAACCTGGTC AACAAGTAAA TGTTAAAATA TTAGGTATTG 174 0 

ATGAAGAGAA TGAAAGAGTA TCACTATCTA TTAAAGCAAC ATTACCAAAC GAAGATGTTG 1800 

TTGAAAGTGA TCCTTCTACG ACTAAGGCGT ACTTAGAAAA CGAAGAAGAA GATAATCCAA 1860 

CAATTGGCGA TATGATTGGT GATAAACTTA AAAATCTTAA ACTATAATTT AATATTTAAT 192 0 

AGTCAACTCC ACATGTTTAT GATTGcATGT GGAGTATTTT TATGTAACAA AATATACTCG 1980 

GAATGATAAC GTGGgACAAA TTTAACTAAG TGTTTAAAAA GATArAGTTT TAAGTGCtGa 204 0 

tTTTTATCAT TACAGTAATA AACTCATTTT GAATACACAG TCTCATGTGA TATTATTAAA 2100 

AAGATATaAG AAAGAGAGGA AGTTAGCTTA TGACTAAACC TATAGTAGCT ATTGTAGGTA 2160 

GGCCTAATGT AGGTAAATCT ACAATTTTTA ATAGAATAGT TGGAGAACGT GTTTCGATTG 222 0 

TGGAAGACAC GCCAGGTGTA ACACGAGATC GTATTTATTC TTCAGGTGAA TGGTTAACAC 2280 
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AAATTAGAGC GCAGGCAGAA ATCGCCATAG ATGAAGCGGA TGTTATTATT TTTATGGTTA 2400

ACGTGCGTGA AGGATTGACA CAAAGCGATG AAATCGTCGC TCAAATTTTA TACAAATCTA 2460

;^CCGGT CGTATTAGCG GTTAACAAAG TAGATAATAT GGAAATGCGT ACAGACGTGT 2520

ATGATTrCTA TTCATTAGGA TTTGGTGAAC CGTATCCGAT ATCAGGGTCA CATGGTTTAG 2580

GTCTTGGTGA CITGTrAGAT GCAGrrGTTT CTCATTTTGG TGAAGAGGAA GAAGATCCTT 2640

ATGATGAAGA TACAATTCGA CTATCCATTA TTGGACGACC AAACGTAGGT AAATCAAGTT 2700

TAGTAAATGC TATrTTAGGT GAAGATCGCG TTATCGTrTC TAATGTTGCA GGGACAACGA 2760

GAGACGCTAT TGATACAGAG TATAGriATG ATGGACAAGA TTATGTTTTA ATCGATACTG 2820

CTGGTATGCG TAAAAAAGGA AAAGTATATG AATCAACTCA GAAATATTCA GTATTAAGAG 2880

CTTTAAAAGC GArTOAACGT TCAAATGTTG TTTTAGTGGT CATAGATGCA GAACAAGGCA 2940

TCATTGAACA AGATAAACGT GTTGCAGGAT ATGCACATGA ACAAGGTAAA GCAGTCGTGA 3000

TTGTCGTAAA TAAATGGGAT ACTGTGGAAA AAGATAGTAA AACGATGAAG _ — i tMi tm r^ "A Ti./^ i\t\ti. X J. J. 3060

;,TGAAGTACG TAAAGAATTC CaATTTTTAG ATTATGCACA AATTGCTTTT GTGTCTGCTA 3120

AAGAACGCAC AAGATTACGT ACATTATTCC CTTACATCAA TGAAGCAAGT GAAAACCATA 3180

;^CGTGT TCAAAGTTCA ACTTTAAATG AAGrTGrTAC TGATGCAATT TCCATGAACC 3240

CTACACCAAC AGAO^GGT AGACGTrTGA ATCTCTrTTA TGCAACACAA GTTGCTATAG 3300

AACCACCGAC ATITGTTGTA TTTGTTAATG ATGTAGAATT AATGcATTTT TCTTATAAAC 3360

GCTATITAGA GAATCAAATC CGTGCCGCTT TrGGrmGA AGkTACACCA ATTCATATTA 3420

TAGCrCGAAA GAGAAATTAA CGATTGGGGG ATAACAATGA CTAAAATTAC CGTTTTTGGT 3480

;,TGGGAAGTT TTGGGACAGC CCTTGCCAAT GTTCrTGCAG AAAATGGACA TGATGTTTTG 3540

ATGTGGGGTA AAAATCAAGA TGCTGTTCAT GAATTAAATA CATGTCATAC AAATAAAAAG 3600

TATTTAAAAT ACGCGAAATT A 3621

(2) INFORMATION FOR SEQ ID NO: 359:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 643 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 359: 

^^^.^n^^r ^,nrr-AATTA CCAAGGGAAC TAnTGGCACC 
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AATGCtTCTT TCATCTCCAT GCCCTGTTGC TCATTATTAA TAACACGGTC TATTAACACA 180 

ATGGCATTTG TTACTACGAT TCCAATTAAC ATTAGCATAC CAATTAAACT TGGTACTGAT 24 0 

ATTGTTTCTC CTGTGATTAA TAGTGCAATA ATTACACCGA TAACTGTAAA TGGTAAAGAG 3 00 

AATAAAATTG TAAATGGTGC TAGGCCACCT TTAAATGTAA TAACTAGGAT TAAATATACG 36 0 

ATAATGATTG CAGCTAACAT TGCAAAGGCT AATTGTGTCA TTGCATTGTT AATATCATCT 42 0 

GATGCACCAC CGATATTAAC CTTTACATTA TTCGGTTTAT CCAAATTATT TATTTTAGAC 480 

ATCACTTGTC GTGTTGTGCC ACCCACATCT TTATTTGTTA CTTTAGCAGA TACCGTCGTT 54 0 

GCATAATCTC CTTGTTCnTG CGTCAATTTA CTTGGTGTCG TTGTTTTAAC TAACGTAGCG 6 00 

ATATCTCCCA ATTTAATCGT ACCACCAGTC GGCTTTTTCA AAG 64 3 
(2) INFORMATION FOR SEQ ID NO: 360: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2524 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 360: 

TTTCAGGCyG TGCAAGGGCT TTTTCTTTTG CTTTTAAGAT TATGATTTAT CGTGCAAAGT 6 0 

TAAGTGGTCG TATATAGTTT TAGTTTTAAA AAGGTAATTA AATAAAATAG TTTGCCGAGG 120 

GAGATGTCAA AATGATTAAA ATACCTAGAG GGACGCAGGA TATTTTACCT GAAGATTCAA 180 

AGAAATGGCG TTACATTGAA AATCAATTAG ATGAATTAAT GACATTTTAT AATTATAAAG 24 0 

AAATAAGAAC ACCAATTTTT GAAAGTACAG ATCTTTTTGC AAGAGGTGTT GGTGATTCAA 3 00 

CCGATGTCGT ACAAAAAGAA ATGTATACAT TTAAAGATAA AGGCGATAGA AGTATTACAT 360 

TAAGACCTGA GGGAACAGCT GCAGTTGTGC GTTCATATAT TGAACATAAA ATGCAAGGTA 420

ATCCAAACCA ACCAATTAAA CTTTATTACA ATGGACCGAT GTTTAGATAT GAACGTAAGC 4 80 

AAAAAGGACG CTATCGTCAA TTTAATCAAT TTGGTGTAGA AGCTATTGGT GCTGAAAATC 54 0 

CTAGCGTAGA TGCAGAAGTA TTAGCTATGG TTATGCATAT TTATCAATCA TTTGGATTAA 600 

AACATTTAAA GCTTGTTATT AATAGTGTAG GGGATATGGC GTCTCGAAAA GAATATAACG 66 0 

AAGCGTTAGT GAaACACTTT GAACCAGTAA TTCATGAATT TTGTTCAGAT TGTCAATCAC 72 0 

GTTTGCATAC AAATCCGATG CGAATTTTGG ATTGTAAAGT AGACCGTGAT AAAGAAGCGA 78 0 

TTAAGACTGC ACCTAGAATC ACTGATTTCT TAAATGAGGA ATCTAAGGCA TATTATGAAC 84 0 
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GTGGATTGGA 
ATGGTGcCAT 
ATGGTCCAAG 
TTGAAGAAGA 
TGGGTGATCA 
TTAAAGCAGA 
GTTTAGGTGC 
ATGTTAAAAA 
ATTTTAAGAA 
TTACTGAGGC 
ACCTTGGTGG 
TTAATCCTGC 
TTGTAGAAGT 
AAACTGGCCA 
CACCATTTTC 
GTTATTTAGA 
CACGTTCTAT 
TAACGAAGTC 
GTGAATTTTA 
GATTTGACAA 
GTCAACCTGA 
TGCAAATGGG 
ATGGCGCTTT 
CAGATACACG 
TTAAAGTATT 
GTGCAGCTGa 
ymTGGtGCTA 
ATGG 



TTATTATACA 
TACAACGCTT 
TGAAACAGGT 
AGGTATCGAA 
AGCAGATCGA 
TAAAGACTAT 
CAAGTTTACA 
TATGACAACT 
GTAGAGAGGG 
ATTTTTAGGA 
ATtGATTTtC 
ATTTTCAGAA 
TCAAGGTAGA 
AGTTGAAGTA 
TATAAATGAA 
TTTACGTCGT 
TCGTCAATAT 
AACACCTGAG 
TGCATTACCA 
ATACTACCAA 
ATTTACACAA 
TGAAGAAATG 
CCCACGCATG 
TTTTGAAATG 
TAAAGATACT 
ACAATATACT 
AgGCTAGCGT 
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CATACAGCAT TTGAATTAAT GATGGATAAC CCTAACTATG 
TGTGGTGGTG GCCCrTATAA TGGTrTATTA GAATTGCTAG 
ATTGGmTG CGCTAAGTAT AGAACGATTA TTGCTTGCAC 
TTAGATATTC AAGAAAACTT AGATTTATTC ATTGTTACAA 
TATGCTGTGA AGCTATTAAA TO.T-TGAGA CATAATGGTA 
TTACAGCGTA AAATTAAAGG ACAAATGAAA CAAGCAGACC 
ATCGTTATTG GTGATCAAGA ATTAGAAAAT AATAAAATCG 
GGTGAATCTG AAACAATTCA ATTAGACGCA TTAGTCGAAT 
CGTTAAAATA TGAGTAAGAG AACAACTTAT TGTGGATTAG 
CAAGAAATTA CATTAAAAGG ATGGGTTAAC AATCGTCGTG 
GTTGATTtAA GAGATAGAGA AGGAATTGTA CmAGTCGTGT 
GAGGCaTTGA AAATTGCTGA AACAGTACGT 
GTTACGAAGC GTGACCcTGA AACAGTTAAT 
CAAGTTACAA ATATTAAAGT GATTAATAAA 
GAAAATGTTA ACGTTGATGA AAATATTCGA 
CAAGAGTTAG CGCAAACATT TAAAATGAGA 
TTGGATGATG AAGGGTTCTT TGACATCGAA 
GGTGCACGTG ACTATTTAGT ACCATCTCGT 
CAATCACCAC AATTATTTAA GCAATTATTG 
ATCGTAAAAT GCTTCCGTGA CGAAGATTTA 
GTCGATATTG AAATGAGTTT TGTAGACCAA 
CTTAAAAAAG TTGTTAAAGA AGTTAAAGGC 
ACATATAAAG AAGCGATGCG TCGCTATGGT 
GAATTAATTG ACGTTTCTCA ATTAGGACGT 
GTTGAAAATG ATGGTGAAAT TAAAGCAATT 
CGTAAAGaTA tGGGaTGCTT TAACAGAATT 
GGGGTTAAAG TTGTGGGAAG GTGGTTTTGA 



CCTAAAATTA 
TCTGAGACAC 
TTAAAATACC 
CATCAAATTA 
ACACCAGTAC 
GTTCATGATG 
ATGATTAGTG 
CGTGCAGATC 
GAAGATGTGA 
GTTGAAATTA 
TCTGATAAAC 
GATATGGACT 
GTCGCTAAAG 
TGTaAACaTC 
CAAGGTCCCA 



960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2524 
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(A) LENGTH: 1507 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 361:

TCGTTGAGTA AAAGTCCAGA AAATTGGATG AGTAAACTTG ATGATGGAAA ACATTTAACT 60

GAGATTAATA TACCGGGTTC ACATGATAGT GGCTCATTCA CTTTAAAGGA TCCAGTAAAA 120

TCAGTTTGGG CAAAGACTCA AGATAAAGAT TACCTTACCC AAATGAAGTC GGGAGTCAGG 180

TTTTTTGATA TTAGAGGTAG AGCAAGTGCT GATAATATGA TTTCAGTTCA TCACGGCATG 240

GTTTATTTGC ATCATGAATT AGGAAAATTT CTCGATGATG CTAAATATTA CTTGAGTGCT 300

TATCCAAACG AAACAATTGT GATGTCTATG AAAAAGGACT ACGATAGCGA TTCTAAAGTT 360

ACGAAGACAT TTGAAGAAaT TTTTAGAGAA TATTATTATA ATAACCCGCA ATATCAGAAT 420

CTTTTTtACA CAGGAAGTAA TGCGAATCCT ACTTTAAAAG AAACGAAAGG TAAAATTGTC 480

CTATTCAATA GAATGGGGGG TACGTACATA AAAAGTGGTT ATGGTGCTGA CACGTCAGGT 540

ATTCAATGGG CAGACAATGC GACATTTGAA ACGAAAATTA ATAATGGTAG CTTAAATTTA 600

AAAGTACAAG ATGAGTATAA AGATTACTAT GATAAAAAAG TTGAAGCTGT TAAAAATTTA 660

TTGGCTAAAG CTAAAACGGA TAGTAACAAA GACAATGTAT ATGTGAATTT CTTGAGTGTA 720

GCGTCTGGAG GCAGCGCATT TAATAGTACT TATAACTATG CATCACATAT AAATCCTGAA 780

ATTGCAAAAA CGATTAAAGC AAATGGGAAA GCTAGAACGG GTTGGCTGAT TGTTGACTAT 840

GCAGGATATA CGTGGCCTGG ATATGATGaT ATCGTAAGTG AAATTATAGA TAGTAATAAA 900

TAAGGATTCA ATAATGATAT TAAGACGAGT ATGAAAATAG TTAGATTCTA ATTATTTTCA 960

CTACTCGTTT TTATTTTGAA AATAAGTAAT AATTCAACAA TATTATAAAT TGAACAGATT 1020

GTTTGTGAAA TTTTTGATAA TATTAAAGTG AAAAAGTGTT ATAAATTGAT AAATATATGT 1080

AATTAACAAA AACAAATCAT TTTAAAAAGA AGAGAGTTGT AAGATGATGa AACGATTAAA 1140

CAAATTAGTG TTAGGCATTA TTTTTCTGTT TTTAGTCATT AGTATCACTG CTGGTTGTGG 1200

CATAGGTAAA GAAGCGGAAG TTAAGAAAAG CTTTGAAAAA ACATTGAGTA TGTACCCTAT 1260

TAAAAATCTA GAGGATTTAT ACGATAAGGA AGGCTATCGT GATGATCAGT TTGATAAAAA 1320

TGATAAAGGT ACATGGATTA TAAATTCTGA AATGGTTATT CAACCTAATA ATGAAGATAT 1380

GGTAGCTAAA GGCATGGTTC TATATATGAA TAGAAATACC AAAACAACAA ATGGTTACTA 1440

mTATGTCGAT GTGACTAAGG ACGAGGATGA AGGAAAACCG CACGACAATG AAAAAAGATA 1500
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(2) INFORMATION FOR SEQ ID NO: 362: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1216 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 362: 





TACATGTTTC GGATGCTACT TTATTTAGTT TGAAGGGTGC ATTATGGACG TTAGCGCAAG 60

AAGTTTATCA AGAATGGTAT TTAGGA 1 L-ijA AfJATfiTTGAA AAGAAAATAG 120

CACGAACTAC TTTTAAGACA GGTTATATTT ATCAAGAAAT T ^ TT'T^Tfl A. n 2X 1 n 1 ± ± 1. \JT\<xn. CCAGTAGATG 180

AAGTTAAGGT ACTTCTGAAT GATTTAAAAG GTGCTGGTTT ATTGCAACAG 240

GTCGTCCTTA TACTGAGACT GTTGTGCCAT TTGAAAATTT CCATATTTTG 300

AAGCTGATTT TATTGCAACA GCAAGTGATG TTTTAGAAGC AGAGAATATG TATCCGCAAG 360

CACGACCATT AGG AAAGC L AATCCTTTTA GTTATATCGC AGCTTTATAT GGTAATAATC 420

GCGATAAATA TGAATCTTAT ATCAATAAGC AAGATAACAT TGTAAATAAA GATGACGTAT 480

TTATAGTAGG GCTGACTTAT TAAGTGCTCA AAAAATAGGT GCAACGTTTA 540

TTGGAACATT AAAGGTAAGG ATGCTGCAGG TGAGTTAGAA GCGCATCATG 600

CCGACTATGT TATTAATCAT TTAGGTGAAC TTAGAGGTGT ACTAGATAAT TTGTAA 1 1 1 660

ATTGTTGTTT GACAGCATAA CTTGTAGTGA ATGATTGAAC CAAAGGTTTC /\ X A 1 X Ot/\w X 1. 720

ACAATGAAAT TAATAATGAA AAAATGCCAA GAAGCAATGG AAGTAATCCA r\ X VJ ^ V« X X A X 780

GGCATTTTGA ATTTACATAA ATTGTTTATG ACTGTACCGT CAATTCAGTT mYSAAAATTT 840

GATTGTATTC ACCAACTTGT TTAAGTTCAT CAATTATATT GTTTGAAACA GGTTGATCAA 900

CGGATAAAAT CATTAGCGCA TCTCCGCCCG CTTCAGTTCT ACCTAAAGTC ATAGATGCAA 960

TGTTGATATT GTATTTACCr AACAATGCGC CAGTTTTTCC TACCATACCT GGAGTATCAT 1020

TATGATATGA CACAATTTGA TATTGATTTG GCTTTAAGTC TACAGAAAAA TTATTAATTC 1080

TAACAATTCT TGGACCGAAA CCTGTAAAGA CAGAAGCGCC AACTTTAACG GAATCGCnAT 1140

CGnTTGATAG TTCTACCTCT AAGTAGTTAC TAAAACCTGT CTCTGCTTTA TTATTTTCAA 1200

TATTTAATGT CACTTG 1216


(2) INFORMATION FOR SEQ ID NO: 363:

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(C) STRANDEDNESS: double
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 363: 

KTCAPiPJ^HKV TTTPJKATPiAT ATTGATGGTC ACATTGTAAA TrmAATAGAA AATAAATTTG 60

ATCAAATATT ACAAGAACCA TTAAATCCAT TAAATTATGA TACTGTCAGT GGATTAGCTG 120

GGATAGGGAG ATATTTGCTA AATAGAGTAG ATGAGAATGA ATTTAATGTT AAAGCATTAA 180

AAAGCATATT AGTATACTTT AAAGATATTC AATATTCTAA AAATAGCTGG GTAGTCCCAC 240

AAGArAGTCA ATTTTTAGAG TCTGATAAAA ATTATTTTAC TGAAGGTAAT ATCAATCTTG 300

GCCTTGCACA TGGaGTGCTA GGACCGATGT CTTTATTTGC ATTATGCGTG ATTAAAGGAA 360

TTACGATTGA AAATCATCAG CACATATTAA AAGACATGTA CAAATTTATC ATGGACGAAA 420

AATTTTGTAA CCACGAAAGA TGGTTGCAGC GTTACGATTT AATTTCTGAA CGTAATCATT 480

TCAATTTTAT TCGGAATGGT TGCTCTTATG GCAATACGGG TGTAATGACG ACGTTGTTTT 540

TAATCGGCCA AGCATTACAA GATGATGAAA TAATTAAAAT GTCTAAAAAA GTGATGCTAC 600

AAGTAGTAAA TGATAAAGAT GAAAATTTAA TAAGTCCAAC TATTTGTCAT GGATTGTCAT 660

CACAAATATT AATGTTAACA ATTATGAATT TGAATTTTGA ATTAAATGAA GTGTCTGATT 720

ATATCACTGT ATTAATAAAT AAACTGATTT CTCATTATAA GGAAGATTAT CTGGTGAATT 780

TTATAGACAT TAATGAAAAT AAGCAAGATG TATTTAAAAG TAGGAAAGTT GGCCTTTTAG 840

AAGGTGAATT AGGGGTCATC TAACATT 867

(2) INFORMATION FOR SEQ ID NO: 364: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10813 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 364: 

TACCTTTTCT TtTAAATCAT TTTATATTTT CCCaCTAATA TCCGcTGtTA ATCaATCCtG 6 0 

ACATCCtTGT ATCaCTATGA CAATTAATTG TTAAATACAT GAATTTCTAC ATTTTATGAA 12 0 

AAAATCCATT TTTATTACAA TTCAACACTT TATATGACAA CTTCATTACA GTTACTTTTA 180 

TTGTTGATTG CTTACATTGT TTTCTAAAAA AAATTTGTTA TCATAATTAA CGTTGAATAA 24 0 

AGAAAAAAAT TAACTTGGGA GATAAAAATG GAATATAAAA AGATACTAAT TCGTTTATTA 3 00 
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CAATCGCATG CCGCAGTTAA TTATTATAGT AAAAACCAAT GTACATGGTG GGCATTTAAA 4 20 

CGTCGCGCAC AAGTCGGTAA ACCTGTTTCT AATAGATGGG GCAATGCTAA AAATTGGTAT 4 80 

TACAATGCAC GTPAArCPiPJi ATATGCGACT GGTCGTACAC CAAGAAAATT TGCTGTCATG 540 

CAATCAACTG CAGGATATTA TGGACATGTC GCAGTTGTTG AACAAGTATA TAAAAACGGT 600 

AGTATTAAAG TTTCAGAATA CAACTTTTAT CGCCCATTAA AATACAATAC ACGTGTACTA 660 

AGCAAAAAGG CAGCACGTAA CTTTAACTAT ATTTACTAAT CAAAAAACTT CTATCACGAA 720 

CGCTTCAATT TCCTGTATGC GTGTGATAGA AGTrTTTATT TTATGAAATT ATATTATTAC 780 

TTCTACAAAT TTCAAATTGC CGTAATTGAA CGTATATTTC TTCTTCAACT ATTATTTCAT 84 0 

CTTTAGCATA ATCTATATAT AAAATTTTAT GCTATTATTT AAATAATTCG CTATAACTTA 900 

ACATACGTTT TCGATATAAA CCTTGTTCTA AATCTCAATA ATTTTTTGCT GTTTTCATCG 960 

TCATTAGTTA AAAAAATAAT TTAACTGAGT TTTTGAGACC TGTTTAAATT GATGACTCAA 102 0 

AACCTTTATC CCCTTTTCAC TCGGTTTAAT TGCTTTAATA TTTAACACAG TCTCATT-QTA 1080

ATTTTGCTAC TAAGTTTGAA ATATTTCGAT TCX3AATGTGA AcATCATTTA TTATTACTTT 114 0 

CGACAGCTAG AAAAATATTG TTAAACCAAA ACTATAATTA CCACTTTTAT ATAGAACTAT 120 0 

ATATAAATTT TAACTTTGAG ACAGAACTGC TAGTCAGGTT TATGAATATA TTTCTTTAGT 126 0 

TTACTTGaTA TACTTATTGG TAAATCATTA TTTATTTGAA GAGCATTTAT AACAAAAAGT 132 0 

ATAATCCCAG TTATAGCGAT TCCATGACCA TTTTTATCAG TCAAAGATTC ATCAACAAAA 13 80 

ACTTTTTGAT AAATTTATAT ATTTGTATAA TTTATTATGG TAGATATCTA CACCCGTATC 144 0 

TATAACAGCT ACAGTTATCA TTGCGTCTTA TCCAAATAGT TTTTAAGAAA TAAATATAAT 1500 

TCATTAATAT CATGGCTATT TGTAACGTCA GtGCTTaAGA CCTTGTCAAT TATTGTTTTA 1560 

TCTGTTTCTA AGAGCCCCTG ATTTTCTGCA TACATATTTT GTATATCGCT GTTAATCTCA 1620 

TTTAATTTTT TAAATTTTTT TCTAACCTTT TTCAAAATAA AATGCTCTTT AATTCCATCT 168 0 

CTTAAATACA TATAAATCTC TATTATAATC TCCATAAATG TCATATCTCC TCTCTCAAAA 174 0 

AACATTATAC TCAATATATT AAAAATAAGA AAGGTTTTTT CAGAATTTTT AGTATTTTCA 18 00 

GATATTTATT CTTAAGTATC TTATTATTAC TTCTTTTTAA CGCTAGTGGa ATAATTAATA 1860

ATAATTGTAT TTAAATGCAA TGCTGTTGTT CTTTCATACT TACAAGCAAG TCATAAGAAA 192 0 

TGAGAAATTA AATTCTTATA TGATGAATAA ATGATGACTG CAGAAGAAAT TGCTAAAAAA 19 8 0 

GGAGTGGAGT GAGGCATTCT ACTGTTCATA TTGTTAACAA ATATCAAAAT GATAGGAACT 204 0 

r.---- AA^.A^TOTTA AAATCGAGAC AAGAAGAAAC TCGTTCCAAA TTTGAATCCT 2100 
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AAAACTTCTA TCACGAACGC TTCAATTTCC TGTATGCGTG TGATAGAAGT TTTTATTTTT 222 0 

GTTCATATTA ATTTATCTAA GCGCTACGAT GGaCTGACTT AAACTTTTCT TTAAAATCGC 22 80 

TATTCGCCAT TTACTATTGT TGTCTAATTT CTTGTAAAAT ATGTTCCGCT GCTTGTGTAT 2 340 

TTGCACGGGG TTCTTTTTTC AAAGCTTCAG CTACTTTAGC AATTTCATCA CCTTTTGCCC 2400 

CTACAACGAT AGCTAATGAT TTATATTGTA AACTCATATG ACCTTGTTGA ATACCTTCTG 24 6 0 

ACACAAGCGC GCGACATGCT GCAAAGTTTT GCGCTAAACC AACGGCAGCA ACTACATGAC 252 0 

CTAATTCTTG TGCTGACTCT ACATTTAGTA GCTCTAATGA AGCTTTAGCA ATTGGTAATA 258 0 

CTTTTGTACC ACCGCCAACG ATTGCCAATG TCATAGGCAC TTCAATTGTA CCAATCAATC 2640

GTTGACGATC TTGATCGTAA CGCCATGTAG CAATACCACG ATACTGTCCG TCACGACTCG 2 700 

CGTATGCATG CGCACTTGCT TCTGCACCAC GCGTATCATT TCCTGTTGCT AAAACAACAG 27 60 

CATGTATGCC ATTCATAACA CCTTTATTAT GTGTTGCTGC ACGATGAATA TCTACTTGGG 2 82 0 

CCAATACAGA AGCACGTTCC ATTCGTTTGG CAACCTCTTC TCCAGTTCTC TCGCCCCTTG 28 8 0 

CTAAATCTTT AACATCAATT TCGCCTTGAA CTTTAACAAC GGACGCTGTT GCATGATTGG 2 94 0 

ATAAAATACT CATTAAAATG TCGCTTTGCG GAAATTCATT TTTTAAAAAT GCAGTTATGG 300 0 

CCTCTAAAAT CGTATTAAGC ATATTAGCGC CCATAGCATC TTTCGTATCA ACAAATACTT 306 0 

TTAAAGATAG TAACTGTTGC TCAGGAAATG TATCAATCGC TATACGTTGG TAACCACCAC 312 0 

CACGCGCTTT AATAGAAGGA TATGCCTCAT CCGCAATTTT ATGAATTTGC TTTTCTAAAG 3180 

CTTTAATGTC TGCTGATAAT TTTTCAGTAT CGTCAACGCC ATCAAAGACG ATTTGACCTA 3 24 0 

TCATAATACG TTCAGAAGAT ACCGTTTTAA ATCCGCCAGT CTGATTCACT AGCTTTGCAC 3 3 00 

CATAACTAGC TGCAGCGACA ACTGAAGGCT CTTCCACCAT CATAGGTACA ACATATGCCT 3360 

TATCGTCCAC AATGATATTC GGTAATAATC CAACGGGTAA TGCACCTTGC GCGATGACAT 3420 

TTTCAATTAA ACTATTGGCT ACTTCTTCAT CGATTAATGG ATGATTCAGT AAAATGTCGA 3480

ATTGTTCTTC TGATAACCAT TGCTTATCAA CCAATTGTTG TAACTTTTCT TTACGAGATA 3 54 0 

AATGTCGAAA ATTCTTATCT AAATTTTGCA TGGACGTACT CCTTTTACTT CACATAATTT 3 600 

TTAACATTTC AATCACTACT ATTTTTACCA CAAAATAACG TCATTCGTCT TAAAATTCAA 3 660 

TTGAATAATT GTCGTTTTGA CTTTAAAATA AAACAAGGTA AATTAAAACG CTTACAAGAA 372 0 

ACGACAAATc ATTTTTAAAT TTAGTATATT TCTTTGTATA AAATTAGCAT ATTCTGATAT 37 8 0 

GATACAAGTG TTGCTTTTAT AAATTTGAAA GGATGTAAAA CCTTATGACA ATAGGTATCG 3 84 0 

ATAAAATAAA CTTTTACGTT CCAAAGTACT ATGTAGACAT GGCTAAATTA GCAGAAGCAC 3 9 00 
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CTGTAAACCA AGACATCGTT TCAATGGGCG CTAACGCTGC TAAGGACATT ATAACAGACG 4 020 
PJKGACAPJKAA GAAAATTGGT ATGGTAATTG TGGCAACTGA ATCAGCAGTT GATGCTGCTA 4 080 
AAGCAGCCGC TGTTCAAATT CACAACTTAT TAGGTATTCA ACCTTTTGCA CGCTGCTTTG 414 0 
AAATGAAAGA AGCTTGTTAT GCTGCAACAC CAGCAATTCA ATTAGCTAAA GATTATTTAG 42 00 
CAACTAGACC GAATGAAAAA GTATTAGTTA TTGCTACAGA TACAGCACGT TATGGATlXiA 426 0 
ATTCAGGCGG CGAGCCAACA CAAGGTGcTG GCGCAGTTGC GATGG1TATT GCACATAATC 4 32 0 
CAAGCATTTT GGCATTAAAT GAAGATGCTG TTGCTTACAC TGAAGACGTT TATGATTTCT 4 380 
GGCGTCCAAC TGGACATAAA TATCCATTAG TTGATGGTGC ATTATCTAAA GATGCTTATA 44 4 0 
TCCGCTCATT CCAACAAAGC TGGAATGAAT ACGCAAAACX5 TCAAGGTAAG TCGCTAGCTG 4 500 
ACTTCGCATC TCTATGCTTC CATGTTCCAT TTACAAAAAT GGGTAAAAAG GCATTAGAGT 4 56 0 
CAATCATTGA TAACGCTGAT GAAACAACTC AAGAGCXTTTT ACGTTCAGGA TATGAAGATG 462 0 
CTGTAGATTA TAACCGTTAT GTCGGTAATA TTTATACTGG ATCATTATAT TTAAGCCTAA 4680
TATCATTACT TGAAAATCGA GATTTACAAG CTGGTGAAAC AATCGGTTTA TTCAGTTATG 4 74 0 
GCTCAGGTTC AGTTGGTGAA TTTTATAGTG CGACATTAGT TGAAGGCTAC AAAGATCATT 4 800 
TAGATCAAGC TGGACATAAA GCATTATTAA ATAACCGTAC TGAAGTATCT GTTGATGCAT 4 860 
ATGAAACATT CTTCAAACGT TTTGATGACG TTGAATTTGA CGAAGAACAA GATGCTGTTC 4 920 
ATGAAGATCG TCATATTTTC TACTTATCAA ATATTGAAAA TAACGTTCGT GAATATCACA 4 980 
GACCAGAGTA GTCGGTGTAT TTAAAACACA TATAATAAAA CCTAAAAGCA GCAGTAAGAC 5040 
CACTTCTAAT TGAAATCGTC TTACTGCTGT TCTCTATTTA TAACACTTCG TATTGAATGA 5100 
ATTCATTATG CCTATTTGAC ACATTATTGA AGTTTTCCTA ATGCCTGGAT CCTTTATACG 516 0 
TTACGGCTTC GTGCTATGTT TTGGTACATA AAGCTTTGAC ATATCGATAT TCTCCAACTC 522 0 
TAACAGCTTA ATTTTATTAT TAATCGTTCC ACCGAACCCT GTTAAGCTAC CCGTTTTACC 5280 
GACAACACGA TGACATGGCA CGATAATAGA TAATGGATTA CTTCCGACTG CACCTCCAAC 534 0 
CGCTTGGGCT GACATTTTTG GCTTGTTAAG CAGCTTGCCT ACTTTnTGG CAATAGCACC 54 00 
ATACGTTGTT AGAGTCCCAT AAGGAACCTG TCTTAATTCA TTCCAAACAC ACTGTTGAAA 54 6 0 
ATGACTACCT GTTGGCTTTA AAGGTATTGT GATTTCAGGA TTGTCACCTT TAAAATACGC 5 52 0 
GTCTAACCAC TGTGTCGCCT CTCTAAATAT CGCTAAAGAC GTATTTTCTT CCCTAGTACC 55 8 0 
ATCACCTTGT TGATTTTCAA ACAAAACAGC GGTCAGACTT ACCCCATCAC TCAAAAGTTC 564 0 
- -^-Aa- -ATA^.TAACT CTTATACTCC ATAAAAATTC CCCCTTTTTC 5700 
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ATAAGTCGTC AATTACGTAT ATAAACACGT AATACCAGCT ATCACTTTGC TGCAATATAC 582 0 

AGTTACATAT CTTACTACAC GTGCTAACCT CTTACTTTGT AAACCAAATC TTAAATTAAA 5880 

ATATTGAAAA TGCAATGAAT CCTTAATATT TTATTAAACC TATAATTACT TATTAAAAAT 594 0 

AACACACAAT ATTCATAAAG TTTTAAAAAT ATTCTGTTTT ATCACCTACT ATTAGTGGAA 6 000 

AAGTACAATT GCAATTGTAT ATAGTTTGCA TAACGCTTCA AAAGTAATTT CTTTTTTGTT 606 0 

TAGTTCAAAA AAATTTAGAG GTGATGTTAT ATGAATAACG GTTTTTTCAA TAGCGACTTT 612 0 

GATTCAATTT TTCGAAGAAT GATGAAAGAT ATGCAAGGTT CAAATCAAGT CGGAAACAAA 618 0 

AAGTACTATA TTAATGGTAA AGAAGTTTCA CCTGAAGAAC TAGCGCAACT CACACAACAA 6240

GGTGGCAATC ACTCTGCTGA ACAAAGTGCG CAAgcTTTTC AACAAGCAGC ACAAAGACAA 63 00 

CAAGGGCAAC AAGGTGGCAA CGGCAATTAT TTAGAACAAA TTGGTCGTAA CCTTACGCAA 6360 

GAAGCACGTG ACGGTTTATT AGATCCAGTC ATTGGTCGTG ATAAAGAAAT TCAAGAAACT 6420 

GCTGAAGTTT TAAGTAGACG AACTAAAAAC AATCCTATAT TAGTTGGAGA AGCTGGTGTT 64 8 0 

GGTAAAACTG CGATTGTTGA AGGTTTAGCA CAGGCAATCG TTGAAGGAAA TGTACCAGCA 654 0 

GCAATCAAAG ACAAAGAAAT TATTTCTGTA GACATTTCAT CATTAGAAGC TGGAACGCAA 66 0 0 

TATCGTGGTG CTTTTGAAGA AAATATTCAA AAATTAATCG AAGGTGTTAA ATCTTCACAA 66 60 

AATGCCGTAC TATTCTTTGA TGAAATCCAT CAAATTATCG GTTCAGGTGC CACAGGAAGT 6720 

GATTCAGGTA GCAAAGGGTT ATCTGATATT TTGAAACCTG CATTAAGTCG TGGTGAGATT 6 7 80 

TCTATTATTG GTGCAACAAC ACAAGATGAA TATCGAAACA ATATTCTTAA AGATGCTGCA 6 84 0 

TTAACGCGCA GATTTAATGA AGTGCTTGTT AATGAACCAA GCGCTAAAGA TACTGTTGAA 6 900 

ATTTTAAAAG GTATTCGCGA AAAATTCGAA GAACACCATC AAGTAAAATT ACCAGATGAC 6 96 0 

GTATTAAAAG CATGTGTTGA CTTATCAATT CAATATATTC CACAACGATT ATTACCAGAT 702 0 

AAAGCAATCG ATGTGTTAGA TATTACAGCA GCACATTTAT CTGCGCAAAG TCCAGCTGTC 7080

GATAAAGTTG AAACTGAAAA ACGAATTTCT GAATTAGAAA ATGATAAACG TAAAGCAGTA 714 0 

AGTGCTGAAG AATPiTPJU^ AGCTGACGAC ATTCAAAATG AAATCAAATC ATTACAAGAT 7200 

AAATTAGAAA ATAGTAATGG TGAACATACT GCTGTTGCTA CAGTTCATGA TATTTCAGAT 7 260 

ACTATTCAAC GATTAACTGG TATTCCAGTT TCTCAAATGG ATGATAACGA TATTGAACGT 73 2 0 

TTAAAAAATA TTTCTAATCG TTTAAGAAGT AAAATCATAG GTCAAGATCA AGCTGTAGAA 73 80 

ATGGTTTCAC GTGCAATTCG CCGTAATCGT GCTGGGTTTG ATGACGGCAA CCGTCCAATT 744 0 

GGCAGTTTCC TATTTGTTGG CCCTACTGGT GTTGGTAAAA CAGAGCTTGC TAAACAATTA 75 00 
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GACACAACAG CTGTTTCAAA AATGATTGGT ACAACTGCTG GTTATGTTGG TTATGATGAC 7620 

AATTCAAATA CGTTAACTGA AAAAGTACGC CGTAATCCAT ACTCAGTCAT TCTATTTGAT 76 80 

GAAATCGAAA AAGCAAATCC ACAAATTTTA ACATTGTTAT TACAAGTAAT GGATGATGGT 7 74 0 

AATTTGACTG ATGGTCAAGG TAATGTCATC AACTTTAAAA ATACAATTAT TATTTGTACA 7800

TCAAATGCTG GCTTTGGCAA TGGCAATGAC GCTGAAGAAA AAGATATTAT GCACGAAATG 7 86 0 

AAAAAATTCT TCCGCCCTGA ATTCCTTAAC CX3CTTCAACG GCATCGTTGA ATTCTTACAT 7 92 0 

TTAGATAAAG ATGCATTGCA AGATATCGTC AACTTATTAT TAGACGATGT ACAAGTTACA 7980 

TTAGACAAAA AAGGTATTAC GATGGACGTT TCTCAAGATG CGAAAGATTG GTTAATTGAA 804 0 

GAAGGCTATG ATGAAGAATT AGGTGCACGT CCATTAAGAC GTATTGTTGA ACAGCAAGTA 8100 

CGTGACAAAA TTACAGATTA CTATTTAGAT CATACAGACG TTAAACATGT GGATATAGAT 8160 

GTTGAGGATA ACGAATTAGT CGTAAAAGGT AAATAACGAC ACTTTAACAT ATCGCGCATC 8220

AAAAATGAGC ATCAGGTCGC CCTTGCCTGT GCTCATTTTT TTAATTATTT CCCTGGAAAA 8280

TGATTCGCTG TGTGCTGTTC TGTTCCACAA CAATCACGAT TAATGTCACA TGTACCACAT 834 0 

TTTCCTTGTT TTGAACGCTT GAAAAATTTT ACTAGTGTAT ATAAGGCATA TCCGAAAATT 8400

GCTAAAAAAA TTAAAATGTT AATAATGACT GACACTTTAA CCACTCCTTA AACAAATAAA 84 6 0 

TGTCCGACTT GATAAAAAAT GAATGTTAAG ACATATGCAG TGACTAGAGG ATAGGCAACT 8 52 0 

GCAAGTGCCG TCCATTTCCA TGAATAAGTC TCTTTACGGA TTGCTGCTAC TGTAGAAACA 858 0 

CAAGGAATAT ACAATAGTAT AAATATCATA AATGCATACG CAGATAGCGG TGTGAATTGA 864 0 

TTTTGAATCA CATTAACAAG GCCTGCATCA CCTGATGAAT AGATAATCGC CATCGAACTT 8700 

ACGATAACTT CTTTTGCTAA AAATCCTGGC AcTAACGTAG CACCTGCTTG CCATGTTCCA 8760 

AATCCGAGCG GTTGCACTAA CATACCAAAG AAACTACCAA CCATATGTAA AAAACTTTGA 882 0 

TTGATATTCA CATTGATACC ATGTGGTCCT ACATAACTTA ATAGCCAAAT GACTACTGAG 888 0 

CCGCCAAAAA TAAATGTACC TGCTTTACGA ACAAAGCCCT TAGCCTTTTC CCAAGTACTA 894 0 

CGCCACAACG TTTTAATGGA AGGCACACGG TATGTTGGCA ATTCCACAAT AAAGATTGCA 90 0 0 

TTATCATTTT TTAAAATCGT CTTAGTAAGT ACTGTACTGA CTAAAAATGC CATAATAATA 9060

CCTAAAACAT ACAGGCTTAA TACTACTAAA GATTGATTCT CTTTGAAAAA GATACCTACG 912 0 

AACAACGCAT ACACTGGCAG TCTAGCAGAG CATGACATGA ATGGTGCAAT TAATATCGTT 918 0 

GTTAAACGCT CTTTTTCATT TTCAATACTG CGCGCAGCCA TAATACTCGG TACATTACAA 924 0 

^--^^-lA TAATCATTGG TATAAAAGAC TTCCCGCTTA AACCGAACGA TTCCATAATA 9300 
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AAAAAGAGCA CAACAATTTG TGGTACAAAG ACTAATACTG ATCCTACACC AGCAATAATG 94 20 

CCATCTGTAA TTAAATCTTG TAAAAATGGT ATAACACCAA GATAATTCAT AATCGTCTTC 94 8 0 

ACACTATCTG TAAATGTACC ACCTATAAAT GCATCGAGTT GATCCGACAA AGGTGTGCCA 9540

ATCCATGTAA ATGTAGTTTG AAAGATCAAC CACATAATTG CTAGAAAGAT AGGCATCCCT 96 00 

ATATATTTAT GTGTTAATAT CTTGTCTATT CTAGAGCTGA AATATTGCTT ATCTTCATCT 96 6 0 

GGATACGTTA CCACGTCTTG CAATAACGTC TCAATATAAT GATTGCGTAT ACGCTCCATC 972 0 

TCTCGACGAA CAGATACAGC CCCTACTTGT TCAGCAACTT GATCACGTAA ACTCGACAAT 978 0 

TTATTTACAA CCTCTGAATT AAGTTCGTTT GCAATTTGCA TGTTATTTAA TAAGAATTGA 984 0 

ATCGCAATAA ACCTAGCTTG ATACTTATCA TGAGATGTCT CTGTCATTAT TATTTGACAC 9900 

ATATTTTTAA TTGTCTCTTC AATCTTCTCA CCATAATTGA TTTTAAAATG CGGTTGATAC 9 96 0 

CCTTCCCCTA GATGCTTTAT TTCGCCAAGT AAATATTTTG TTCCTTTGCC TGTACGTGCC 10 020 

ACAACTGGAA AAATAGGTGT TTTTAACTTT TTCATCAATT TATGATAATC GATTTTTATC 10080 

CCGCGCTTTG TAGCTACATC AATCATATTT AATCCGATGT ATATTGGTTG ATTAAGTTCT 1014 0 

AACAATTGTA CTGriAATTG CATATTTCTT TTTAGTTGAC TCGCATCAAC AATGTTAATG 10200

ATTCCTGAAA ATGAATCGTT TAATAAATAG TCTGTCACTA CAGTTTCATC TTTAG/^TC 1026 0 

GGCGATAAAT CATATGTACC TGGTAAATCA ATTAATTGTC CTACATTTTC TTTAAGTTTC 10320 

CCTACTTTTT TCTCTACCGT TACGCCACTC CAGTTGCCTA TATATTCATA CGAACCAGTT 10380 

AAAGCGTTAA ACAAAGATGT TTTACCAACA TTAGGATTTC CTAAAATACA ATAATTTTCC 10440 

ATTCGTCCGG CTCCTATTCT TCTAATGCAA TAGAACAAGC ATCGCAATGT CTAATACTTA 10500 

ACTGTTGTCC GTTTACTTCA ATAATACATG GCCCTTTAAA TAAACATTTT TGTTTAATCG 1056 0 

TTATGATAGC GTCATCTGTT AACCCAAAGG CACTTAGACG ATACAACATA TTCTCATTAG 1062 0 

CAATATCCAT TCGCTTTATT TTATAAGCCT TATTCATTTC ACCATTTTTA ATGTTTAACA 10680 

TACTATTTCG CTCTCCTATT AGAAATAATA ATCATTATCA CTTAAAAATC ATAACCCTTA 10 740 

AAATTGTAGC TCGCAATACT TTATTTAAAT AATTTTCATT TTTCATGTAA AATTTGTGAC 10 800 

ATTGCAAAAA TGT 10 813 

(2) INFORMATION FOR SEQ ID NO: 365: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6804 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 365:

TGATGGATTA GCAGACATTT TACGAGCGAA TGGTTTCAAA GTGTTTGGTC CAAATAAGCA 60

AGCAGCTCAA ATCGAAGGCT CAAAATTATT TGCTAAAAAG ATAATGGAAA AATATAATAT 12 0 

TCCAACTGCT GATTATAAAG AAGTTGAGCG AAAAAAGGAT GCTTTAACAT ATATTGAAAA 180 

CTGTGAATTG CCCGTTGTTG TCAAGAAAGA TGGGTTAGCT GCTGGGAAAG GCGTTATTAT 24 0 

TGCAGATACT ATTGAAGCAG CCAGAAGTGC TATTGAGATT ATGTATGGTG ATGAAGAAGA 300 

AGGTACTGTT GTATTTGAAA CGTTTTTAGA AGGTGAAGAG TTCTCGCTAA TGACATTTGT 360 

TAATGGTGAT TTAGCAGTAC CTTTCGACTG TATTGCACAA GATCATAAAC GCGCATTTGA 42 0 

TCATGATGAA GGACCAAATA CTGGTGGTAT GGGGGCTTAT TGTCCmgTAC CACATATTAG 480 

TGACGATGTT TTAAmACTTA CAAATGAAAC AATTGCACAw CCCATTGCAA AGGCAATGCT 54 0 

TAATGAAGGT TATCAATTCT TCGGTGTATT ATACATTGGT GCTATTTTAA CTAAAGATGG 600 

TCCAAAAGTA ATAGAATTTA ATGCCCGTTT TGGTGATCCT GAAGCTCAAU TATTATTAAG 660 

TCGCATGGAA AGTGATTTAA TGCAGCATAT TATTGATTTA GATGAAGGAA AACGTACTGA 72 0 

ATTCAAATGG AAAAATGAAT CTATTGTAGG GGTCATGTTG GCATCAAAAG GATATCCTGA 780

TGCATATGAA AAAGGGCATA AAGTAAGTGG CTTTGATTTA AATGAAAACT ATTTTGTTAG 840 

TGGATTAAAG AAGCAAGGTG ATACCTTTGT TACTTcAGGT GGTAGAGTTA TACTTGCCAT 900 

CGGAAAAGGT GACAATGTAC AAGATGCACA GCGAGACGCA TACAAAAAAG TATCACAAAT 960 

ACAAAGTGAC CATTTATTCT ATCGTCATGA CATTGCGAAT AAAGCACTAC AACTTAAATA 1020 

AGTAAATTTA AAATACTAAG aTTAGCTATG AACGAATCTA TAACGATAGA TTTTTTCATA 1080 

GCTTTTTTAG TTGTAGAGTC TAGGACATTG ATTTCTGTAC CAAATTTGTG ATTATGCATA 114 0 

TGTAATACAA AAGAGGCGCC ACAACATGTT TGGATGAACA AAATAACATG TTTGTGGCAC 1200 

CTCTTTTGTT TAGTATGGAA TAAATGGTTT TCTTTTTCTA TACAATGAAT TTCTAATTTA 1260 

GTATCTATAC AATTATGGAT AAAATTTAAC CTACACGACC AAGACGAACA TCATCTATGC 1320 

CCGTGATGGG TAAGGTGATT GAACAATAAT ATGCCATAGT AATAATGGCA ATTAAAACTA 13 8 0 

TAATAAAGAT TATATCTTTA TATGAGAAAG GTACGTTGTA ATAGTAAGTA CGAGGACCAT 14 4 0 

CTCTAAATCC TTTCGACTCC ATCGCAACTG ATAATTGATG TGCCTTTCTA ATATTTTGGC 15 00 

TTAATAGAGG TATAATTAAA TGCTTAAATC GCTTTAACCC TCTATAATTT GCCGCGTCTA 1560

TCATCTGATA GCGCATTTTT AAAGATCTGC GAAgcTGTAt TAAAGAACTA ATCATTAAAG 1620

CI^ATCATACG AATGGCAGCC ATGAATGCAT AAGCAACTTT TGATTTAACC TTTAAATGTT 16 80 
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ATGAAATAGC AATGGTTCTT AATGATACAT GTAAACCACG AACTAAACTT TCTGTTGTAA 18 0 0 

TATGGATAAA TCCGAATTTC AAAATTGTAT GGCTACCATT CCCGTATAAA ATCATGAACA 1860 

GGGAAGAGAG TAATGCAAAG CCAATACTTA TAGTTATAAA AATTGCTGTA ATTTTAAACT 192 0 

GAGTACCATT AAACATCAAT AAGAAAACTA ACATTAAGAT AGTGATATAA AGCATAAAAT 1980 

CGAAATTATG CACAAATATA ATAAAGAAAA ATAGTATAAT TCCAAGAAAT AGTTTCGTTA 2 04 0 

TAATGTTGAC ATCATCAACA AATGATTGCC GAACTTTCCA TTGCTCATAC ATTCGTATCA 2100 

CCATCACAAT CTAGTAACGC ACCATCTGAA ATTTTAAGTC TTCTTGATGG ATAACGTTCA 2160 

ATTATTTCAT CGTCATGTGT AaCCATGACA ATACTTTGTC CCAAATTAAT TCGCTTTTGG 222 0 

AAAAGTTTGA TCAACTGGAA TGTATTATGG CTATCAAGTC CAAATGTCGG TTCATCTAAA 228 0 

AAGATAATAT CAGCTTTAGA ACTTAGTGCG GTAgcTACGC TAAGGCGTCG TTTTTGACCA 234 0 

ATAGACAACT CATAAGGATG TTGATCTTTT ACATTTTGTA AATCTAAAAG TTTTAAAAGT 24 0 0 

TGTATCGTTT CATCATCACT TTGATCTTTA GAAAGGTGAT TAAAATGAAT GTTAATTTCA 2460 

TCATAAACCG AATTTGTTAT AAATTGTAAT TCTGGGTTTT GATAAACTAG GTACATGTGT 2 520 

TTTGCTGCAT GTTTAATTTT TGTTAAACGC TGATTTTCAA AATAAACATC ACCTTGATAT 2580

TTAATCAATT GCATAATTGA TTCAAGCAAG GTTGTTTTAC CACTACCATT TGCCCCTGTA 2 64 0 

ATTGTAATCC ACTCACCTAG ACCAATTTCT AAATCTGAGA ATGAGAGCAA TGTTGATTTA 2 700 

CCGCGAATAA TACGTCCATT TTTAAATTGT AATAAGTGTG AGTTTGTTGT TGGAAAGTCA 2760 

ACACGACTTG GTGCGAATTC CCATGCACGT GGATGCCACA CACCATATTC ACTGAGTAAA 2 820 

TGAACATACT TCTGTAATAT GATTTCAGGA CATTCATCGG CAATGATATT TCCGTTATAA 2 8 80 

TCCATCAAAA TGACGCGGTC GACATGATTC CAGATGTGTT TAACTTTATG TTCAACGATT 2 940 

ACAACCGTTT GATCTTCCCA AAGTTCAATT AGTTTAGTCC ATAAATCTTC TGTTGCTTGA 3 000 

ACATCTAACA TTGCTGTCGG TTCATCTAAA AACAATGTTT TTGATTGTTG AAGAATGGTT 3 060 

TCAACAATTG CCAATTTCTG TTTCATCCCG CCACTTAAAT CTTTGATATA CGTTTCAGGG 3120 

GTAACATTTA AATTGACCAT ATTTAAAGCA TTGATAATTA ACGCATCCAT GTCTTCACGT 3180 

GGTAATTGTC TATTTTCTAA AACGAATGCA AGTTCTTCGT ATACTTTTGG CATACAAAAC 3240 

TGGCTATCAG GGTCTTGGAA AATAACGCCA CTTAATGGGT CAACGATTAG TTCATCATAT 33 00 

TTCATAGGTA ATTCAATTAA ATTAGGAACA ATACCACTTA ATACATTCAG AAGTGTACTT 3 3 60 

TTACCGCAAC CAGAAGGACC GAGTAAAAGT ACTTTTTCTT TGTCTTGAAT AGTGATATTT 34 20 

AAATGATCGA AAATTTTACG TTGACCACTT GGATATTTTA ATCGTAAATC ACTTACTTTT 3 4 80 
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ArrTTGTTAC GCCK^TCTTA TCTAAAGCTT TTACTAAA;.G GTAAGATAGG ACGCCGGCGA 
CTACTGCACC ACTAATTAAT CTAAATACGA TGAATAATGT TAAGTTCCAA CCTGCAACTT 
CATTTAAATA ACCATAGAAA TAATCTATCG GGAAAGCCGC GATrGCTCTA CAAAAACCTG 
CTAACATAGC TACCATAACT GAACGTGATT GATATTTAAA AATTGCAAAG ACAAGTTCAC 
ACGCTAAACC TTGTATAAAA GCGTAAACGA TTGTCGGAAT ATCGAAACGA CCCATAATGA 
TAGrrrCGCC GGCACCTGCA GCAAATTCAG CCAGTAAAGC AATACCTGGT riTGGAATAA 
riAGATAGCA GACAATCGCT GCCATGAACC AAACCCCGTT TGrTAATTGT TCGAGGTGAA 
GGCCTGTAGC TTGCACACCA TTGTAAACAA ACCACCATAA ATTGTAAATA ACTGCGAATA 
CTACTGAAAT AAGTACGGTT ACTAGTATTT CAGATAGCTT TAAACCTTIT GACATTTTTA 
CATCCTCCTA ATAAAAAAAC GCACAACCAT CCATAGGAAA GTTATGCGTT CACAATATAT 
ATTAGTAAAA CATATGTATA GTAACACTTT CCTACGCTAG TTCAAGCTAG ATCAGGTTCA 
AAGGGrrTGA GGGCAAGCCT CATCTCAGTA TAAAACACCC CTAGTGTGTG CGATT-TA. . . 
AATTAATTAT ACTGTAAGAC GTTTGTAAAC TTATGTCAAT AGGTTCTCTT CATGAAATTT 
CGTITAATTC GATTTAAAAT TTATAATATT AGCATTGGAT rTAAATTGAA GATGTAGTAG 
GAATGTTAGT AATTAAAGAT ATAAAAATAT GTGACATGTA ATAATATTGA GCIGATAAAT 
GAAGAGGGAT ACTTATCAAT CATACCTCTT TAACAACAGT GAAGAACCCG TGCATAATGG 
CTTACGAATT ATAGTTTATA AGGAAGAAGA GGGATACATG CGCCGAGCAC ATGCATAAAA 
GCCCCTAACA ACTAAAAGTT GTAAGGAAGG AGAGGGATAC ATGCGCCGAG CACATGCATA 
;VAATCCCCTA ACAACTAAAA GTT^STAAGGG GArTTAAAlT AATTTAGTGT ATCTTCGATA 

TCTTGrrrrG kTTGaTTAAT ATcrTCTGTT TrrrcTTCTr rrTTATCTTT TAATrrrrcT 
TCAAcrrcrrr tagctttttc TGcrGcrrrr TTATrrrGAT rrrcArrAGA catgattaat 

TCCrrCCCAAA TTGGATAATT AnTATATAT AAATCTTACC CGGTTGTACT TTCOrrAAAC 
TTTTCTAAGT CTATAGCACT ATTrATTCAT TTATCTAAAG ACAACAACAT TAGATTAATA 
TATAATGATT TTGAGGTGAA CATAATGTCT TTTCnAGGA AACACGCCGA AATTATmT 
AGCTATTTAA TCGGTmwCGT TcACTCTTCA CTGGkcTCAT TATTTTAATT AACTTGCCAT 
TAATTAAACA ATTAAATGGT GGTAAAAAAG TTGATACACA TGTTCATAAT GTGTGGGAAT 
TTCTGAATGC ATrTTTCAGT GAAArTATTA AAGTAATGAG TCGATTTATA GGTAATTTCC 
CtATAGTTAG TGCAATTGTG ATAATTATAT TCGGTATTTT AGTTATGTTG ATTGGTCATA 
^.-^TArrTAG AACTATTAAG TATGACTATG ATATTTCTAT CTTTrTCTTA GTTATCGGTA 
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TTTTCATTAT TCCATTTACA ATTCATATAG GATATATCGT CTATAAAGAT GAATTGAATC 54 0 0 

AGGAAAATGT AAAAAATCAT TTCATGTGGA TAATTGTGAG TTATGGTATA AGTTACTTAA 54 60 

TTACACAAAT TGCATTGTAT GOCAGAATTG ATGCTAATGA AATAGAGTCA ATTGATATCT 552 0 

TAAGTGTCAA TGCTTTCTTT ATAATTATGT GGTTACTTGG TCAAATGGCT ATTTGGAATT 55 8 0 

TCTTGTTCTT GCGCCGAGCT TTACCTTTAA CAAAGCAAGA ATTAGGTGAA GAGGAGCCAG 564 0 

AATTATCAAG AACAAGTAAA GGGAATGTCA CGAATCAAAC TAAAATTCAC TTGAAACAAC 57 0 0 

TCCAAGATAA GACTACAGAA TATGCACGTA AGACAAGAAG AAGTGTCGAT TTAGATAAAA 576 0 

TTAGAGCTAA AAGAGATAAA TTCAAAAAGA AAGTTAATGA TATTATCGAT ATTCAAGAAG 5 82 0 

ACGATATTCC TGATTGGATG AGAAAACCGA AATGGGTTAA ACCAATGTAT GTCGAACTAT 58 8 0 

TTTGTGGTGT CGTCATCTTT TTATTCACAT TTTTAGAATT TAATAATCGT AATGCATTAT 5 94 0 

TTGTATCTGG TGATTGGAAA TTATCACAGA CACAATATGT TATTGAATGG GTTACATTAT 60 0 0 

TAATTCTGTT ATTCATTATT ATCGCATATA TCGCTACAAC GTTAACTTTC CACTTGAAAG 60 6 0 

GTAAGTTTTA TTATTTACAA TTATTTATGG GGAGCATTTT ATTCTTTAAA TTGTTAACGG 612 0 

AATTTATAAA TATAATGATT CATGGACTAT TACTTTCAGT GTTCATTACG CCAACATTAC 6180

TATTAATGTT ATTGGCAATC ATCATTTCTT ATTCGTTACA ATTACGAGAG CGACCATAAT 624 0 

TAAAAGCATT ATAAAAGTAC TATCTATTAA ACATTTTGAT GTGTACGCTA TAAGTTAGAT 63 0 0 

ATATCTCTAA CTTACTTAGA TACAGGTCAA TGAAGTTTAT GGATAGTACT TTTTTTGTAA 63 60 

CTAGATTTGA TTGATTCAGG TGATGTGAAT TAAGTATTGA rAATTGTATA CAAAGTTTTA 642 0 

GTGCAAATAA AATAGTTGAA AAGTTATCCA TTTGTAAAAT CAAGAAAACT AGTAAATAGT 64 80 

TGAAGCGACT TATGGaATTT GCGAAACGAT ATATAGTATT TCCTTTGTAG AAATTTmACA 654 0 

TATATCATTC AAATTACTAA TTTGTTAAAA TCAACAGTAA GATTAGAAGT AGATGATATT 66 00 

GAAATTTGGC AAACAaTtTA ATCTATATAA AACTACAACG AAACACAGAA AGGAAGTTGT 66 6 0 

CAGATGAAAA TAGCAACTCT GAACAAAGGC AAAGAAACAA AATATTTTAA TGGATATCCT 6720 

TTAATTGAAG AAGAGGATAT CTATTCACAA GATCATTTAA AAGAAGGAGA TATTTTTCAA 6 7 80 

ATTGTGACTG ATAAATCACA ATAT 6 8 04 

(2) INFORMATION FOR SEQ ID NO: 366: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1717 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 366: 
aaaAGAGACG CATTaAAACA AGCTATACAA ATTATCGATA AATTAACATG GGGTGTTTAG 
TGGTGGTTAA AGAAATTTTG AGACTATTAT TCTTACTAGC GATGTATGAG CTAGGTAAGT 
ATGTAACTGA GCAAGTATAT ATTATGATGA CGGCTAATGA TGATGTAGAG GCGCCAAGTG 
ACTTTGAAAA AATCAGAGCT GAAGTTTCAT GGTAATAGCT ATTATCATTT TTGAATTAAT 
TATATTAATG TGTGTAGCAA TAGCACTGGA GGTGTTGTAA ATATGTGGAT TGTCATTTCA 
ATTGTrrTAT CTATATnTTT ATTGATCTTG TTAAGTAGCA TTTCTCATAA GATGAAAACC 
ATAGAAGCAT TGGAGTATAT GAATGCTTAT CTTTTCAAGC AGTTAGTAAA AAATAATGGT 
GTTGAAGGTT TAGAAGATTA TGAAAATGAA GTTGAACGAA TTAGAAAAAG ATTCAAAAGC 
TAAAGAGAGG CGTTGGCITC TCTGCTCTAT CyAAAATAAT GAAAGGAGCC saACATGTTA 
GaCmAAGtCA CTCAAATAGA AACAATTAAA TATCATCGTG ATGTCTCATA TTCTTATGCT 
GCTAGTCGTT TATCTACACA TTGGACTAAT CACAATATGG CTTGGTCTGA ClT-TATGCAG 
AAGCTAGCAC AAACAGTTAG AACTAAAGAA GATTTAACTG AGTACAATAA AATGTCTAAG 
TCTGAACAAG CCGATATAAA AGATGTTGGC GGATTTGTCG GTOGATATTI AAAAGAAGGC 
AAACGGCGTG CTCGTCAAGT CATGAATCGT TCAATGCTAA CACTTGATAT CGATTATGCA 
GCCCAAGATA TGACTGACAT ATTATCTATG TTTTATGATr TTGCATATTG TTTATATTCA 
ACACATAAGC ATAGAGAGAT AAGTCCAAGA CTGCGTTTAG TGATTCCTTT AAAACGAAAT 
GTAAATGCAG ATGAGTATGA AGCTATTGGG CGTAAAGTCG CAGATATCGT TGGCATGGAT 
TACTTCGATG ATACAACTTA TCAACCACAT AGGTTAATGT ATrGGCCTTC AACTAGTAAC 






GATGCGGAAT TrrTCnTAC CTATGAAGAT TTACCTTTGT TAGACCCAGA TAAAATATTA 
AATGAATATG TTGATTGGAC TGACACATTA GAATGGCCAA CGTCTTCAAG GGAAGAGAGT 
AAGACTAAAA GATTAGCAGA TAAGCAAGGC GACCCAGAAG AAAAGCCGGG AATTGTTGGT 
GCATTITGTA GAGCCTATAC GATAGAAGAA GCTATAGAAA CTTTTATTCC TGATTTATAC 
GAAAAACATT CTACTAACCG TTATACCTAT CATGAAGGTT CAACTGCAGG TGGATTGGTG 
TTATACGAAA ATAACAAGTT TGCCTATTCT CATCATAATA CGGATCCCGT AAGCGGTATG 
CTTGTGAACA GTTTTGATTT AGTACGCATA CACTTATATG GTCCTCAAGA TGAAGAAACT 
AAAACAGATA CTCCGGTTAA TCGACTACCT AGTTATAAAG CAATGCAGCa AAGAGCGCAA 
AATGATGAGG TTGTTAAAAA GCAATTAATT AATGACAAAA TGTCTGATGC AATGCAGGaT 
rrCGATGAAT. GAGAAAATAG CGATGATGCA TGGTCTGAGA CGTTnGAAAT TACTTCGAAA 
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(2) INFORMATION FOR SEQ ID NO: 367: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1847 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 367: 

AAGATTAGGC ATCAAGACAG GGTCGCGATT GTTTGAAATC CCACATAGnA ATGATATTTa 6 0 

CaTTATCAAT CCaAGTATGC GTAAATATCT TAATGTTTCA GTTGCTATTT CTAAGATTGC 12 0 

ATTGCGTTAT ATTCCACCTG AAGATTTACA CCAATATAGT ATTGACGAAT TTTTTATGGA 180 

TGTTACTGAT AGCTATCATA GATTTAGTTC TACAGTACAT GCATTTTGCG AAAGACTTAA 24 0 

ACGTGAAATT TATGAAGAAA CAGGCATTTA TTGTACTGTG GGCATTGGTT CTAATATGTT 3 00 

ATTAAGTAAA ATTGCTATGG ATGTTGAAGC GAacATAGTc AAAATGGTAT AGCTGAATGG 3 60 

CGATATCaAG ATGTACCAAC GAAATTATGG CCAATTCmGC CCtTGCGAGA TTTTTGGGGT 420 

ATTAATCGTC GAACAGAAGC CAAATTGAAT AAAAGAGGAA TTTTTACTAT AGGAGATTTA 480

GCGAAATATC CATATAAATT TTTAAAAAAA GAGTTCGGTA TTTTAGGTGT TGATATGCAT 54 0 

CTACATGCGA ATGGGATAGA TCAGAGTAAA GTACGTGAAA AGCACAAGAT CAGCAATCCA 600 

TCGATATGCA AAAGTCAAAT ATTAATGAGA GATTATCATT TTGATGAAGC AAAAGTAGTA 66 0 

ATGCAAGAGT TAATTGAAGA TGTTGCTAGC AGAGTTCGAG CAAGAAAAAA AGTGGCAAGA 72 0 

ACGATACATT TTGCCTTTGG CTATAGTGAT GAAGGCGGTG TACATAAGCA ATATACTTTG 78 0 


AAAGATCCAA CAAACTTAGA AAAAGATATT TATAAAGTAG TAATGCATTT CGCAGATAAA 84 0 

TTATGTAATA AACAAGCACT ATATCGTACG CTAAGTATAT CTTTGAGTCA ATTTATTAAT 900 

GAGGATGAGC GACgTTAAGT CTGTTTGAAG ATGAATACCA ACGCAAACGT GACGAATGTC 960 


TAGCTAAAAC GATAGACCAA TTACATTTGA AATACGGCAA AGGTATTGTG TCCAAAGCAG 102 0 

TATCGTTTAC AGAAGCAGGT ACAAAACACG GCAGATTAGG TTTAATGGCT GGACATAAAA 10 80 

TGTAATGACT ATACGGTTTA AGTAATATAT AACTGTGATT CGTATAAAAT AAGTCTCTAA 114 0 

AGATAAATAT TTCATATATC ACAATAGATT TTCACAATAA TATCTAAGAA TACATGGAAT 1200 

TTATCAAAAG AGACTTAATA ATTATTGGAT ATAACAATCA AAATCACTCA ATGCTTGCAT 1260 

ACCGCGTTCT CGGTCAGTAG GGTTTTTGAA ACTAATTTTT AAAGCACCGT ATATATCTTC 1320

GCGTACTTCT AAGATTCTTA AGTTGCTTAT AGATATGTTA TGTAAACTCA GGATATAAGT 13 80 

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10 



TAGTCCACCT AGTTGTTTAG CGGGTAGTGC GTCGCGATAC GATTTAGCTT GGGCAAAAAA 1500 

TGATAACAAT TTTTCAGAAT CATTGCTTTC AATTAGTCTt TCTAAATCTT GAAACTGACT 156 0 

TTTTAGCTGT CGAATCATTT CTAAAATATA CGTTTTATTA CTCAAGGTGA TATCTTTCCA 1620 

CATTTGTGCA TTACTACTAG CTATACGAGT GATATCACGA AAACCACCAG CTGCAAGTTT 168 0 

ATTAACTAAA TGATGTTCTT GACCGTTCTT TTGACTAACA TGAACTAAAC TAGATGCAAC 174 0 

GATATGAGGT AAATGACTTA CGACGCTTGT TACGTAGTCG TGTTCTTCAG CAGTAGTTAC 180 0 

AATAAATTTA GCAAGAGTAG GTGATAACAG TTCTTTTAAC GTGTTTG 184 7 
(2) INFORMATION FOR SEQ ID NO: 368:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 494 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 368: 

AACAAAAGGC ATAAGTTCGT GAATTAATGC GTATACAAGG ATAAAGCTTA TAACAGTAGT 6 0 

AATTGTTGCT ATCAAACGAA CAACATATAT TCTATTTTCA GATAGCAATC GATTCACTAA 12 0 

TCGATAATTT ACGTATACAA GAATTAACAA CAGCACAATA TAAACAATAA TCATATTTGG 180 

CCCCATTTAA TTTTTAAATT TGTTTTCACA TCATTTTCTA CTTTATTATC ATACTAATTT 24 0 

TAAAGGCAAA GGTGGACATC GGCAACCTCT CGTAAACTAT TTATCAAAAA TAAACGTATC 3 00 

TCATTGTTAT GATATTTATA AATCAATTCG TTTTTATTAA AGTCTTTTTC AACAAGTTTG 360 

TCACTATCTA TTAAATAATC ACGCATGCAT CCTTTTAAGA AATCATCTTT ATAACTTGGT 4 20 

GTGTACCATT TTCCATCCTC TTCAATGACA ATGTTGCCAA TATCAAATTC AAGGACCTTG 480 

CCGTCCTCTG AAGT 4 94 
(2) INFORMATION FOR SEQ ID NO: 369:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2518 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 369: 
r-AArTAAAAA TGGGTAAATC CCTGGATTTa AATCTACTTC 60 
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GATAATTTCn AATGTAGCGA CCATCATTAC GAGATGATTT GATATAAGCA CAGTTTGGAT 180

GTTGACCAAT ACTATCGCCT TCTTCTTCGA TGATATCTAT TTTAATACCA TCATCAGCTG 240

CAATTTCTAA TGAAGATTTA ATTCGGTTAT CAAATGTTGA ATATCCCATT GCTCCACCCA 300

CAATAGCGAC ATCTGTACCA TGTCCTTGGT GTGTTTGAGC AAATGATTCA TAATAATGTA 360

TTTCAATATT TTTAGGAATA TCTCCCAATA TTGCGCGTGC TGAATTCCCA ATCTTTACTG 420

CACCAGCCGT ATGAGAACTT GAAGGGCCCA TCATAACTGG TCCGATAATA TCGAAAGCAC 480

TTTGATAATC ATAGCTCTTT GCCATAATTA AACACTCTCC TTAATATGAT TCTTTTTGCT 540

CGGCATTTTA AAGTTGATAT TCATTAAATT AAACTTATTA ATTAGTGTTC CAATAATATA 600

GGCTAAGACA ATGCTGACAA TAATCACTGT TGCGATTGTT ACAATTGATG TCACTGCATT 660

ATTAAAGCCA AACAATACGA TGGCGCCTGC AATTGGTGTT GCCATACCTT TGACACCTAT 720

TACTAGTCCG CTAAATGTCA CGATACATGC GTTGACGACC CCAATCAGTG CATTTGTACC 780

ATATAGTTGT ACTGGATATT GCGCTATTAA ATCAATTTGC GTCAATGGCT CAATACAAAC 840

TGCAAATGCT TTTGACGGTC CACCAATGTT TAATTTTCGG AATAAAATAA GGTTAACAAA 900

TGAGCTACCT GTACATGTTA GTGCTCCAAT AGCCATAGGA ACACCTGTCA GTCCTAATAA 960

ACTTGTTAAT ACCATTGAAC TTAGCGGTGT CATACCTGTA ACAGGAATCA CTAGTCCTAA 1020

AATGACCGCT AATGCATATG GATTGTTATC ACCTACCGCA GTGACAGCAC TACCTATTTG 1080

TTTTAATGTT GCTAGCACAC CAGGTGTAAT GATTGATGCA AGTCCGAAAG CAATTGCTGG 1140

TGCAAATAAG ATCACCACAA TTAAGTCCAA GCCTTCTGGA ACTTTCTTTT CAATCCATTT 1200

AATTAAAAAA GcTACGCCAT AAGCTGCGAT GAATGCTGGT AATAATTTAA AGTCATGTAA 1260

TACTAAACCA ACAATGACCG CAAATACTGG TGCAACGCCT AAGTTTAAGC ACGTTAGAAT 1320

ACCTACTGcG ATACCGCTTA AACTTCCTGC TAAATCCCCA ATATCTTGTA GAAATTTAAT 1380

ATCAAATACG CCACCAATAG CATAACTTAA GAATGCTTGT GGTAGAAATG TCGCACAAGC 1440

TGCACCGGAT AATGCTTGTA GTCCTTGTTT ACCGTACGGT GCATACTTTA AAAATAGCGT 1500

CATGATCACT AAAACCAAGA CTAATGTGCC TACACCTAAC AGAATATCCA TTTCCCAAAA 1560

ACCTCTCTCT ATGTTTATTT TATTTTCAGA CCATAAACAT CGTACACCCT TAAGAAAACG 1620

TTTTCAACTT TTATCTGTTA TCAAATCaAA TATTTAAGTG AAATATTTCA TATTTGTGAA 1680

AGATTTTTAA AATGGATTGT TTCAAAAACT ACTTATGTTG TCGTTAATAT TTACTAATTA 1740

ACTTTTTACT CTATATTTCA AACAGTTGTG TGACAGTTTT TTGATAACTT TTTTACATCT 1800

GAAAGTAAGT AATATTTCTA AAAACTTTTA ATATTTATAC ACTTTATCTT TCGAGCTATT 1860






ACACATATAT TTGCAATAAG ATAATTAAAG TAGGATATTA TTTTTAGTTT TCTGATAGGA 1980 

ATGATGATAG TCTATAGGTT GAATCTTTAC TTTTTTTTAA AGCTAAATTT ACATCAACTT 2040 

AACAATGGTT GGTTATACTG AAGATGAAAA TATTTAGTAT AACTTAGTGG AGGCGATAAA 2100

GGTGCAATTG AGTCATTCCG TTAAAGTTGC AATTTCTATC TATTTAGCAC TTATCTTTAT 216 0 

AACGTTCACT TCTTATTTAG TCATTATTTT ATATACGAGT ATGACTGGAC ATGATGTATC 2220 


ACATTTCGTG TTAGATAGTC AGCATTCTCA TCATGGATCT CTTACGCAAA AACATTTGAG 22 80 

TCTTCCTGAA ATCTCATTTA AATAGTTTAT CCTCTGTGTT TCAACATTCA TTTCCCATAT 234 0 

CGATTCATTT ATCTATCATC TAGACCACTA CATCTTAGAT GATTTTTTTA TTTTCTCATT 24 00 


TCACTCTTTC TTAAAAGTCG ATATAATGAA TTAAATCATT ATCATACACC GACATATTTT 24 60 

ATGTTGTTGG TGTTAAGTTT aAAGGGGTGA GATACTTGGC GAATaATCaT TCAGCTTT 2 518 
(2) INFORMATION FOR SEQ ID NO: 370: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 790 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS : double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 370: 

ATACTACTGG ACCGTCnACC AATTGACAAA ACCTGAAAAC TGGATTTTTA TTGCAGGATT 60 

ACATCGAACC TGGAAGAACA ATCCCCAGGA GTTGATGATA TTCGAGATAT GGCATACAAT 120 

CAAGGTAGTT TAGATAAGAC AATTTATGAA ATTTCTAAAC GCACAGTACT ATTTTTAATA 180 

CAGAAAGATA TTACGGTATA TAATAAGACG ATTGACTGTT TAAATTATTA TAACTATAGT 240 

GACGAAAGAA TAAAGGATGA TTAAATGAAT TCACAAGAAT TATTAGCAAT TGCTGTGGAT 3 00 

GCAATTGACA ATACCCCAGG CGAAGATACG ATTTCTTTAG AAATGAAAGG TATCAGCGAT 360 

ATGACAGATT ATTTTGTTGT AACGCACGGA AATAATGAAC GACAAGTTCA AGCGATTGCT 42 0 

AGAGCGGTGA AAGAAGTAGC CAATGAACAA AATATAGAAG TAAAACGTAT GGAAGGATAC 4 80 

AATGAAGCGC GTTGGATATT AATTGACTTA GCTGATGTTG TGGTACATGT TTTCCATAAA 54 0 

GACGAAAGAA ATTATTATAA TATTGAAAAG TTATATCAAG ATGCACCATT ACAATCATAT 6 00 

AGTCAGGTTG CGTATTAATT ATGTCGCAAT ATGCAGAAAT GAGCCTAGTG TACGATCAAT 66 0 

TGACTCAAGA TCAACCATAT GAAAAATGGT TTGAAATTGT AAAAAATCAC TGCAAAGATG 720

AATCAAATAT TTTAGATATT GGATGCGGTA CTGGTAGTTa ACAGTTCAAT TAGAAGCTTT 780 



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(2) INFORMATION FOR SEQ ID NO: 371: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1823 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS : double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 371: 

ATAGATGAAG GTGCAAATAT TGAAaTAGGT TATTTACCTG GACGCTTGAA ATGGTTAGTT 6 0 

GCTGATTTAT TAACTAAACA AGGATTAAAA GTAGTTAACG ACGATATGAC AGGAAGAACG 120 

TTAAAAGATC GTAAATTATT AACAGGTGAC AGTCCTTTAG CTTCAAATGA GTTAGGAAAA 18 0 

TTAGCAGTTA ATGAAATGTT AAATGCAATA CAAAATAAAT AATTAAATAT TAATTAGAGG 240 

AGCCTCATAT GTAAATGTAT GAGGGCTCTT TTTTTTGGCA AAATTTAAGT GATACTTGTA 3 00 

AAATAGAACC TATTATGAGT ATGATTTAAG AAAACGCTTG CAAAACTAAT AACCGCAACT 3 60 

AGCGATATGG AGGAAACATG ATGTCTTATA GCATTGGAAT TGATTATGGA ACTGCTTCAG 4 20 

GCCGTGTGTT TTTAATTAAT ACAACTAACG GTCAAGTAGT ATCAAAATTT GTGAAACCAT 480

ATACACATGG TGTCATTGAG AGTGAATTAA ATGGTTTGAA AATACCACAT ACATATGCAC 54 0 

TTCAAAATAG TAATGATTAT tTAGAAATTA TGGAAGAAGG AATATCATAT ATAGTACGTG 600 

AATCAAAAAT AGATCCAGAC AATATAGTAG GTATTGGTAT AGACTTTACT TCATCTACTA 660 

TTATTTTTAC TGACGAAAAC CTTAACCCGG TACATAACTT AAAACAATTT AAAAACAATC 72 0 

CACATGCGTA TGTGAAACTT TGGAAACATC ATGGTGCATA TAAAGAGGCA GAGAAATTAT 78 0 

ATCAAACTGC TATTGAAAAT AATAATAAGT GGTTAGGCCA TTATGGATAT AATGTTAGTA 84 0 

GTGAATGGAT GATTCCCAAA ATAATGGAGG TCATGAATCG AGCACCAGAA ATTATGGAAA 900 

AAACGGCTTA TATTATGGAA GCGGGCGATT GGATTGTAAA TAAATTAACT AATAAAAATG 960 

TACGCTCGAA TTGTGGATTA GGTTTCAAAG CATTTTGGGA AGAAGAAACA GGGTTTCATT 1020 

ATGATTTATT TGATAAAATA GACCCCAAAT TATCAAAAGT AATTCAAGAT AAAGTATCTG 1080 

CACCGGTTGT TAATATTGGT GAAGCAGTAG GGAAACTGGA TGATAAAATG GCACAGAAAT 114 0 

TAGGATTATC AAAAGAAACT ATGGTAAGTC CTTTTATTAT TGATGCCCAT GCTAGTTTAT 1200 

TAGGTATTGG GTCTGAAAAA GATAAAGAAA TGACTATGGT GATGGGAACA AGCACATGCC 126 0 

ATCTTATGTT AAATGAAAAG CAACATCAAG TGCCAGGTAT ATCAGGTTCT GTAAAAGGAG 1320

CAATTATTCC AGAATTATTT GCTTATGAAG CGGGGCAATC AGCAGTAGGT GATTTGTTTG 13 80 






CTGTATTTGA ATTAATGAAT GAAAAGATAA AACATCAAAT GCCAGGTGAA AGTGGGCTCA 1500 

TTGCTCTTGA TTGGCATAAT GGAAATCGAA GTGTATTAAG TGATAGCAAT TTAACAGGTT 156 0 

GTATCTTTGG ATTAACTTTA CAAACTAAGC ATGAGGaTAT TTATAGAGCm TATTTaGaAG 162 0 

CTACaGCATT TGGTACtAAG ATGATTATGC mACAGTATCA AGATTGGCAT ATGGaAGTAG 1680 

aAAAGGtATT TGCaTGTGGc gGTAtACCtA AAAAGAATGC TGTTATGATG GATATCTATG 1740 

CGAATGTACT GAATAAAAAA CTAATTGTTA TGGATAGTGA GTATGCACCA GCAATAGGCG 1800 

CAGCAATATT AGGTGCAGTC AGT 182 3 
(2) INFORMATION FOR SEQ ID NO: 372: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1600 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 372:

ACGATCATCT GCATCAGCGA ATTCCGATGc AaTTcATATG tCCtAATAAC AAAAGAATAG 60

GTTTTaAAAG ATATGCATAC CGTAATGaTA GATATAGTTT TAAACGTGAC TTCAAGCTAT 12 0 

ATGaATGTGA TGACTGTTCA TCATGTTCTT TGAGACATCA ATGCATGAAG CCAAATTCGA 180 

AATCCAATAA GAAAATTATG AAGAATTATA ATTGGGAATA CTTTAAAGCC CAAATTAATC 24 0 

AAAAGCTTTC TGAACCAGAA ACGAAAAAAT CTATAGTCAA AGAAAAATTG ATGTAGAGCC 3 00 

TGTTTTTGGA TTTATGAAGG CTATTTTGGG TtTCACTCGA ATGTCAGTTC GGGAATAAAT 360 

AAAGTTAAAC GAGAGCTAGG TTTTGTATTA ATGGCACTTA ATATAAGGAA AATAgcaGCT 42 0 

CAACGAGCTG TACATTATAA AATACATATC AAAAAAGCTG ATTTCTATCA AATAAATAAT 4 80 

AGAAATCAGC TTTTTTACAT TGCCTAAGAA CTTTAAGGAA CTTAATGTCC CAAGCTCTTT 54 0 


TTTGTTATAT CTAATTCGTA ATTTATGATT GTTTATTCGG TCCTTTGATG TTCACTAAAT 60 0 

GTGACTTTAA ATCTTGTTCT AATTGTTGTA ATTCTTTTTC AGCTAATTGT CGTTCTTCGC 66 0 

GACCGTGTTG TTGAATAATT AATGTTTCTT CAATTGTCTC AATAATGTTA CGCTGTGTAC 72 0 

GTTTCAATGT ATCAAGATCA ACAATGCCAC GCTCATTTTC TGTTGCAGTT TCAATCGCa.T 78 0 

TTTGTTTCAA CA'ITTCAGCA TTTGCTGTTA ATAAATCATT AGTTGTATCA GTGACAGCTC 84 0 

GTTGTGCAGC AACTGCATTA CGCTGTCTCA TTAATGTAAG CGCAATGGCC ATTTGATTTT 900

TCTATAGTGG AATACTTGTC AAAATTGAAC TTTGTATCTT CTCGGCAAGT GCTTGATTAA 96 0 
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AGTCATATAT GCGTTTATCT AGTCTATCTA TAAATTGCTG CATATCTGCA ACTTGTTGAA 1080 

TATCCATTTG ATTAGTGGAt TgtGCGCTTG CTGTTGCAAT TGTGGTAGCT TTTCATTTTC 114 0 

TAATTGCAAC TTTTTTTGcT GTGCAGCAAT GATATGCAAT GATAAGTCAT CAAAGTATTG 1200

TTTGTTTTTA TCATATAGCG TATCTAATAA TTCAATATCT CTTGTTAAAT GTGTTTGATG 12 6 0 

TTTCTGCAGT TGAATCGTTA TGCGATCGAC TTGAGCACTA ACTGATTGCA TTCTTGAAAA 1320 


GATTTCATTG ATAGACGACT TTGCTCTGCT AAAAATTCTT TTTAACATAG ATGGTTTATC 13 8 0 

AGTATTTAAC TCATTTGGAT TAACTGACTT TAGTTTTGAC ATTAGATCTG ACAAAGTATC 14 4 0 

TCCAATAGGA CCAACATCTT TACTTTGTAC TTCATCCAAC ATTTGATGTG AAAATTGAGA 150 0 

CATTTGTTTC TGkAAATCAG AACCAAACGC TAATAAACCT TCATTGTCTA AAGGGTTAAT 156 0 

TTGTTTACTG ATTGTGTCTA CCTGTTTTTG TTGTTCAATT 16 00 
(2) INFORMATION FOR SEQ ID NO: 373:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1227 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 373: 

ATGTTGATAA TGGTAGCTTC TACAAGAATA AAGACCAACA AGTTGGTGCA ACAATTCTTG 6 0 

aTAGTAAAAC TGGTGGTTTA GTTGCTATAT CTGGTGGACG TGATTTCAAA GACGTCGTTA 12 0 

ACAGAAACCA AGCAACAGAT CCTCACCCTA CTGGTTCATC TTTAAAACCT TTCTTAGCGT 180 

ATGGACCTGC CATTGAAAAT ATGAAATGGG CAACAAACCA TGCGATTCAA GATGAATCTT 24 0 

CATATCAAGT TGATGGTTCT ACATTTAGAA ACTATGATAC GAAGAGTCAC GGTACTGTAT 3 00 

CTATTTATGA TGCTTTACGA CAAAGTTTCA ATATCCCAGC TTTAAAAGCT TGGCAATCAG 360 

TTAAGCAAAA TGCTGGTAAT GATGCACCTA AGAAATTCGC TGCCAAACTT GGCTTAAACT 420 

ACGAAGGCGA TATTGGTCCA TCTGAAGTAC TTGGTGGTTC TGCTTCAGAA TTCTCACCAA 480

CACAATTAGC ATCAGCATTT GCTGCAATCG CTAACGGTGG TACTTATAAC AACGCGCATT 54 0 

CAATTCAAAA AGTAGTTACT CGTGATGGTG AAACAATCGA ATACGATCAT ACTAGCCATA 6 00 

AAGCGATGAG TGATTACACT GCATACATGT TAGCTGAGAT GCTAAAAGGT ACATTTAAAC 6 60 

CATATGGTTC TGCATATGGC CATGGTGTAT CTGGAGTAAA TATGGGTGCT AAGACAGGTA 720 

CTGGTACTTA CGGTGCTGAA ACTTATTCAC AATATAATTT ACCTGATAAT GCAGCGAAAG 7 80 
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AAGTTAAACA ATATGGTGAA AACTCATTTG TrGGACATAG CCAACAAGAA TATCCACAGT 900 

TCTTATATGA AAATGTGATG TCAAAAATTT CATCTAGAGA TGGCGAAGAC TTTAAACGTC 960 

CTAGCTCAGT AAGTGGTAGT ATCCCATCAA TCAATGITTC TGGTAGTCAA GATAACAACA 1020 

CTACAAATCG TAGTACACAC GGTGGTAGTG ACACATCAGC AAACAGCAGT GGTACTGCAC 1080 

AATCAAATAA CAATACTAGA TCTCAACAAT CTAGAAACAG CGGTGGATTA ACAGGTATAT 114 0 

TCAACTAATC CACTCAACAT AAAATCCTCA GTTATACCAT ATTTATGGTG TAGCCGAGGA 1200 

TTTTnTTAGG TTCTTCATCT TTTATGG 12 27 
(2) INFORMATION FOR SEQ ID NO: 374: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1953 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 374: 
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CCATATGGtG CAACATTCTT CGTATTTAGT GATTATTTAA AACCAGCGTT ACGTTTATCA 60

TCAATTATGG GATTAAATgC aACGTTCATC TTCACACATG AyTCaATTGC AGTAGGTGAA 120

GATGGTCCTA CTCATGAACC AATTGAGCAA TTAGCTGGAT TAAGAGCCAT TCCAAATATG 180

AATGTTATCC GTCCTGCTGA TGGTAATGAA ACAAGAGTAG CATGGGAAGT TGCCTTAGAA 240

TCTGAATCTA CACCTACTTC ATTAGTATTG ACACGTCAAA ACTTACCGGT ATTAGATGTA 300

CCAGAAGATG TAGTTGAAGA AGGCGTTCGA AAAGGTGCCT ATACAGTTTA TGGCTCTGAA 360

GAGACACCAG AATTCCTATT ATTAGCTTCA GGTTCAGAAG TTAGTCTTGC AGTTGAAGCT 420

GCTAAAGATC TTGAAAAACA AGGTAAATCA GTACGTGTTG TTTCAATGCC TAACTGGAAT 480

GCATTTGAAC AACAATCTGA AGAATATAAA GAATCAGTTA TTCCATCAAG CGTAACAAAA 540

CGTGTTGCGA TTGAAATGGC TTCACCGCTT GGATGGCATA AATATGTAGG TACTGCAGGT 600

AAAGTTATTG CTATTGACGG CTTTGGCGCA AGTGCACCTG GCGATTTAGT AGTTGAAAAA 660

TATGGATTTA CAAAAGAAAA TATCTTAAAC CAAGTTATGA GCTTATAAGA ATAATTTATA 720

AAGCGAGTAT GTTTAGAAGT CTAGGATGCA TAATCTTAGG CTT C TTT*^ AA AGTGTTGAAA 780

TTTAGAGTAT AGCACTTAAA CTACATCATA AGTGATAAGT TATGAAAGTA TACTATTTCA 840

GATTAATCTT TAAAAGCTCT GTTATAACAG CATGATTTTT GATATTATTT TTAGTATCGA 900

-^ATTAAAATA CTTGAATAAA CTAGTTCTTG AAATAATGTG ATGAATTTAG TAAAATTCAG 960
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TGCATTAATT TTAGGTTTAA TTGGAGGTTT CCTTTTAGCT AGAAAATATA TGATGGACTA 1080

CTTGAAGAAA AACCCACCAA TCAACGAAGA AATGCTTCGT ATGATGATGA TGCAAATGGG 1140

TCAAAAACCT TCTCAGAAGA AAATTAATCA AATGATGACG ATGATGAATA AAAATATGGA 1200

TCAAAATATG AAGAGTGCGA AAAAGTAAAT TCGCAATTGA TAGAGGCTAT TTTCCAGATA 1260

TGGAAATGGC CTCTTTTTAT AATCAAATTA ATAAGAATAA ATATGTTTAT TAAAATTAAA 1320

GTTAACAAAA TGACGAATAG ACTGAGAAAT GCTATAATTC ATTTTGTATG ATTTACAGAG 1380

AGTTTATTTA ACGAGAAGGT GTCyGCGTGC TCTATTTAAT ATTTTCAATC ATTGTAGCTT 1440

TATTTATGGG AACTATAGTT ATAGTTATTC GTATGAAAGC TCAAAATTAT CCGGTAAAkG 1500

AGAAAAAAAT AGTTTTGCCa CCgTTTTTTA tGGCgACCGG TGCATTGATG TACGTCGTTC 1560

CaTATTTTAG GCTAACAGGA TCGGAAATGC TAGAAGCCTT TATAATTGGT TTGCTTTTTT 1620

CtACAGTTCT AATTTGGACT TCTCGATTTG AAGTCAAAGG TACAGAAATT TATATGAAAC 1680

GATCTAAAGC ATTTCCAGTT ATTTTGATTT CATTACTTAT CATTCGTACT GTGATGAAAA 1740

TATTCATTAG TAATGAAATA GATCCTGGAG AATTAGGCGG CATGTTCTTT TTATTAGCAT 1800

TCTGTATGAT TGTTCCTTGG AGAGCAGCAA TGCTATATAA ATACAAAAAA CTAAAGAAAA 1860

CATTAATCAA TTAATTACTT TTaAAACCAC TTGTGATCGA CTTCTAAATC AGTCAATGAG 1920

TGGGTTTAAT nTTACTTGGA AAAGGnGGAA AGG 1953



(2) INFORMATION FOR SEQ ID NO: 375:



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3787 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 375: 
ACATTTGATC AAnTTATCGA CATTAAAGAT GAATTCAnTT GATCGTnTCA ATGATTATCC 60 

TGTTGAAGTA GCACGTTTGC TTGATATAGT GGAmATaAAA GTACACGCAT TACATTCAGG 120 

TATCcACGTT GATTAAAGAT AAAGGGAAAA TAATTGATAT TCATTTATCT GTAAAAGCCA 180 

CTGAAAATAT TGATGGCGAA GTGCTGTTCA AAGCAACACA ACCTTTAGGT AGAACAATGA 24 0 

AGGTTGGTGT TCAAAATAAT GCAATGrCAA TTACTTTAAC GAAACAAAAT CAATGGCTTG 3 00 

ATAGTTTGAA GTTTTTAGTT AAGTGCATTG AAGAAAGTAT GAGAATCAGT GATGAAGCAT 3 60 

AAAGAAGCAT TTAATGGCGT TGTCGTGTTA ACTGCTGCAT TAATTGTCAT TAAAATTCTG 420 
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CAACAAGTGT ATCCAATTGT AGCATTAGGA ATGATATTAT CGATGAATGC CATTCCTAGT 54 0 

GCAATTACAC AAAATATAGG GAAGTATCAT AGTGACGAAG CATATGCAAA AGCAGTCGCT 600 

TATATACAAT TAGTTGGTAT ATTATTATTT ATTGCTATTT TTGTGTTTGC GAACAATATT 660 

GCACATATGA TGGGTGATGG CCATTTAACA CCAATGATTC AAGCTGCAAG TTTAAGCTTT 72 0 

ATATTTATAG GTATGCTTGG CGTGTTAAGA GGTTATTATC AATCTGCAAA TAATATGACA 78 0 

GTTCCGGCTA TTTCCCAGGT TATAGAACAA GTTATACGAG TAGGTATTAT CATTGTTACT 84 0 

ATTGTTATTT TTGTAGACAG AGGTTGGACG ATATATGAAG CGGGAACAAT TGCTATTTTA 900 

GCATCAACGA TAGGTTTTTT AGGTTCTTCA ATTTATTTAG TAGCGCACCG ACCTTTTAAG 960 

TTTAAAATGG TAAATAACAC TGCAAAGATC GTTTGGAAAC AGTTCGCACT TTCGGTTTTG 1020 

ATTTTCGCTA TCAGTCAATT AATCGTAATT TTATGGCAAG TGATTGATAG TGTTACTATT 1080 

ATTAAGTCAC TTCAAGCGAT ACGCGTGCCA TTCGATGTTG CCATAACTGA AAAAGGAGTC 1140 

TATGACCGTG GTGCATCATT TATTCAGATG GGATTGATTG TAACTACAAC ATTTAGTTTT 1200 

GCGCTCATTC CTCTGTTAAG TGACGCAATC AAAATGAATA ATCAGGTACT TATGAATCGT 12 60 

TATGCAAATG CGTCATTAAA GATTACGATT TTAATAAGTA CAGCAGCGGG AATAGGATTA 1320

ATTAATTTAT TGCCTTTAAT GAACGGTGTG TTTTTTAAGA CGAATGATTT AACCTTAACG 13 80 

TTAAGTGTTT ATATGATTAC GGTCATTTGT GTATCGTTAA TTATGATGGA TATGGCATTA 144 0 

TTACAAGCGC AACATGCTGT GAGACCTATT TTTGTTGGTA TGACGGCAGG ATTGGTTATT 1500 

AAATTTATAC TTAATATCAT TTTGATTCGT TTAAGTGGCA TTATTGGTGC GAGCATTAGT 1560 

ACTGTTGTAT CATTAATTAT ATTCGGTACG ATTATCCATA TTGCTGTCAC GAGAAAATAC 1620 

CACTTATATG CGATGAGACG ATTTTTTATC AATGTTGTTT TAGGTATGGT ATTTATGTCG 16 80 

ATTGTTGTTC AATGCGTGTT AAACATAGTG ACAACACACG GTAGAATCAC TGGACTCATT 174 0 

GAATTATTAT GTGCAGCAGT ATTAGGTATC ATTGCATTGT TTTTCTATAT TTTTAGATTT 18 00 

AATGTTTTGA CATATAAAGA GTTAACTTAT TTACCATTTG GTTCAAAGTT GTATCAAATT 18 60 

AAGAAAGGAA GACGTTGATG GCACATACCA TTACGATTGT TGGCTTAGGA AACTATGGCA 192 0 

TTGATGATTT GCCGCTAGGG ATATATAAAT TTTTAAAGAC ACAAGATAAA GTTTATGCAA 19 80 

GAACGTTAGA TCATCCAGTT ATAGAATCAT TGCAAGATGA ATTAACATTT CAGAGTTTTG 2 04 0 

ACCATGTTTA TGAAGCACAT AACCAATTTG AAGATGTCTA TATTGATATT GTGGCGCAAT 210 0 

TGGTTGAAGC TGCTAATGAA AAAGATATTG TCTATGCGGT TCCGGGTCAT CCTAGAGTTG 2160

-T^^AGAr-AAC TACAGTGAAA TTACTGGCTT TAGCAAAGGA CAATACTGAT ATAGATGTGA 22 2 0 




ATGATGGCTT CACACTGTTA GATGCGACAT CATTACAAGA AGTAACACTT AATGTTAGAA 234 0 

CGCATACATT GATTACGCAA GTTTATAGTG CAATGGTTGC TGCTAATTTG AAAATCACTT 24 0 0 

TAATGGAACG ATATCCTGAT GATTACCCTG TTCAAATTGT CACTGGTGCA CGAAGCGATG 24 6 0 

GTGCGGATAA CGTTGTGACA TGCCCATTAT ATGAATTGGA TCATGATGAA AATGCATTCA 252 0 

ATAATTTGAC GAGTGTAITC GTACCAAAAA TCATAACATC GACATATTTG TATCATGACT 2580

TTGATTTTGC AACGGAAGTG ATTGATACTT TAGTTGATGA AGATAAAGGT TGTCCATGGG 264 0 

ATAAAGTGCA AACGCaTGmA AcgCTAAAGC GTTATTTACT TGAAGAAACA TTTGAATTGT 27 00 

TCGAAGCTAT TGACAATGAA GATGATTGGC ATATGATTGA AGAACTAGGA GATATTTTAT 2760 

TACAAGTGTT ATTGCATACT AGTATTGGTA AAAAAGAAGG GTATATCGAC ATTAAAGAAG 2 82 0 

TGATTACAAG TCTTAATGCT AAAATGATTC GTAGACACCC ACACATATTT GGTGATGCCA 2 8 80 

ATGCTGAAAC TATCGATGAC TTAAAAGAAA TTTGGTCTAA GGCGAAAGAT GCTGAAGGTA 294 0 

AACAGCCAAG AGTTAAATTT GAAAAAGTAT TTGCAGAGCA TTTTTTAAAT TTATATGAGA 3 0 00 

AGACGAAGGA TAAGTCATTT GATGAGGCCG CGTTAAAGCA GTGGCTAGAA AAAGGGGAGA 3 06 0 

GTAATACATG AGATTAGATA AATATTTAAA AGTATCACGG TTAATAAAGC GACGTACGCT 3120

AGCAAAAGAA GTAAGTGATC AAGGTAGAAT TACAATAAAT GGTAATGTTG CTAAAGCTGG 318 0 

ATCGGATGTT AAAGTTGAAG ATGTGCTGAC GATTCGCTTT GGTCAAAAAT TAGTAACAGT 3 24 0 

TAAAGTAACT GCATTAAATG AACATGCATC TAAAGATAAC GCGAAGGGTA TGTATGAAAT 3 3 00 

CATTGAAGAG CGTCGACTTG AAGAAGCGTA AATTGGAGGT GACAAGCAAT GAAAAATAAA 33 6 0 

GTAGAACATA TAGAAAATCA GTACACGTCG CAAGAGAACA AGAAAAAACA ACGTCAAAAA 3420 

ATGAAAATGC GTGTTGTTCg TAGGCGTATT ACAGTATTTG CGGGCGTATT aCTTGCGATA 34 8 0 

ATTGTTGTTT TATCaATCTT GCTTGTTGTC CAAAAACATC GCAATGATAT TGATGCACAG 354 0 

GAGCGAAAAG CGAAAGAAGC ACAGTTTCAA AAGCAACAAA ATGAAGAAAT TGCGTTAAAA 36 00 

GAAAAGTTGA ATAATCTGAA TGACAAAGAT TACATTGAAA AAATTGCGCG TGATGATTAT 3 6 60 

TACTTAAGCA ACAAAGGTGA AGTGATTTTT AGGTTGCCAG AAGACAAAGA TTCGTCTAGC 3 720 

TCAAAATCTT CGAAAAAATA AATCCAAATT GATTCAAAAT TATCCGAGTA TAGACATTGT 3 7 80 

GAAAAAA 3787 
(2) INFORMATION FOR SEQ ID NO: 376: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1644 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 376: 
TAAACCATTT CAAACTGAGG AACGCnAAGA CGGsACgTTT CCAGATTTAG AAGTArTTAA 
;^TGAATGT GATTTAAGCT ATGACATAAC GTCACrTTAT ACrrTTAAGC AACCTGTATC 
;.CCACACCrr GCATTTAAAA TGACAGATCA AATTTrrCTA AATAAGCAGC GTCTATTAGA 
TAAOGTAAAA GrTTTAGATA AGGAATTTGA rrTTATCTTA ATTGAGGG.^ CTOGGGGAAT 
TOCCGTACCA ATATATGAAG GTACAGATGA TTTCTACATG ACTAAAGATC TAATCAATGA 
rrGTGCAGAT TGTGTCATC^ GTGTGTTGCC ATCAAAATTA GGTGCTATTA GCGATGCCAT 
TorrCACCAA GArrATGTTA ATCAGAATGT ATCGGCGAGT AATTrrrrAA TAATGAATCG 
CTATACAGAC AGCTATATTG AAAAAGACAA TCAAATGACG ATIGGAAAAT TAACAAATAA 
;^CAGTCTAT ACATTTGAAG AACATGCCAC GTATGAAAAT rrCTCAOAAO CArmTAAA 
ACAATTAATA GGAGTTAAAA ATGAATTACA CACAACAACT TAAACAAAAA GACTCAC-.J^-T 
ATGTTTGGCA TCCATTTAC^ CAAAXGGGTG TATATAGCAA AGAAGAAGCA ATCATCATTG 
;^GGAAA GGGTAGTTAC CTTTACGATA CGAATGGCaA TAAATArTTA GATGGrTATG 
CATCGTTGTG GGTC^TGTG CATGGTCATA ATAACAAATA CT^TAAG GTAATTAAAA 
AGCAACTCAA TAAAATTCCC C^TTCTACGC T^CTAOGATC ATCAAATATT CCGTCAATAG 
AACTTGCGGA AAAATTAATC GAAATCACGC CAAGTAATCT AAGAAAAGTA TrTTATTCTG 
ATACAGGCAG TGCGTCTGTT GAAATCGCAA TAAAGATGGC ATATCAGTAT TGGAAAAATA 
rXGATAGAGA AAAATATGCC AAGAAAAACA AGTTTATAAC GCTAAATCAC GGTTATCATG 
CGGATACGAT TGGTGCGGTA AGTGTTGGTG GTATC^GAC CTTTCATAAA ATATTTAAAG 
ACrrAATATT TGAGAATATT CAAGTAGAAA GCCCATCTTT CTATCGCAGT AATTACGATA 
CTGAAAATGA AATGATGACA GCTATmAA CGAATATAGA GCAAATTCTA ATTGAAAGAA 
ATGATGAAAT CGCAGGCTTT ATATTGGAAC CGrrGATTCA AGGTGCGACA GGCTTGrrTG 
TTCATCCTAA AGGCTTTTTG AAAGAAGTCG AGAAATTGTG CAAAAAATAC GATGTCTTAT 
TAATOTGA TGAGGTAGCA GrrGGrTTTG GGAGAACTGG AAAGATGTTT GCATGCAATC 
.TGAAGATGT TCAACCGGAT ATTATGTGI. TAOGTAAGOC GATTAC.GGT OGCTACTTAC 
OVCrrGCAGC TACATIGACA TCTAAAAAAA TATACAATGC ATTrrrAAGT GATTCGCATG 
OTGTGAATAC CrTTTTCCAr GGTCaTACAT ACaCCGGAAA TCAAATcGTT TGTaCGGrTG 
..rrar-aAAA TATaAGaCTT TATGaAAAAC GTaAGTtnAT TGTgCACATa TTGaAACGaC 
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(2) INFORMATION FOR SEQ ID NO: 377: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 431 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 377: 
ATGATTTTtA aAAATCATTA AGTTAAGGTr GATACACATC TTGTCATATG ATCAAATGGT 60 

TTCGCCAAAA ATCAATAATC AGACAACAAA ATGTGCGAAC TCGATATTTT ACACGACTCT 12 0 

CTTTACCAAT TCTGCCCCGA ATTACACTTA AAACGACTCA ACAGCTTAAC GTTGGCTTGC 180 

CACGCtnTTAC TTGACTGTAA AACTCTCACT CTTACCGAAC TTGGCCGTAA CCTGCCAACC 24 0 

AAAGCGAGAA CAAAACATAA CATCAAACGA ATCGACCGAT TGTTAGGTAA TCGTCACCTC 3 00 

CACAAAGAGC GACTCGCTGT ATACCGTTGG CATGCTAGCT TTATCTGTTC GGGCAATACG 3 60 

aTGCCCATTG TACTTGTTGA CTGGTCTGat ATCcGTGAGC AAAAACGGCT TATGGTnTTG 42 0 

CGAGCTTCAG T 431

(2) INFORMATION FOR SEQ ID NO: 378:



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2006 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 378: 

TTTnTATAAC GTATTATAAA TCGTTAAAAA TTTTGGTTGT GTTTGCGTCA CGTAGACAAC 60 

CTCCATAAAG TTACTTAATC ACTCTCATCA TACAATAATT TTTACTCAAA TTGGAAnAAT 12 0 

TATAAAAATT AAATATAGAT AGGCTTTGAA AATTAGmT ATACAAGGTT AGTAGCTGTA 180 

ACTGTAAAAT GTTCTTAATA TTGTCAAAAT GTAATGCTTG AAAGCGCTTT TAAAaAATAT 24 0 

TATTATATAC ATGGTTAGAC AAATAGACAA ATCACTATAC AAATATTGGG AGGAATATTT 3 00 

TATGAAATCA ACACCACACA TTAAACCAAT GAATGACGTC GAAATTGCAG AAACGGTTCT 36 0 

ATTGCCAGGA GATCCGTTAA GAGCTAAGTT CATTGCAGAA ACTTATTTGG ATGATGTGGA 420 

ACAGTTCAAT ACAGTGCGAA ACATGTTTGG TTTTACCGGA ACATATAAAG GTAAAAAAGT 480

TTCTGTCATG GGTTCAGGTA TGkGTATGCC ATCTATTGGC ATTTACTCTT ATGAATTAAT 54 0 
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CATTGATTTA TATGATGTGA TTaTTkCACA AGGTGCCTCT ACTGATTCAA ATTACGTTCA 660 

ACAATATCAA TTACCAGGTC ATTTTGCGCC AATTGCTTCT TATCAATTAT TAGAAAAAGC 720 

AGTTGAAACA GCACGTGACA AAGGTGTACG TCATCATGTA GGTAATGTGT TATCAAGTGA 780 

TATTTTCTAT AACGCGGATA CAACAGCGAG TGAACGTTGG ATGCGTATGG GTATTTTAGG 84 0 

TGTAGAAATG GAATCaGCTG CaTTATACAT GaATGCAaTT TACGCTGGTG TCGAAGOVTT 90 0 

AGGTGTGTTC ACAGTGAGCG ATCATTTAAT TCATGAAACG TCAACAAcAC CTGAGGAAAG 960 

GGAACGTGCA TTTaCAGATA TGATTGAAAT TGCACTGTCA TTGGTGTAGA TGATTATGAA 102 0 

TGTTGAATAT TCTAAAATAA AGAAAGCAGT ACCTATTTTA TTATTCTTAT TTGTATTCAG 1080 

TTTGGTTATA GACAACTCAT TTAAATTGAT TTCTGTAGCC ATTGCTGATG ACTTAAACAT 1140 

ATCTGTAACG ACAGTAAGTT GGCAAGCGAC ATTAGCCGGT TTAGTAATTG GTATTGGCGC 1200 

TGTAGTATAC GCTTCATTAT CTGATGCCAT TAGTATACGC ACACTATTTA TTTATGGCGT 126 0 

GATATTAATC ATTATCGGAT CAATI'ATTGG TTACATTTTC CAACATCAAT TCCCATTACT 132 0 

TTTAGTTGGA CGTATTATTC AAACTGCCGG TTTAGCTGCT GCAGAGACAT TATATGTGAT 13 80 

ATATGTTGCA AAGTATCTTT CTAAAGAGGA CCAGAAGACT TACCTTGGCT TAAGTACGAG 144 0 

CAGTTATTCC TTGTCATTAG TTATCGGTAC ATTATCAGGT GGATTTATTT CTACGTATTT 1500 

ACACTGGACA AATATGTTTT TAATTGCATT AATCGTAGTA TTTACGTTGC CATTCCTATT 1560 

TAAATTATTA CCAAAAGAAA ATAATACGAA TAAAGCTCAT TTAGATTTTG TTGGCTTAAT 162 0 

TCTAGTGGCA ACTATTGCTA CAACAGTCAT GCTGTTTATT ACQAACTTTA ATTGGTTATA 16 80 

TATGATTGGT GCCTTAATTG CGATTATCGT TTTTGCGCTA TATATTAAAA ATGCGCAACG 174 0 

TCCATTAGTA AATAAATCAT TTTTCCAAAA TAAACGTTAT GCTTCATTTT TATTTATAGT 1800

ATTTGTAATG TATGCTATCC AATTGGGTTA TATTTTTACG TTCCCATTCA TAATGGAGCA 186 0 

AATTTATCAT CTGcAACTAG ACACAACATC ACTGTTATTA GTACCGGGgT TaTATAGTAG 192 0 

CAGTCATTGT TGGtGgCACT AAGTGGgTtA AAATCGGgCG rAATATCTGG AATTCCAAAA 1980 

CCAAGCGGAT TATCACAGCC AATTAA 2006
(2) INFORMATION FOR SEQ ID NO: 379:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 4858 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear
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TGGGGAAAAA AAGACCCAGC AGTATTAGAA GAATCGTTAA ATATTTCTAT TGAAGAAATG 






AATCGTATCA TAAAATTAGT CGAAGAATTA CTTGAATTGA CTAAAGGAGA TGTAAATGAC 12 0 

ATTTCTTCTG AAGCACAGAC CGTGCATATT AATGATGAAA TTCGCTCGCG AATACACTCA 180 

TTAAAACAAT TGCATCCTGA TTATCAATTT GATACGGATC TGACATCTAA AAATCTAGAA 24 0 

ATTAAAATGA AACCTCATCA ATTCGAACAA TTATTTTTAA TCTTTATTGA TAATGCAATC 3 00 

AAATATGATG TGAAGAATAA GAAAATTAAA GTTAAGACAA GGTTAAAAAA TAAGCAAAAA 360 

ATAATTGAAA TTACAGATCA TGGAATTGGT ATTCCAGAGG AAGATCAAGA TTTCATTTTT 4 20 

GATCGCTTTT ATCgAGTGGA TAAATCTCGT TCAAGAAGTC AAGGCGGTAA TGGACTCGGA 480

TTATCTATTG CTCAAAAAAT CATTCAATTA AACGGAGGAT CGATTAAAAT TAAAAGTGAA 54 0 

ATTAACAAAG GAACAACGTT TAAAATCATA TTTTAATCAT GACTGAGACG TCAATCAAAG 60 0 

TCATAGGATC AATTTTTTAA GTACACATTA GCTGTGACTA ATGTATAAGA ACAACTATAA 66 0 

AACAAATAAA CAGTGGTTCT TTATCATTTC TGTTGTACTC CCAAAATTTA CAATAAAATA 72 0 

CATCTATAAA CCTAGAAGAA TCAACGCTTT TGTTGATTCT TCTTTTTAGC AGATAAATAG 780 

GTAAATCTAC TTTAACAAAT AACTAAATAG TGATATTATT ACATTGTAAG CGTTTCAACA 84 0 

TTTTTGTGGA GGGTGTAAAA TGACTAACGA AAGAAAAGAA GTTTCAGAGG CTCCTGTAAA 900 

CTTCGGTGCG AATTTAGGTC TAATGTTAGA TCTATATGAT GACTTTTTAC AAGATCCATC 96 0 

ATCTGTACCA GAAGATTTAC AAGTCTTATT CAGCACAATT AAGAATGATG ACTCAATTGT 1020 

ACCAGCTTTA AAAAGTACAA GTAGTCAAAA TAGCGACGGC ACAATTAAGC GTGTCATGCG 108 0 

TTTAATTGAT AATATTCGCC AATACGGGCA TCTTAAAGCC GATATTTATC CTGTAAATCC 114 0 

TCCAAAAAGG AAACATGTAC CTAAATTAGA GATTGAAGAC TTTGATTTAG ATCAACAGAC 12 0 0 

TTTGGAAGGT ATATCAGCAG GAATTGTTTC AGATCACTTT GCCGACATTT ATGATAATGC 126 0 

TTATGAAGCA ATTTTAAGAA TGGAAAAACG TTACAAAGGA CCAATTGCAT TTGAGTATAC 132 0 

ACATATTAAT AACAATACCG AACGTGGTTG GTTAAAAAGA AGAATTGAAA CGCCATATAA 13 8 0 

AGTAACGTTA AATAATAACG AAAAAAGGGC ACTATTCAAA CAATTAGCGT ATGTTGAAGG 144 0 

GTTTGAAAAA TATCTTCATA AAAACTTCGT TGGTGCAAAG CGTTTTTCAA TTGAAGGGGT 1500 

AGACGCACTT GTACCGATGT TACAACGTAC TATTACGATT GCTGCGAAAG AAGGTATTAA 156 0 

AAATATACAA ATAGGCATGG CTCACCGTGG ACGTTTAAAC GTTTTAACGC ATGTCTTAGA 162 0 

AAAACCGTAC GAAATGATGA TTTCAGAATT TATGCATACA GATCCAATGA AATTCTTACC 16 80 

TGAAGATGGT AGCTTGCAGT TAACTGCTGG ATGGACTGGT GATGTGAAAT ATCACCTTGG 174 0 
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AAGTCACTTG GAAATTGTTG CACCTOTTGT TGAGGGGCGT ACGAGAGCAG CACAAGATGA 1860

TACACAACGA GCTGGGGCTC CGACGACTGA TCATGATAAA GCAATGCCAA TTATTATACA 1920

TGGCGATGCT GCTTATCCTG GTCAAGGAAT TAACTTCGAA ACAATGAACT TAGGAAACTT 1980

GAAAGGCTAT TCTACGGGTG GTTCATTGCA TATTATTACT AACAATAGAA TTGGATTTAC 2040

TACAGAACCA ATTGATGCAC GTTCAACAAC TTATTCTACA GATGTGGCCA AAGGTTATGA 2100

TGTGCCAATA TTCCATGTCA ATGCAGATGA CGTTGAAGCT ACTATTGAAG CAATTGATAT 2160

TGCAATGGAA TTTAGAAAAG AGTTTCATAA AGACGTCGTT ATTGATTTAG TAGGTTATCG 2220

TCGTTTCGGA CATAACGAAA TGGATGAACC ATCAATTACT AATCCaGTTC CTTATCAGAA 2280

TATTCGCAAA CATGACTCTG TTGAATATGT GTTTGGTAAA AAGCTTGTTA ATGAAGGTGT 2340

CATTTCAGAA GATGAAATGC ATTCATTTAT AGAACAAGTC CAAAAGGAAC TAAGACAAGC 2400

TCATGATAAA ATTAATAAAG CTGATAAAAT GGATAATCCA GATATGGAAA AGCCTGCAGA 2460

TCTTGCATTA CCGTTACAAG CAGACGAACA ATCATTTACT TTTGATCACT TGAAAGAAAT 2520

AAATGATGCA TTGTTAACAT ATCCGGATGG CTTTAACATT TTGAAAAAGT TAAACAAAGT 2580

TCTTGAGAAG CGTCATGAGC CGTTTAATAA AGAAGATGGT TTAGTTGATT GGGCACAAGC 2640

AGAACAACTT GCATTTGCGA CAATTTTACA AGATGGTACA CCGATTCGCT TAACTGGTCA 2700

AGATAGTGAA CGTGGTACAT TCAGTCATAG GCATGCCGTG TTACATGATG AGCAAACAGG 2760

TGAAACATAT ACACCTTTAC ATCATGTTCC TGATCAAAAA GCGACATTTG ATATACACAA 2820

TTCTCCGCTT TCAGAAGCAG CAGTAGTTGG TTTTGAATAC GGCTATAATG TGGAAAACAA 2880

AAAAAGCTTC AATATTTGGG AAGCACAATA TGGTGATTTT GCAAATATGT CACAAATGAT 2940

TTTTGACAAC TTCTTATTCA GTTCTCGCTC AAAATGGGGA GAACGTTCAG GATTAACATT 3000

ATTCTTACCT CATGCATATG AGGGTCAAGG GCCTGAACAT TCATCAGCAA GATTAGAGCG 3060

ATTTTTACAA TTAGCTGCTG AAAATAATTG CACAGTTGTC AACTTATCTA GTTCAAGTAA 3120

TTATTTCCAC TTATTGCGTG CACAAGCGGC TAGTTTAGAT TCTGAACAAA TGCGACCATT 3180

GGTTGTTATG TCACCAAAAA GCTTACTGAG AAATAAAACA GTTGCAAAAC CAATTGATGA 3240

ATTTACTTCT GGTGCATTTG AGCCAATTTT GACAGAATCA TATCAAGCGG ATAAGGTTAC 3300

AAAAGTTATT TTGGCAACTG GTAAAATGTT CATTGATTTA AAAGAAGCAT TAGCTAAAAA 3360

TCCAGACGAA TCAGTATTAC TCGTTGCGAT TGAAAGATTG TATCCATTCC CAGAGGAAGA 3420

GATTGAAGCA TTACTAGCAC AATTGCCAAA CCTTGAAGAA GTGTCATGGG TACAAGAAGA 3480

' * ' ' ■ ■-^f. -^^^/^^r.nr. 1 c 4 n 
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AGAAATTCAT AAACTTGTTC AAAATAAAAT TATAGAAAAT GCATTAAAAA ATAACTAGGG 3 66 0 

GGAAATAAGT CATGCCAGAG GTTAAAGTTC CAGAATTAGC AGAATCTATT ACAGAAGGTA 3 7 20 

CCATTGCAGA ATGGTTGAAA AACGTAGGGG ATAGCGTAGA AAAAGGTGAA GCTATTCTTG 3780

AATTAGAAAC TGATAAAGTT AATGTCGAAG TTGTATCTGA AGAAGCAGGT GTATTATCTG 3 84 0 

AACAACTTGC AAGTGAAGGC GACACTGTAG AAGTTGGACA rGCAATTGCT ATCATCGGCG 3 900 


AAGGTAGTGG CAATGCTTCT AAAGAAAATA GTAACGACAA TACTCCACAA CAAAATGAAG 3 9 60 

AAACAAATAA TAAAAAAGAA GAAACAACAA ATAATTCGGT AGATAAAGCT GAAGTAAATC 4 020 

AAGCAAATGA TGACAATCAG CAACGTATTA ATGctACGCC TTCTGCGCGT CGATATGCTC 4 080 

GTGAAAATGG TGTGAATCTT GCTGAAGTAA GTCCGAAAAC AAATGATGTG GTTCGTAAAG 414 0 

AAGATATTGA TAAGAAACAA CAGGCACCGG CATCAACACA AACAACACAA CAAGCATCTG 4 2 00 

CAAAAGAAGA GAAAAAATAC AATCAATATC CTACAAAACC AGTGATTCGT GAAAAAATGT 4260

CACGTAGAAA GAAAACAGCT GCCAAAAAAT TATTAGAGGT ATCTAATAAT ACAGCTATGT 432 0 

TAACAACATT TAACGAAtTG AcATGACAAA TGTTATGGAA TTGCGTAAAC GTAAGAAAGA 43 80 


ACAATTTATG AAAGATCATG ATGGTACTAA ATTAGGATTT ATGTCATTCT TTACTAAAGC 444 0 

TTCTGTAGCA GCTTTGAAAA AGTATCCAGA AGTTAATGCA GAAATCGACG GCGACGACAT 4 500 

GATTACGAAA CAATATTATG ATATTGGTGT AGCTGTTTCT ACAGATGATG GATTATTAGT 4 56 0 


ACCATTTGTA AGAGATTGTG ATAAAAAGAA TTTTGCAGAA ATCGAAGCAG AAATTGcTAA 4 62 0 

TTTAGCAGTT AAAGCaCGAG AGAAAAAACT TGGCTTAGAT GATATGGTTA ATGGTTCATT 4 680 

TACGATTACA AATGGCGGTA TTTTTGGATC AATGATGAGT ACGCCAATTA TCAATGGTAA 4 74 0 

TCAAGCTGCA ATCTTAGGCA TGCATTCAAT TATTACAAGA CCAATTGCGA TTGATCAAGA 4 800 

TACAATCGAA AATCGTCCAA TGATGTATAT TGCATTAAGC TATGATCATA GAATTATT 4 858 
(2) INFORMATION FOR SEQ ID NO: 380:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2222 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 380:

ATCAGTCACA CGGTAGGCAT ATAAAATGAG TCGTTTCTAC AACATTTTTA AACAGTTCAT 6 0 

TCAATATTAT TTTTATCTAA TAATnATATT GGGAGGATTA TACCTTTATA CACACCATGC 12 0 
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ATGCTATTTA 
GCGATACTTA 
TATCATCGGT 
ACTAAAGGTG 
GTGAGTTGGA 
CTTGTTACXX3 
AGACCAACTG 
GAAGGTAACA 
CTGTACATTT 
ACATTGTGGT 
ATAATACTGT 
GGTTATGTAC 
ACATTAACGC 
TTACTTGCTG 
TTAATTGTTT 
ATGCTTTCGA 
TTATATACAA 
GGTTTATCAG 
GGTGTAGCAC 
GGTTTAGTTG 
TAATTAACAG 
TAGTAATATC 
AAATTCAGTA 
GAAAGGAGTG 
GAGGCGTTGA 
TACTTAAAAA 
TTAACAGAGA 
AAAATAAACA 



GCTAAAGCTA 
GTTGCAATTA 
ATAATTATTG 
AGCAAATAAA 
ATTTATTCGG 
cGTTTCTTGT 
GCAAACAAAA 
TGGCTTGGAA 
TTATAGCTAA 
GGAAATCACC 
TAACTCACTT 
AGCCGTTTTG 
TTGGTCTGCG 
GCTTATTCTT 
GGCAAGCATT 
TGGTTTATAT 
TCACAGGAGG 
CATTAGGAGC 
GTCAACCAGA 
AGGCATTACC 
ATAAAAGAGG 
TTTGAACTTT 
GGTCCTAACC 
TCATGAACCC 
GTGGGGTACT 
GTTCGCATGG 
TATCGATGAC 
AAAACTTAAA 



AAAGACCAGA 
TTGCCTGTAT 
GTTTAATGAT 
TTAAGAAAGA 
TTTTGaTATC 
TTTTCTACTT 
TTTCGTTGAA 
AAAAGGTGGT 
TATGTTAGGT 
GACAGCnGAT 
TTATGGAATT 
GCCATTGGCA 
TTTGTACGGT 
TAACGAACCA 
TTCAATATTT 
GTCACATAAA 
AAATTAAATT 
AGGTATCGGT 
AGCACGTGGT 
TATCATCGGT 
TCGGGACAAA 
ATTAAATAGT 
TCGTCGTTTT 
GTGACTGAAA 
GTGATTGTAC 
GGTCCATTGA 
GCAGAACAAG 
GAAACACAAG 



CACTATGCAT 
GATTTGGTAC 
TTCATATGTT 
GGTGAGATTA 
GTTTTCAATT 
GcTATCATTT 
TGGATTTTTG 
CAATTCCACT 
CTTCCGTTTT 
GCAACAGTGA 
AAAATGCGTG 
ATTATTAATG 
AACATATTTG 
GCATGGGGTT 
GTAGGAACAA 
GTGGCAGATG 
ATGAATTTAA 
AACGGTTTAA 
CAATTAATGG 
GTAGTAATTG 
GCGCATAGGA 
TGAGATATGA 
TTTCTATATA 
CAGCTAACTT 
AGGTCCTAAC 
AAGATGTAAT 
CTAAGTTAAA 
AAGAAGTTCA 



ATTTCAACTG 
CTTAATAAAG 
GTAGTTATCA 
TGGATCACAA 
TATCAAGTAT 
GTACGCGTAA 
ATTTCGTGAG 
TCTTAGCAGT 
CTATAGTAAC 
CTTTAACGTT 
GTACGAAACA 
TTTTTGAAGA 
CAGGTGAGAT 
GGATTATTAG 
TCCAAGCATA 
AACACTAAAA 
TCGCAGCAGC 
TCGTTTCAAG 
GTATCATGTT 
CATTCATGAC 
CATAATTCAT 
ACGCACCATG 
ACACTAGCGA 
ATTCGTTCTT 
TTTCATCGTG 
GGATAAACGT 
TGCACAGAAA 
AAAGATTTTA 
AGCAAACGTA 



GAAATATGTG 
CGCATGTAAG 
TACGTCCTTT 
ATCCCCGCTC 
ATTGATGATA 
TTTGAAAAAA 
GGGAATCATT 
AACGCTGaTT 
GAAAGATCAC 
GTCTACAACG 
ATATCTTAAA 
GTTCACTTCA 
ACTATTAACA 
TATCCCAGGA 
TATCTTTATT 
ATTTCAATAA 
AATCGCAATT 
AACAGTTGAA 
CATTGGTGTA 
ATTTGCTGGA 
GATGCATATA 
CCTATCGCAT 
TTATTTTAAT 
GGTGCAGCTG 
TTAcTTGCGT 
GAAAGAGATA 
CTTGAAGAAG 
GAAGATGCTA 
CGTGCAAACG 



240 

300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
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TTAATAATCa AGTATCTGAA CTATCAGTGT TAATTGCTTC TAAAGTTCTT AGAAAAGAAA 204 0 

TTTCTGAACA AGACCAAAAA GCATTGGTTG ACAAGTATCT AAAAGAGGCA GGCGATAAAT 2100 

AATGGTAAAA GTAGCTAACA AGTATgcTAA AGCATTATTT GACGTGTCAT TAGATACAAA 216 0 

TAATTTAGAG ACTATTAATG AAGAATTAAC AGTTATAAAT GAAGCAGTAA AAGATAAAAT 2220

TG 2222



(2) INFORMATION FOR SEQ ID NO: 381:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1143 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 381:

TAACAATTCC TaTATTCATG TTTAATACGA AACACTACAT TTACATTGTA ATTCACTATC 6 0 

TTTTGAAGTA ATAAAGTGAT TTGTTCAATC GATAGCTCAT TGCTTGTGTC GATTGTAACA 12 0 

ATTATATGCA AGTTTTCAGG ATTAACACCT AATCTTTGAA TGATTTGTTT AATAGTATAA 180

TAATCCATCC PiPiTPAAIKATT ACTTCCTTTA ATATAAATGT TTTTAGGTTG ATACATTTGA 24 0 

CTCCTTTTTG TAGGCTCAAA AGGTATATCA ATCTCGCGCA TACTTGAAGA ACTTTGATTA 300 


GTATCATCAA ATAATTCAAT TATATTTTTA TCAATTTCAA CTAATTGGGA ATGGTTAATT 36 0 

GAATGAGATG TTGGACTATA TCTTTTTCTA ATTAATTTAG GAGTGTTACC ATACGTTTCT 4 20 

TTAAAGAGGT GTATAAAACG AGAATAATGA TTAAAACCAT GACTACTTGC GATTTCTTCA 4 80 


ATAGGCTTCT TAGrAGTTAA AATATCAATT AAGCAATGCT CCAGTCTAAT ATGATTTAAA 54 0 

TATTGAATAA AATTACTATA AGGTGTCGCT TTAAACATGT CACTTAAAGC TTTGTTTGTA 600 

ATACTAACTT GATTAATGAC ATCTTTCCTA TTTATCTTTT TATGGTGGTT GTTTGTTAAA 66 0 


TAATCGTGCA CTTCTTCGGC TACTAAATGA CGACTACCAT CGTATGTATT TAATGACATT 72 0 

AATTCAACAC ACATGTAATT AATAATCTTA TCATTAGCAT TATAAGACTG TTGTTTAATT 780 

TTGCTGTAGA TTAAATACTT AATCAAGATT CTAAAAGTGC TAGCAACCTC ATTTGTTAAT 840

ATCTTGCCAC GCATCATATA GTTTGCATTC ATAAACTTCA AAAATGTCTT TGCACTTACT 9 00 

TCAACAACAC AACATACACT ATCATCATTA CCATCAATTT GATACAAATC ATTCAACATA 96 0 


ATAATGGTGA CATCATTTCT TTTTACATCA TATTGTTGTA GATTGATTGA AACTACACCT 102 0 

GTACCTTGTA GCCAATATAT AATTTTtAAG TTCGCTTCCT TCACACTACT CATTTTCATA 10 8 0 
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1143 






ATT 

(2) INFORMATION FOR SEQ ID NO: 382:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 506 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 382: 
GGACCACCrr nTTACCTGT ACGACTAAAG CCAACTGriT TAAAGGCACC AACAGCATGC
CACACITGAA TAATTTCnTC TGATGGTCTA AAGCGAACCG TATAAATTAA TGGATGGAAA

TCATCAACAA AAATATAATC GGCCTTACCA AGTAAATATG GCAATCTAAA CTTGTCTCTC 
CATTTGCGTC TATCCGTAAT ATTCTCCrTA AATACCGTIT TAATATCATA ATCAAAATCT 


ACTTTTTGGC GTAGTAACTC ATCATATACA TACTTGAAAT luuv-i^nx^^ «..v.^^ 

GAATCTGATC TGAATAATAT TGTTTTGCTT CTTTTAATAT GTAGTAACTT TGTAATATTA 
AAAATAGCrr TAAATAAGAA ACTTCTACTT TCAAATGAAG CTTTATGaCC TTOnTaTGA

AGCCAGTGTG cACTTgTCGC AATGaCCCCT GaTTTCyCyT GagGtAAGGk GaTTTCmATA 

TCAAATACAA ATTCGTTAAC GTCACT 
(2) INFORMATION FOR SEQ ID NO: 383:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 421 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 383:

AGCATCTGCA AtGAGTcTCT AATACAACGA TACGTTTTGC ATCTTTAGGT ACTTTTACTG 60

TACCATTTTC ATCTTTTACC GAAATAGTAT CTTTAGTTGA TGATTCTTTT TTACTTGAAT 120

TATCCGTATT ACCACAAGCT GCAACTAAAA GTAAGGCAAC TATTAATCCC AATATACTAA 180

AAGTTTTTAG ACCTCTCATC GTTCCACTCC TTAATATGTA TAACTTCATT TATTATTTTA 240

TTGATAACAA TTATCATTGT CAAGTAGCGT TCAATCTTTT TTATATTTCT AAAATGTATG 300

ACTATATATT TCCTCTAATA ATTATGACTA CAATTAGCAC ATTTCCTTAG ACAAAATACT 360

TCATCTTTGC ATTAATACAA TTGACACCAT TTAGCATGAC 420
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(2) INFORMATION FOR SEQ ID NO: 384:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 862 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 384:

TGCACTTGAT ACTTTAGCAC ATGAAATTGC TAATCGATTA GTTGCAAATG ACAAGAATGA 6 0 

AGCAACTTTG GAAATGACTA ATAAAATGGC AACGATTCGT TTTACAGAAC CTACGCTGAT 12 0 

TGCATTAGCA GGGGGTAATG TCAAAGCTTA CACTGAGCAT ATGACTATAT CTCCATATAA 18 0 

ATTGTATTTG TTAGATAAAG GCGATGTTTT AAAGTTTAGA GAAACAAGTT ATACATCGCG 24 0 

AGTGTATTTA GCTGTgGGAG GCGGATTTGA ATTAGATGCA TGGTTAGGAT CTAACTCAAC 3 00 

CGACTTTAAT GTAAAAATTG GTGGTTTTAA AGGTAGAACA TTACAAGATG GCGATGAAAT 3 60 

AAAGCTTAAG AGAGATTATA CAGCTCGTCA TCATAAGTTA TTTGAAAACC TTGCTCACAC 42 0 

GAAACAAACA GATTGGGGTA TTGATGGATA CGCCTTGTCA TTTAATTATA TGTCTGATGT 480

ATTTCATGTC GTTAAAAATA AAGGTACGGA AGATTTTAAA GAAGATGCCA TTCAAAGATT 54 0 

TGTGAAACAT GATTATAAAG TAACGAGCAA AGCAAATCGC ATGGGGATGA TGCTTGAAGG 600 

TGAAAAAATC AAAGCTTTTT ATGAAGATAT GCCACCGTAT CAGACTGTCA AAAAAGGAAC 660 

GATACAAATT AAGCGTGATG GCACACCTAT TATCCTATTA AATGATCATT ATACGCTAGG 72 0 

TAGCTACCCG CAAATCGGTA CAATCGCAAG TTATCATTTA ACGAAATTAG CACAAAAACC 780 

GCAAGGATCA CGTTTGAAAT TTCAATTTAT AGATATTTTA ACGGCTGAAA AGAACCTTGT 84 0 

TAAgTATAGT AACTGGTTAA AC 862 

(2) INFORMATION FOR SEQ ID NO: 385:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1027 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 385:
AAATGAGTTG ATGATGGCTC CTATTCCATC TATCTCTAGG TATGACAGAA SATAAACGGT 6 0 

CTTCTACCTG TCTsACATTA TCTTTCCAAC GATTAATACC TAAGCGTTTA GAAACACGTT 12 0 
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ATGTGTTTGT CAGTTTTACG TCCTACACCT GCTAAACTTT CTAATTCCTT GGTATTTCTC 240

ATTCTTAGCT CATTAAATTG ATCAATCAAA GATTGACAAA GTTTCTTAAT TTGTTACGAT 300

ACTGACTGCC ACAGACCGAT AGAACGAATA TCATTCATAA GTTCrTCATC AAATAATCTT 360

ATTTACTAGA CAGGCGTTTT GTATTTTTTA AACAGCTCAG TTGTTACTCT ACGTCTGTAC 420

AT7ATCATGT ATTGCGCTGA CAATAATACA GCAATAGTTA ATTCGAACGG TTTAATTCAC 480

TTCTAATGCT ATTCTGCATC CGGAAACATA TTTGCTATAA CATCAATCAT TTTTTCTTAC 540

GTACCGTTTT TTACCATCAA GGTTCTCCCC GTTTAACCAA TCAAATTTAG AACTGTGTGC 600

AATCGTCAAT GTCATTTTCG GTTTATTGAA mTTCTCTT ATTTTTCTAG TGTTTTGACA 660

TAAAGCTAAG TTGTTTTTCT TCCAATTAAG TAAAATACGA TCTATATATT TTTATTCAAA 720

CGTGTTTATC CTATTCGCCT CGTCTAATGC CGCTTGTATA ATTGCAGTAT AACATCAATC 780

TTGCAAATGT CATTGATTTA ACGTTTCTAT TTCATATGGA GATAACGGCC ATCCTCTAAA 840

CnTCGTTTC ACTCTAAATA ATTGTTTAAA TTTTTCTTTA CTATTTAGCT CATACTTTGT 900

GATTCATATA TGCTTCAATA TATGACTTAA TTTTTCGAAA AAAGGATCTA TTCGGkAAAT 960

CACGTgTATC CTACCTTCyT CATCyTTTTG aACTkGtAAT tCTAGCAATT AAATTTTGGA 1020

TACCATT 1027

(2) INFORMATION FOR SEQ ID NO: 386:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1006 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 386:

AAGGnriGGA GGGAATTAAT nGATGGCAAA TCCCAGaTAA AGTACACaAC GCATGGAATG 60

TGATGCATTT AATATTTCCA GTAGTTAGTA CGACGTTTGC AAGCTTTAAA TCTATGTATG 120

GGGGCATACC AAAAGATTTC ATAGACTACT TATTTATTGA TGAAGCAGGA CAAGCAATAC 180

CTCAAGCAGC TGTGGGAGCA TTATATCGTT CAAAAAAAGT TGTAGCTGTA GGTGATCCGA 240

TTCAAATAGA ACCGGTTGTG ACTTTAGAAA GTCATTTAAT TGATAACATT CGTAAAAATT 300

ATCATGTTCC GGAATATCTA GTTTCTAAAG AAGCTTCTGT GCAGTCTGTT GCAGACAACG 360

CCAATCAATA TGGTTTTTGG AAATCTGATG CTACTGATAG TAATCAAAAA ACCTGGATAG 420

^rxKCnATOTT TAAAACCTAT GTTCACGATa GCTAaCCAAa 480
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GGTATGACGT TAAAGGAAaC GCAGTTCAAA AACAATTTGT GAAAGAGCAT GGTGAAAAaG 6 00 

TAGTGGGATT ATTAGCTGAT GATTGGATTG AAGCAATTAA GGAAGGTAAA AATGAACCGA 6 60 


GCTCATTTGT AATATCGCCT TTTTCAGCAG TACAGCAACA GATTAAACGT ATGTTAAAGC 72 0 

AACAACTACC GACTAGAATT GATATTGAAC GTACAAAAAT TAATCAATGG GTCGATAAAT 7 80 

CCATTGGTAC TGTTCATACT TTTCAAGGTA AAGAGGCTCA GAAGGTGTAT TTTGT7UVTAG 84 0 


GTACTGATAA TACCCAAGAT GGTGCTGTGA ACTGGTCATG CGAAAAACCA AACTTGTTAA 900 

ACGTTGCAGT GACAAGAGCT AAGAAAGAGT TTaTGTAATT GGCGACATGC AAAGAATACA 960 

GATGAaACCA TTTTATGAGA CGATTTTTAn AGnAAGAAAT GTAAAA 1006 

(2) INFORMATION FOR SEQ ID NO: 387:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 662 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 387: 

CGTTTCATCA AGTnATCGAT CCTAATTTAC AAAATATTCC GGTTAGACTT GAAGAAGGGC 6 0 

GTAAAATAAG AAAAGCCTTT AAACCAACTT CAAAAGATAG CGTTATATTA TCAGCAGATT 12 0 

ATTCTCAAAT TGAATTGCGT GTATTAGCAC ACATTACACA AGATGAGAGT ATGAAAGAAG 180 

CATTTATCAA CGGCGATGAT ATTCATACAG CAACTGCTAT GAAAGTATTT GGTGTAGAAG 24 0 

CTGATCAAGT CGATAGTTTA ATGCGTCGTC AAGCAAAAGC AGTTAACTTT GGAATTGTTT 3 00 

ATGGGATAAG TGATTATGGT TTAAGTCAAA GTrTAGGTAT TACTCGTAAA AAAGCAAAAG 360 

CATTCATTGA TGATTATTTA GCTAGTTTCC CAGGTGTAAA ACAATATATG TCTGATATTG 42 0 

TAAAAGATGC CAAAGCTTTA GGTTACGTGG AAACATTGCT ACATCGTCGA CGCTATATTC 48 0 

CTGATATTAC GAGTCGTAAC TTTAATTTAC GCGGCTTTGC TGAACGTACT GCTATGAATA 54 0 

CGCCAATACA GGGCAGTGCT GCAGATATCA TTAAACTGGC AATGGTTAAA TTTGCTCAAA 600 

AAATGAAAGA GACAACATAT CAAGCTAAAC TATTATTACA AGTACACGAT GAATTAATTT 660 

TT 662 
(2) INFORMATION FOR SEQ ID NO: 388:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 669 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 388: 
TCATCCAAAT rrrOGAAATT CCACArTTTA CATATCGTAA TrTTTTAGGA AACTAGTGAA 60

TATAACAAAT CCCTCCTCTC ATTTrTAAAA TAGATATATC ACTTCCCCAC TTTTACTTAA 120

CTAAACTGCA ACGGTTCCTA ATACCAAAAT CCTGCCCTCT ArmTATCA ATTCAAGCAT 180

ACTTAITGAA AAATGTTAAC GTTTTCTTGA TAATCATTGT AAGCGCAm ArTTTATAAA 240

CTAACGTTTG AAATATACTA CAGGAGTGAC ACGTAATGAC TCAAATTACT GAAAAAGAAT 300

TAAAAAAGAA GTATTTAGAT TTACTATCCC AAAATTTTGA TACTCCAGAA AAACTTGCAA 360

CTGAAATTAT CAArTTAGAA TCAAmTAG AATTACCTAA AGGTACGGAA CATTTCGTCA 420

GCGAm-ACA TGGIXJAATAT GAAGCTTTCC AACACGTATT ACGCAACGGT TCTGGGAACG 480

TGCGAGCGAA AATCAATGAT ATTTTCAAAG AGAGACTTTC AACTAAGGAG CTTAATGACT 540

^..^-^ ..afTMiLJiaT TaATTAAAAG TGATTTCCAA 600

TAACTGCTCT TCTCTACIai v,vj*<j««vjj. v....

;^rrgCGGtC mACtTAATGt CyGGtATATC ACaAOnATCG aACATnAAT TGAGTTAATT 660

AAATATTGT 669

(2) INFORMATION FOR SEQ ID NO: 389:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1249 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 389:

CACATGGCTG TTAGAGATAT GAATGGCCAT GCGTTACCTT TAACAAAAGA TGGCAATTTT 60

TATCAAACGA ATGTAGATGC AAATGGTGTT AATCATGGTG GTAGTGAAAT GGTGCAAAAT 120

;^CAGGTC ATATGAGTCA ACAArGCCAT ATGAATCAGA ACACACATGA ACCAACAGCC 180

ACACATGCAA CAAGGTCATA TGCAATCATC AAACCATCAA ATGATGAGTC CAAAAGCAAA 240

TATGCATTCA TCAAATCATC AAATGAACCA AAGTAACAAA AAAGriTTAC CAGCTGCTGG 300

TGAAAGTATC ACATCAAGTA TTCTTACTGC AAGTATTGCC CCACTACTAT TAGTATCTGG 360

GTTATTCTTA GCATTTAGAC GACGITCAAC AAATAAATAA ACATAATACG ATTAATAATA 420

GAAAAATCGT GTGATTATCT GaGGGAGCCT AGGACATAAA TCAATGTCCT AGGCtCnCTA 480

.tGTTATATT OGCAGTAGTT GACTGAATGA AATTGCGCTT GTAACAAGCT TTTCCATTTC 540
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CGATTGTCTT ATACGTGTCA GTGTTAATTC AGATATTTCC TGTGGAATAT ACCACTTATT 66 0 

AATCATAATT GGATAAGGTG tTTGTGCGTA CAGTGTTTCA ATAATCAGCC AACAATGTGT 72 0 

ATCACCaTCA AACACGTGAC TATGATTTTk GAAGTGGGGC GCTTTGGTAA TAGACATTTT 780

TAAATCTGAT TGATATGCAT TGCTATAAAT CGTTTGCTCA ACGAATGTCT TCATGTCGTC 840

TTCGTTTTGT GTATTCACTT TAAATGTGTC AATGACATTT AACGGTATAA AGGTAAAGCA 900


AAATGCATCA GCTTGCTTAG AATGATTGTC CTTTTTTTGA TAATAGCGTT CCATTGCAAT 960 

GACGGCAGAA GGATGGTTTG CAAACAAATG ATTTGTATAT TCACTTTCTA AATCAACACG 10 20 

ATAATTAATT GATGACATAG ATACGCGAGC TAGCAATATT TGATCAAGTG GATGCTTAAA 1080 


TTGATCCATA CTTGAAGCGT GTTGGGCATT TGTTTGTGGA ATAACAAAGT GTCCCTTCCC 114 0 

TCTTGTACTC TCTACGATGC CATCTTCGGC TAACAATTTt ATAGCTTGGC GCAAAGTCAT 12 0 0 

ACGACTGGAC ATCAAAGCGC GCACAAAGTT CCTTTTCAGT AGGTAATGC 1249

(2) INFORMATION FOR SEQ ID NO: 390:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1788 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 390: 

AATGATGATT CATTCAAAGA AATTAACGCT CGGTATATGC TTGGTATTAC TCATTATATT 60 

GATTGTAGGT TATGTCATTA TGACAAAAAC AAATGGTCGA AACGCCCAAA TTAAAGACAC 120 


ATTTAATCAA ACATTAAAAT TATATCCAAC CAAAAATCTC GATGATTTTT ACGATAAAGA 180 

AGGCTTTCGA GATCAAGAAT TTAAAAAGGG TGATAAAGGT ACTTGGATAG TTAATTCTGA 24 0 

AATGGTAATC GAGCCAAAAG GTAAGGATAT GGAAACGAGA GGAATGGTGC TCTATATCAA 3 00 


TCGCAATACT AGAACCACAA AAGGGTATTA TTTTATAAGT GAAATGACAG ATGACAGTAA 36 0 

CGGCAGACCA AAGGATGATG AAAAAAGGTA TCCGGtAAAA ATGGAACATA ATAAAATCAT 42 0 

ACCAACGAAG CCACTACCGA ATGACAAGTT AAAAAAAGAG ATTGAAAACT TTAAGTTCTT 480

TGTACAATAT GGCAACTTTA AAGATATTAA TGATTATAAA GATGGTGATA TTTCATATAA 54 0 

TCCTAATGTA CCAAGTTATT CGGCAAAATA TCAATTGAAT AATGATGATT ATAATGTCCA 600 

ACAGTTaAGa AAAAGATATG ATATTCCAAC CAAACAAGCG CCGAAACTAT TATTGAAAGG 66 0 

CGATGGAGAT TTAAAAGGTT CATCCGTAGT TcTAGAAGTC TTGAATTTAC CTTTGTCGAA 72 0 
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AGGTATGAGT CAAACTGAAT ATCAAATAAA ACCTGGTAAT ATAACAAGTA ACTCTGAAGA 84 0 

AACAAGTTCG ATATCTAAAG TGAGCTGTGA AATATAGGTA GCCATTTCAA AAAATTTAAA 900 

GGTGAATTTG ATAATGTAGC TCAAGGAGAT TGGGTTAAAA AGGCGAAGAA TGAACTGGAT 960 

GATATTAGTA AGAAATTAAA AAATATTCAA AGAACGGAAG TTTAATAGCT TATATGATTC 1020 

TTGGAGCTAA GACAGCATGC GTTCATTCAT GCCATTATTA ATATAAGCAC CGCAACAAAA 1080 

AAGCTTCTAA TGTGATACAG GAACCTCATA TTCCGTATCA TGTTAGAAGC TTTTAATGTC 114 0 

TAAAGAACAT CTACATTTTA TCATATTTTC TGACTTATTA AACTTTTATA TAATTAAATA 12 00 

TTTCTTAATT TTCCAAAATA GTGATAAATT TGTGAAATAC ATCACAAATC CCTTTATTTA 12 60 

TTTGGAAATT CATGTAATAT TAGACTTGTA AGAAGTTAAT AAATAGAGAG AGACGAGAGA 1320 

GTTTATATAA ATACTATATA AACATTGGAG TGATGATTAT GAGAAAAGAG ATTGAAGCGC 138 0 

TTATTTTCTC AGACGTAAtA GCTATGATAT TTACGTGAAC ACTGGTGTAA ATCAAGGATT 144 0 

AATTGGTGAC ATCAAAGATG GTTACCTAAC TATTGATTCT ATGCCTTACA TTGATGCTGA 1500 

GCGTTTGTAT CACTTTGCTA TGGAACGTAA ATCGTTAGTC ACTAACTAGT TCTTATTGCC 156 0 

AATGATTACT ACCCCTAGTC GGCGGCAATT GAAGTGTGAT TGATGTAACT TGCCCTCGTT 162 0 

GGTGAGCAAT TGAGGGCAGA CCCCTTTAAT TAAGTAAACC CTAACTCCCC ACAAATCTGG 168 0 

AACGATACTA AAAGCCACGT CCTATATTGG ATGTGGCTTT AGTCAkACTT ATATTATTTT 174 0 

tAAAACGATT ACCTACAAGA TTTACATATA AAATTCTATC ATGnCTGC 1788 
(2) INFORMATION FOR SEQ ID NO: 391:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2407 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 391:

GCAAGTTTAG TTAAACAGAT ATTAAAACCA GATCAATATA CAGATGCAAA TTCTCTATTA 6 0 

GAGATTAGTT TGCAAACAGG TATGTTTATT GCAGGTGGAT TATCAGGAAT ATTATATAAA 12 0 

ATAAATGGAT TCACTCTAAT TATAGCGATG ACTATAATGA TGTTTCTAAT CAGCATTTTT 180 

ATGTTATTTA GATTGCATGT AGATAAACCA ACTCATTCAG AGGAAGAATC AACAAATAGC 24 0 

TTATTGCAAG AATATTTATT AGGATGGAAA TTTTTAAAAG ACAACATGAT GATATTTATT 3 00 

TTTGGAGTTA TTTCAATCAT ACCAATGGTG TTTACAATGA TCTTTAACAT ATCATTACCA 36 0 
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TATGGCATTG GAGGATTATG TGCAGGTTTA ATTTCAGCTA TTCTTTCGAA GAAAATCTCA 48 0 

ACTAAAGTAT TGATATTTTT GTTATATTTC ATATTAGTCA TAAATTCAGC ACTATTTATT 54 0 

TGGATAAACT CAGCATTTTA CTTATTCATA GGATCATTTA TACTAGGCTA CTCAATTTCA 600 

TCAATCAGAA TTTATATGAA TACAGCTATA ATGAACACTG TTTCAGATAA ATATGTCGOT 660 

CGCTCATTTA CGATATGGAC GTCAATTTCA TTGTTACTGC AAAGTTTAAT TGCTCCATnT 72 0 

TTAGGAAGAT GGATTAATGA AATTAATGAT AAATTCGGTT TCTATATTAT ACTCATTTTA 78 0 

TCCTTACTCA TATTTGTaCA CTGCTGCTTG TTAACAAAAC AGACAAAATA AAATATGCAC 84 0 

ATAAAGAAGA GTGACCGTCA CTCTTCTTTA ACAAGCGACC ATTTATCGaT GGGCTTAGTT 900 

CTCTCTGCAC CCACACTTCA CTACTTCACT TTTTCAAATC ATTTTTTATG GTCTTAAATA 960 

AATCAGTGAG ATTTGTTGCT TCGGTAAAGT CTAGAATTAA TATCATTTCT TTAGAACCTG 1020 

GATATGGCGA AACTAATGTA TTATCTTGCA ATTTCTGCTG GGCACTTTTA GTCGCCTTGA 1080 

CCAATAATCT ATTATCATAC AAACCACCTA TAACCACGCC ATCATAATAA ATAATATATT 114 0 

CTCCCATCAT CTTTCTTGTC TTAACCGCGT TTGAATTCAC ATGATTTAAA AATAAATCAT 1200 

GTACATCTTT CTTAGTCGCC ATTGTAATCG CTCCTTCAGT TTTATGTTTA ATCACATTCG 1260

TATTAATGAT TCATTTCGTG TTGCTCTTAA TTTTATCTAT AATTATATGC GTAGTTAAAA 132 0 

TCAAACCTAT GGAAAAGAAA ATAATGATAA CAATGTTAAG AAATATAGTT ATAAAATTAT 13 8 0 

AGTTTGGAAG GTATGCGAAT AGCAGAAAAA TAGGTATCGC AAAAAATAAA TCCCACCAAC 144 0 

CTAAACTTTT TAAAGAATGC TTTAAACCTT CCATAATATC ACCTTTATAA ATTTGTCTTT 1500 

GTTATAAGAT AACTAAAAAA TCGCTTTACT GTAAAAGTAG CCAAAGAAAA TTCTGAATCA 1560 

TATTCATAAG TAGTGTATCA TTAATAATGA ACAATTTAAT ACTATAATCC TTGATCTTTG 162 0 

TATTGATCAA CTTACCACAA CATTTATTTT AGACTACTCT TAGACTTCCC TTTCAAATGG 16 8 0 

TTGCATCTAT TGAAATTCCT TTTGTATAAG TTAGGCTTTT GTGGTAATAT CATCATGCAT 1740 

AAAAAATCGA GATACTAATT ATAAAGAGGG TATAAATATA TTATGAAAGA AAATTTTTGG 1800 

AGTGAATTAC CACGTCCATT TTTTATTTTG GCGCCAATGG AAGACGTTAC AGATATCGTC 186 0 

TTTCGACACG TTGTAAGTGA AGCAGCTAGA CCGGATGTGT TTTTCACTGA ATTTACAAAT 1920 

ACTGAAAGCT TTTGCCACCC TGAAGGCATA CATAGTGTGC GCGGACGCTT AACTTTTAGT 1980 

GAAGATGAAC AGCCGATGGT CGCTCATATA TGGGGAGATA AGCCAGAACA GTTCCGTGAA 2 04 0 

ACGAGTATTC AATTAGCTAA AATGGGCTTT AAAGGCATAG ACTTAAATAT GGGATGTCCT 2100

GTAGCAAATG TTGCTAAAAA GGGTAAGGGT TCCGGCTTAA TCTTAAGACC TGACGTTGCT 2160 
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GGCTACTATG AAATCGATGA ATGGAAAGAT TGGTTGAAGC ACGTCTTCGA ACAAGACATT 
GCCAATTTAT CTATTCATCT TCGTACACGT AAAGAAATGA GTAAAGTAGA TGCACATTGG 
GAATTAATCG AAGCTATTAA AAATTTACGT GACGAAATTG CACCAAATAC ATTGTTAACA 



ATTAACG 

(2) INFORMATION FOR SEQ ID NO: 392: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2424 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 392:
ATGATGOAT^ GCnCCGCCAA GAAGTTGAAC CAGTCTATTG GCTAGAATTG CTCAAAAAGC 

GAGAcaXGTG GTATTCaTAT GTTaGTAgCt ACGCAAAUAC ^lux^.^ ^v..™ - 

GGTriAATTA AAGCCAACAT ACCAACAAGA ATTGCATTTA TGGTATCATC aAGTGTAGAT 
TCGAGAACGA TATTAGACAG TGGTGGAGCA GAACGCTTGT TAGGATATGG CGATATGTTA 
TATCTTGGTA GCGGTATGAA TAAACCGATT AGAGTTCAAG GTACATTTGT TTCTGAI.3AC 
GWrrGATG ATCrrGTTGA riTTATCAAA CAACAAAGAG AACCGGACTA TCTATTTGAA 
GAAAAAGAAT TGTTCAAAAA AACACAAACA CAATCACAAG ATGaATTATT TGATGaTGTT 
TGTGCATTTA TGGTTAATGA AGGACATATT TCAACATCAT TAATCCAAAG ACATTTCCAA 
ATTGGCTATA ATAGAGCAGC AAGAAriATC GATCAATTAG AGCAACTCGG TTATGTTTCG 
AGTGCTAATG GrtCAAAACC AAGGGATGTT TATCTTACGG AAGCAGATTT AAATAAAGAA 
TAATTATGAG TAAGGAGrTT TATATAATGA CACACTATCA TTTIGTCGGA ArTAAAGGTT 
CTGGCATGAG TTCATTAGCA CAAATCATGC ATGATTTAGG ACATGAAGTT CAAGGATCGG 
ATATTGAGAA CTACGTArTT ACAGAAGrTG CTCTTAGAAA TAAGGGGATA AAAATATTAC 
CATTTGATGC TAATAACATA AAAGAAGATA IX^GTAGTtAT ACAAGGTAAT GCATTCGCGA 
GTAGCCaTGA AGAAAtAGTA CGTGCACATC AATTGaAATT AGATGTTGTA AGTTATAATG 
ATrrrnAGG ACAGATTATT GATCAATATA CTTCAGTAGC tgtaactggt gcacatggta 
AAACTTCTAC AACAGGTTTA TTATCACATG rTATGAATGG TGATAAAAAG ACrrCATTTT 
TAATTGGTGA TGGCACAGGT ATGGGATTGC CTGAAAGTGA rTATTTCGCT TTTGAGGCAT 
GTGAATATAG ACGTCACTTT TTAAGTTATA AACCTGATTA CGCAATTATG ACAAATATTG 
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TGGCACATAA TGTTAAAAAA GGTATTATTG CTTGGGGTGA TGATGAACAT CTACGTAAAA 12 6 0 

TTGAAGCAGA TGTTCCAATT TATTATTATG GATTTAAAGA TTCGGATGAC ATTTATGCTC 132 0 

AAAATATTCA AATTACGGAT AAAGGTACTG CTTTTGATGT GTATGTGGAT GGTGAGTTTT 1380

ATGATCACTT CCTGTCTCCA CAATATGGTG ACCATACAGT TTTAAATGCA TTAGCTGTAA 144 0 

TTGCGATTAG TTATTTAGAG AAGCTAGATG TTACAAATAT TAAAGAAGCA TTAGAAACGT 1500 


TTGGTGGTGT TAAACGTCGT TTCAATGAAA CTACAATTGC AAATCAAGTT ATTGTAGATG 156 0 

ATTATGCACA CCATCCAAGA GAAATTAGTG CTACAATTGA AACAGCACGA AAGAAATATC 162 0 

CACATAAAGA AGTTGTTGCA GTATTTCAAC CACACACTTT CTCTAGAACA CAGGCATTTT 16 80 


TAAATGAATT TGCAGAAAGT TTAAGTAAAG CAGATCGTGT ATTCTTATGT GAAATTTTTG 174 0 

GATCAATTAG AGAAAATACT GGCGCATTAA CGATACAAGA TTTAATTGAT AAAATTGAAG 18 00 

GTGCATCGTT AATTAATGAA GATTCTATTA ATGTATTAGA ACAATTTGAT AATGCTGTTA 1860

TTTTATTTAT GGGTGCAGGT GATATTCAAA AATTACAAAA TGCATATTTA GATAAATTAG 1920 

GCATGAAAAA TGCGTTTTAA TATGTTTATA ATAGAGTAGT ATGGGTATTT ATTATTAATG 198 0 

ACATTATTAC ATGTTAATTA GGAGGCGTTT TTAATGGATT GGATTTTACC AATTGCTGGA 2040

ATTATCGCTG CGATTGCATT CTTAATTTTA TGTATCGGTA TCGTAGCTGT ATTAAATTCT 2100 

GTTAAGaAAA ACTTAGATTA TGTTGCAAAA ACACTTGACG GTGTAGAAGG TCAAGTTCAA 216 0 

GGTATTACTC GTGAAACAAC AGATTTACTT CATAAAGTAA ACCGTTTAAC TGAGGATATC 2220 

CAAGGTAAAG TAGATCGTTT AAACTCAGTT GTAGATGCTG TTAAAGGTAT CGGTGACTCA 22 8 0 

GTACAAACGT TAAACAGCTC TGTAGATCGT GTAACAAATT CAATTACACA TAATATTTCT 234 0 


CAAAATGAAG ATAAAATCTC ACAAGTTGTT CAATGGTCAA ATGTTGCAAT GGAAATTGCA 24 00 

GACAAATGGC AAAATAGACA CTAC 2424 
(2) INFORMATION FOR SEQ ID NO: 393:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 738 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 393:

CnATAATATT CnTCAAnCCT TTAAATAGAG GAATAGACTG CCGACAGAGT CCGAGACTTT 6 0 

GTGGGTAGTT TTTTTAGTTT TGATAACGGA AGTTAGAGGC TCTCTGTCAA ATTGGGCAGA 12 0 
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AGTGTAAAGT TTTCAACATA ATACTATTAG TTCGGTCATG TATCGGACTG ATGGAAAAGC 240 

GTTTCACTTT TAATGACTCA TTAAGAACGG CCTGAAAATG TTTGGCGTAT TAAGTGCAAT 3 00 

GATAGTTTTG ACATTTAGTT TCTAATTGGT CATTACTGCC GAGCAAATCT AGTAGAGTAA 3 60 

TCATGTAAAT CTTTAATGTG CCATTTGATT CACTAGCGGT GTTAATAACT ACGGAAATTG 4 20 

CATTTCCGAC TGAAATTTTT GAAAAATATC AACGTACGCT ACAAATAAAA TTTTAAACTG 480

TTATAAATGT GTCTCAATTT CATATGTTCA TCGACGATAT GAAGCGTATT ATGGTAAAAT 54 0 

GAAGAAATAA TAAACTTGTT AATAAATAAA ACATCACGAT TTGACTAAAG CACTTTATTA 600 

TTGTGTAGAT AATAGTTTTT TAACGAAATA AAAATGGCGA CTGGTTTTAA TAAATCAGCT 66 0 

AATGAATCAC TACACCTATA AGTATGAATA TAGTGATTAG AATGCTTTGT ATAGTTGGAT 72 0 

TTTGCAAAAT TGATGTTA 738

(2) INFORMATION FOR SEQ ID NO: 394:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1270 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 394:

AAAAGTTGTA ATTAAAAGTG GGATTTTACT TAAGnnAGAA GGAAACTATT TATATGACTA 60 

ATAAAAGAGn AGATGTCCGC AATATAGCAA TTATTGCTCA CGTTGACCAT GGTAAAACAA 120 

CTTTAGTAGA TGAGTTGTTA AAACAATCTG GTATATTCAG AGAAAATGAA CATGTCGATG 180 

AACGTGCAAT GGACTCTAAC GATATCGAAA GAGAGCGTGG AATTACGATT CTAGCCAAAA 24 0 

ATACGGCTGT TGATTATAAA GGTACACGTA TTAATATTTT GGATACACCA GGACATGCAG 3 00 

ACTTTGGTGG AGAAGTAGAA CGTATTATGA AAATGGTTGA TGGGGTTGTC TTAGTAGTAG 3 60 

ATGCGTATGA AGGTACAATG CCTCAAACAC GTTTTGTACT TAAAAAAGCG CTAGAACAAA 4 20 

ACCTGAAACC TGTTGTTGTT GTTAATAAAA TTGATAAACC ATCAGCACGT CCAGAGGGTG 480

TTGTAGATGA AGTTTTAGAT TTATTTATTG AATTAGAAGC AAACGnTGAA CAATTAGAAT 54 0 

TCCCTGTTGT TTATGCTTCA GCAGTAAATG GTACAGCTAG CTTAGATCCT GAAAAGCAAG 6 00 

ATGATAATTT ACAATCATTA TATGAAACAA TTATTGATTc ATGTACCAGC TCCAATTGAT 6 60 

AACAGTGATG AGCCCATTAC AATTTCCAAG TAGCATTGTT GGACTACAAT GATTATGTTG 72 0 

^^A^^TATTGG 7ATTGGTCGT GTATTCAGAG GTAAAATGCG TGTCGGAGAT AATGTATCAC 78 0 
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GATTAAAACG TTTAGAAATT GAAGAAGCAC AAGCTGGAGA TTTAATTGCT GTTTCAGGTA 9 00 

TGGAAGACAT TAATGTTGGT GAAaCTGTAA CACCACATGA CCATCAAGAA GCATTGCCAG 960 

TTCTACGTAT TGATGAGCCT ACTCTTGAAA TGACATTTAA AGTTAACAAT TCTCCATTTG 1020

CTGGCCGTGA AGGTGACTTT GTAACAGCAC GTCAAATTCA AGAACGTTTA AATCAACAAT 1080 

TAGAAACAGA TGTATCTTTG AAAGTTTCTA ACACAGATTC TCCAGATACA TGGGTAGTTG 114 0 

CTGGTCGCGG TGAATTGCAT TTATCAATCC TTATTGAAAA TATGCGTCGT GAAGGTTATG 12 00 

AATTACAAGT TTCAAAACCA CAAGTAATTA TTAAAGAAAT AGATGGTGTA ATGTGTGAAC 12 60 

CATTTGAACG 1270

(2) INFORMATION FOR SEQ ID NO: 395:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1365 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 395:
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AAGACCAGGA GAACAmGTAA AACAATATAC AGTTGaAATC GCTCGTAAAT TAATGGAATT 60
TGATATAAAA TGCTCGTGAT TGCTTGTAAA TACGCAACTG CTGTnGCTTT AGAATATTTA 120
CAAAAGACCT TATCAATCCC CAGTGATTGG CGTAATTGAc CAGGTGCTAG AaCAGCAATA 180
ATGACTACTA GAAATCAAAA TGTATTAGTA CTAGGAACGG AAGGCACAAT TAAATCTGAA 240
GCATATCGTA CGCATATTAA ACGTATCAAT CCACATGTAG AGGTACATGg CGTTGCCTGT 300
CCAGGTTTTG TGCCACTTGT AGAACAAATG AGATATAGTG ATCCAACAAT TACAAGCATT 360
GTCATTCATC AAACACTGAA ACGTTGGCGT AATAGTGAGT CTGATACTGT CATTTTAGGA 420
TGTACCCACT ATCCATTGCT CTATAAACCT ATCTATGATT ATTTTGGTGG TAAAAAGACA 480
GTGATTTCGT CTGGATTAGA AACGGCTCGT GAAGTTAGTG CATTGCTAAC ATTTAGTAAT 540
GAACATGCAA GTTATACTGA ACATCCAGAT CATCGATTTT TTGCAACAGG TGATCCTACT 600
CACATTACTA ACATTATCAA AGAGTGGTTA AATTTATCTG TCAATGTGGA ACGTATATCA 660
GTGAATGACT AGGAGGATTT TTAATGAAAG AGATTGTTAT TGCATCGAAT AATCAAGGGA 720
AAATAAATGA CTTTAAAGTA ATATTTCCAG ATTACCACGT AATAGGTATT TCAGAACTAA 780
TACCAGATTT TGATGTGGAA GAAACAGGAT CAACATTTGA AGAAAATGCT ATATTAAAAT 840
CAGAAGCTGC TGCAAAAGCA TTGAATAAAA CGGTCATAGC TGATGACAGT GGACTAGAAG 900



EP0 786 519 A2 



GCGATGAAGC AAATATTGAA AAATTATTAA ATAAGCTTGG TAATACAACT GATCGTCGTG 102 0 

CGCAATTtGT TTGTGTCATA AGTATGAGTG GCCCTGATAT GGAAACAAAA GTATTTAAAG 108 0 

GTACTGTTTC AGGTGAAATT GCAGATGGAA AATATGGCGA AAATGGTTTC GGATATGATC 114 0 

CGATATTTTA TGTACCGAAA TTAGATAAAA CCATGGCTCA ACTTTCAAAA GAACAAAAAG 12 00 

GGCAAATTAG TCATAGACGA AATGCGATTA ATTTACTTCA AGCTTTTCTT GAAGGTGATA 126 0 

AAAATGTCTA AATGGATTAT TGTGAGTGAT AACCATACTG AATCAGGCGT TTTATATCAA 132 0 

ATTTATGAAA TGCACCCAGA TGCAGATGTA TATTTACATT TAGGA 13 65 
(2) INFORMATION FOR SEQ ID NO: 396: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1383 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 396: 

AATTCCTGGT GCAATAATAA ATAGGATGAA AAAGATTCGG AAAATATGAT AACTCGTAAT 60 

CATAGCAACA TCGGCACCAG TAGCTAATGC AACTAAAACT ATCTGATTAA CCCCTCCTGG 120 

TGCTGCACCA AGaAACAATT CATTAATAGG ATTATTATCA AAGAAATGTA TGATATAAAC 180 

CATGATTAGC GCACCAATTA TCT^CATAAT ATTTTGAATT GTAATTGCGA TTGCTAGTCT 240

ACC'TTTTAAA TCTGACAATA AATGCGCAAT TTGAACTCCA ATTCTAATCA TATATATTAG 300 

TTGTGCCATG TTCAACAACC AATGATCTAG TGTAAATGTT AAACCTGTAG AAAAATTCCA 360 

AACAATTAAT ACAATGAGTG GTGCTAATAA TTGAAATGTT GGAAACTTTA TTTTAGACAT 420 

AATTAGATAA ACTATAAAGA TAGCTATCGC TAAAATAACT ATTTGCCCTA TGTTTAATAC 480 

TTGTGATAAA GGCAAGACTT TTGTTAACTT TCCATTCGCA TGCATGTTAC CATCATGAAA 54 0 

AAAATATGAA ATGAACGGTA CTAAAACAAC AACAAATATA ATTCGTGATG TTTGCGTTAA 600 

GCTAACAACT AACAAATTAG CACGTTTGTC TTGTTCAGCC ATGACCAGCA TTTGTGTTAG 660 

TGCTCCTGGT ATAft.CACTTA AAATAGCTGT TTCTGTATTA ATACGTGCAA TTTTTTTAAA 72 0 

aACAAATGCC ATTACTATTG CAATTAATAA TATCGAAATA GATACAACAA TAATCGAAAG 780

CCAATTGTTT TTAATATCCA TAACGACATT TTTCGTAAAC GTTGATCCGA TTTGCACACC 84 0 

TAATAGTACA ATACCTAATT CACTAAGTAA GAATGGCCAT TTAATATCAA GTTTGAAAAC 9 00 

T^ITTACACAA ATGATTGATG CGATAATAGG ACCAAACATA AATGGAAGTA ATACGTGCGA 96 0 
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TATCATTGCC ATGTTTTCCA CTTCTTTCAA TAAAAAATAA AATGACTAAA TTGCTGCTTG 108 0 

AGCTTCACGT TTGTTAAGAT AACAATATCC GCTAGCAGTT tTGACTACAA AGCATATATG 114 0 

GaCTTTCACT ATCAAGTCGC CGCCCATGCC TTATATACAT TTAAAAnGAG CCTGAACAAA 1200 

GTTCAGGCTC TCAATTTGTC CGTATATTTA TTTTACAATA CGACTTAAAG CCGTATCAAA 12 6 0 

TGCTTGAATC GTTTTTCAAT ATCTTCTTTC GTGTGTGCCG TAGATAAGAA TGTACCTTCA 132 0 

AATTGAGATG GnGGnAAAAA CACACCTCTT TGnCATTCTC GGTACATTTC TGCAATAATT 13 8 0 

TCC 1383 
(2) INFORMATION FOR SEQ ID NO: 397:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 415 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 397:

TCCACTAAnA TGATTACAAT TGCATTAGTT TGGTGGAGTG CATTTACAAT CTTAACGGGT 60

ATGATTAAGA ACCACGGTTT AATTaTTTAG TGAGATTCTT ATTTGGTGTA GGTGAGGCGC 120

CAATGTACCC TTCTAATGCT GTGTTTAATT CATTTTGGTT CTCTAAAAAT GAAAAAGGTA 180

GAGCATCAAG TGCATTATTA GCAGGATCAT ATTTCGGACC TGTATTAGCA CCAATAGTTA 240

CAATTGCTAT TGTTAACGCA TTTAACTGGC AAGCAGTATT TTACATTTTT GGTGCAGTAG 30 0 

GTATTTTAAT GGCkGTATtA TGGGCGATTA TTGCCAAAGA CTTACCTGaG CrACATAGwa 36 0 

TGGTTAATGA AGCGGAGAAA CGTTTCATTA TGGAAAATCG TGATATCGTA GCTAC 415 



(2) INFORMATION FOR SEQ ID NO: 398:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1141 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 398: 
TTTTAGaTaA aTyCAATTnT CyATaCTaAA TgATTnTCTT ATTaCGTCAA TTcGCCTTTT 60 
aTTTTATCGT AATCTTTCCa CTGCAAAGCT AAAGCTTCTC CTATTCTAAG ACCAGAATAA 120 

AATAACAGTC TAGTTAGCTG ACGAGAAGTA TCATTTGTGA TTTGTTCTAC TTTTTCATCA 180 
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AKTGTGGGGT CGTATAAGAG CTTGTAATGC TTTTTGGCGT AAITGATAAC TGCTTTAAAA 300
CCTGCCCACA CAGATCGTGC ATAGTCAACA GAAAGACCTG CATCGTTTAA CAAATAATTC 360
CTGAAAGCAG TACATTGCGT AGTAGTGATT TTGCCAATAG GGATATTTCC GAACCTTTCT 420
TTTATGTGAG TATTATATTC TGTAGTTCGC TTnCTATrG AGCGTGCAGA AAGATTTTCA 480
rrmTAAAC GATCAAAAAA TATATATTCA AAGGGrrCAT TGTCCGAGTA TCCATATTTA 540
ACArnTGTA TAAATTCGCT T^CAGCTAGT rrGGCATCTT TCTTACGTTC AAACCCACGC 600
TTCATTTTTC GnTGTTATT ACCGTATACA TCTTTATATC TAATGGAAAA ATACCATTTA 660
CCTGTATTAT CATCCTTATA TACTGGCATT TTGCrrCTCC CTCCTCAAAA XTGGCAAAAA 720
ataataaggg TAGGCGGGCT acccgaaatt TAOTACTAGG TACTAAATGT GATATAATAA 780
AATAAAAAGT AGGTGATGTT atgacattta aaaacaatca TAATTTCAAT GAATTAGTTT 840
taacgaatga agacattaga attttaaaaa atgtcttaga AGATGCAGTC AGTGTTTATG 900
atgaatattc ggtatgtaat gaagaatccg attttgctta CTGTTTATTA AGAGACTTAT 960
ATACAITAGA CAGCTTAGCT ATTTCGTCAA ATAATGTTTG AATTATCGAA TTGTACTCTT 1020
cgattttaat accatgcata atagagtttc tgtgttcaat AGCAGCTTTG ACTGAATGTk 1080
TTAAATGTTC TTCTATTAAA TCGTTGTTTk CCAtTTCGtk TAaAAATGTt CyTATATTCC 1140
T 1141

(2) INFORMATION FOR SEQ ID NO: 399:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 706 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 399: 
ACTGTGGTAG GTTmTAlT TTGAAGTATT AATCATAACA GACTAATAAT CATGAGGTAA 60
CTAATAACAC ATATTTAACT TGTATTCrrA AACTGGTATA ATAAATTTAT GTTGAAATGA 120
ATATTGTATG ACAGGGTATT CACmTATT AAAAGGTAAA ATTAAATAAA GGTTTTATAG 180
AACGTATTTA AATATATGAG GAGTAAACAA ATGGCTGATA GAACGAATAA AGAAATTAAA 240
ACAGGACGCT TTATTGCAAC TGCATCAATC GTATTCTCAA TATTArTGAT TATTCATTAC 300
TTTTGrrrCGT TGGATAATGC GACTGCCAAA GCAITACTTA ATTTAACGAA TCAAAACACT 360
TCAGATAAAG CGATTGATTA CATTrTAAAC AGCTTTAGAT TCACTGGTAT TATGTATATT 420
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ATGTTTGCAG TTTATGTATC AAATAGTTTG TTTACGTTGA TTAATTTATC AATCACAATT 54 0 

CAAGCAATAA AAGCTGCACA CGGTGCGTAC TTAACATTGC CAATTTTAAT TGTTATTATA 6 00 

GGTTCGGTTG CATTAGCGAT TTATATGCTT GTTGTTTCTA TCAAACGTAA AAGTACATTT 6 60 

AATCGCTAGA AAATTGATTT TAACAATAAA AATATGAAAA AAAAnn 7 06 



(2) INFORMATION FOR SEQ ID NO: 400:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1187 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 400: 

ACACAATCTG AAGATTCACG TTGTGGTGCT GGACATGATC GAAAAATTAG AGCTGAACAA 60

ATGAnAGAAA TCAGTGATTT TGTTAAAAAG AAAAATATCC CTAAAGATGA AACGGTATAT 120 

ATAGGTGGCG ACCTTAATGT CAATAAAGGC ACTCCAGAGT TCAAAGATAT GCTTAArAAC 18 0 

TTGAATGTAA ATGATGTTCT ATATGCAGGT CATAATAGCA CATGGGACCC TCAATCAAAT 24 0 

TCAATTGCGA AATATAATTA CCCTAATGGT AAACCAGAAC ATTTAGACTA TATATTTACA 3 00 

GATAAAGATC ATAAACAACC AAAACAATTA GTCAATGAAG TTGTGACTGA AAAACCTAAG 3 60 

CCATGGGATG TATATGCGTT CCCATATTAy aCGTTTACAA TGATTTTTCA GATCATTACC 420 

CAATCAAAGC CTATAGTAAA TAGTGCTCAA CTAACTAATA ACTTGCTtCG TTCTAAAAGG 4 80 

ACGAAGCGAG TTATATTGTT AAAATTTGAA TTGACTTACA TTTTAATAAA ATCATCTTAA 54 0 

CAACTTTAAT TTTTCaTTAA TACAaGTCTT TACTCTACAC TCAAACnAGA TTCATACACT 6 00 

GCACGTCATA ATAAATCTAT CTATTCAAAT ATAAATAAAA GTTACCTACT ACATTCTATG 6 60 

TAGCAGGCAA CTTTTATTAC TTATTTCTTT TCATTATCAT TAAGTACTTT TACAAACTTC 720

ACATTATGTG TCTTCCAATC AACTTCATAT AATGCTGATA ATTTTTCTTC TTTTTTATCT 780 

ACATGGTTTT CACCAGACCA ATAGCCCCAG AAACCATGGC GATTCCAATC TATTTTAAAC 84 0 

TCATCCATTG ATCTTTTATA ATGAACAACA AATTGTGATT TACCTTTGTC TTTTTTATCA 9 00 

TGTGACATAA CAGCTAAAAA TTCTGGATTA AACCCTTCAG ACACAGTTAC AGGCATTTTG 9 60 

TCTTTAGGTG TGAAATTATC TTTCGCCCAT AAATTTCCAT TTCGTGTTAA AGAAAAAATT 10 20 

TCACTTTTAG TTCTATTATC ACTATCATTA GTTAATTGTC TCGTATGGTC ATGTCCCATA 10 80 

TTATTTATCA AATGTGCTTC TACTTTCCAA CCTACACCTT TATGTGACGT AGATTGATCA 1140 

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(2) INFORMATION FOR SEQ ID NO: 401: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 847 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 401:
CCAGAATTAT TTTTTCAAAA AGGACAATTT AACAATGTCG ATAACGTTAT CATAAGCAAA 
CCGATGAAAG GGACAATGCC TAGAGGTAAA ACGGAaGCTG AAGATCAACA GTATTATAAA 
ACATTGCAAA CTTCTTCGAA AGATCGTGCA GAAAATGTCA TGATTGTTGA TTTACTAAGA 
AACGATATAG GGAGAATATC ACAGAGTGGC TCAATTAAGG TGTATAAACT ATTITTrATT 
GAGGCATATA AAACTGTATT TCAAATGACT TCGATGGTAA GTGGAACTTT AAAAAATAAT 
ACAGACTTAA CTCAAATITr AACATCGTTA TTTCCTTGTG GTTCGATTAC AGGTGCACCG 
AAACTGAATA CAATGAAATA TAITAAACAA TTAGAAAGTT CACCTCGTGG TATATACTGC 
GGACAATTGG ACTATTACTT CCAACTGAAG ATGATAAAAT GATTTTTAAT ATTCCGATTC 
GCACTATTGA GTATAAATAT GGACAAGCGA rTTATGGAGT CGGAGCAGGT ATTACAATTG 
ATTCTAAGCC AAAAGATGAA GTGAATGAAT TTTACGCAAA AACCAAGATT TTGGAGATGT 
TATAATGCAA TTATTTGAAA CAATGAAAAT TGATAATGGA CATATCCCTA GACTTACTTA 
TCATACTAAT CGCATAAAAT GTTCTTCTGa GCGATTAAAC TTTAAATTTG ATGAACATGC 
ATGGCGAAAT GAATTAAACG ATGTAACAAC AAAGTATCAC AGT^TCAAT ATAGACTTAA 
AATCGTATTA AATGCTGAAA GCAAATTTGA AACGATAGTG TCACCTTTAC CTGAGAAAAG 



TAGTTTT 

(2) INFORMATION FOR SEQ ID NO: 402:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 740 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 402:
TGAAGATGAA GCAGAAGCAG AAGACAATCT GCTACGAGTA CAATCGAAGA AAGAAGAAGT 
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GTTAAACAGT TAAAAGATAA AGTGTCTAAA ATTGTGATTC AAATGAATAC ATTTGAAGAT 24 0 

GAAGCAAATG ATGTTCTTGT TAATGCTGTT TATGCAGAGA AATTAATTCA ATATGGAAAT 3 00 

AGATATCGTA AGGACTATAG CAATGTTGAT AAGAGCTTAA ATGAAGCTGA ACGATTATTT 360

AAAAATAATC GCTATAAGCg TGCGATTGAA ATTGCAGAGC AAGCTCTTGA AAGTGTTGAG 42 0 

CCAGGTGTTA CTAAACATAT TGAAGAAGAA GTTATTAAGC AATAGAAACT AGTATGTAGT 4 80 

TATACTTAAA TAATATGAGC ACTCTGTCAA ATTGGACTGA TGAGTTTAAT AATTGAAGTT 54 0 

AGCCAACGAT ACGTTGTCTA GCTTCTTTTT TATATGGATA AATGaAAGGG ACAATAAATA 600 

TAAATAGCAA TTGTTTAAAG ATAAACGTAA TCAAATGTGT TGTTTTAATT AATATAAGTA 66 0 

GTGAAAAAAG CATAATCACA CAGCTGTTTA AATAGAGTGA AATAGTCTAA TTCTTATTTA 72 0 

ATAAGTAGAA ATAAGATTAT 740

(2) INFORMATION FOR SEQ ID NO: 403:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 630 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 403: 

ATGCCAATAA TTAAACCATG TAATAAATAT ACGTATAGCG TACGACTACC AATATAAGTA 6 0 

TATAATTTTT TCTTTGTTGA CATTAAATTT AGAAACGCAG TCATTGCGAT TAATATAATT 120 

CCATATAATA TAAGTCGTTT AAAAGGACTG AATATACTCT GTCCTTCATT TTCAAGTGAA 18 0 

GTATATGGTG AACTTCCCAA TAACCAATCT GCATTGATAG GATGAATCAC GTAAACGATA 24 0 

AAAAACAAAA TAAAGGTAAT GATAGATACT GGTATTAGTT TTTTATTTTT AAAAATAGCC 3 00 

GTATGTTTTT TGGTGAAAAT GTAACCTAGA TAAAATATTG GGAAAAATAC GATTGTCCTT 360

GAAATGCTTA AGTAGCTATC GATGTTATCT GAAAAACCTG CTCCAATAGA TATAATAATT 420 

GAAACTGATA GCACTTTATA TGGATTAAAT CTTCTAACTA TTACTAAAAT GACATGAAAG 480 

AAAAATAGCG TGATCAAAAA CCATAACGCA AATACTGGGT TAAAAGGATC AAGTTGTAAT 54 0 

TCGTCACTTT TACCTGTTAA GAAATAATAA ATTGAAAAGA ATGCAAAAAA TATCATATAA 600 

GGTACTATCA AACGTTTTGA AATTTTTTCT 63 0 

(2) INFORMATION FOR SEQ ID NO: 404: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6254 base pairs
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(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 404: 
.^^COT TCAGrrrCTA AAAGTGGAAC AATCCAAGAA GTACGTAAAC GTOAATrrTA 
CGAAAAACCA AGCGTAAAAC GTAAAAAGAA ATCAGAAGCT GCACGTAAAC GTAAATTCAA 
.XAATTAATA CCTCH^TTGA CTCCCTCAAC ACOAATArTA ATTATATAAA ACAAACATCA 
CAAGTTAGTG IXrTGACACTA ATAT.3TGATG rrTTTr^rr GTCAArrm AATTAAAAAA 
.OrrATATAO ITTATAAATA ATCAAATTGA TATTCTATAC GTrCTTATAA CTATAAAGTA 
TArrCAATTT CATGTATAAT TAATGTGAOG GCGAGGT.^ ATTGTGAG^ ATAATAATTT 
rrrACAAATG ACAACTATCT TGCAATCAAC GGCTGGAGAT ACTTGGOrrO AACAAGrrAO 
^^r,..nrr. -rrTTTACGrr AATATTAACC TGTTTGACAT TCTTAGGATT 

CAAiAiMrtA^ vj*A>-*^-^ - 

TC«TATC» C^ACTCTA A«^TC« TOCAaCT<=OT ATTATCOCC CATTATCATT 
AC:nAITTTA T.TTTGGOAT TTCTAATCCA «K=AAATGTC AATAT=CAIT CTATCTTAAT 
ArrCTCAATI OaCGTTATAT TAOTTOTAAT ^AATTATTT OTA<rrTO«T= <:AOTAATT« 
lATTArn^OC ATOATAOr^ TAACTATAA3 CATA.CAACG OTO«T«TA ATTtGCrATT 
TAT^CO AAT«rrATC= TT^CTtOAT TTTAACOArT GTAGAATGOG TCAI.TTAGT 
OAAOATTTTC AACA=AAA=A TTCCOTTTTT CGATAAAOTT ATCTTAAAAO ATTCAACTAA 
rrCTOAOTCA 3G™CAArr CrCAT<=ATAA CCGCTC=CAC CTCOTAGOAA AOACT=CT=A 
AACAOTTACA OATCTTCOAC CT.CA«»AT TATTTTTOT CAAAAT»AC aTATT=AT«C 
KTTTCA«T KCAACrrrA TTTTOCGCAA TAAAACGGTA AAAATCCTK AAOTI«AA«= 
AACAA^OTA ,m<^ AA=TA»TTA ATTAAAAOOA =CCATACCAT GTITACrrTA 
A0TTTTATO3 TAATACCAOT TATOTAOTA OT^CATTAC ™rrTTArr CTCATITOTA 

cccATTS^rr TATCOATrrc a=cg™,ca s^gcottc ato™«tat A==.ACArr3 

OT-«=TAT«C OTTIACCTCG TOTATCTCCA AOAAAAG™ TAOCOCCATT AATTAAAGCC 
CACAAAGCAG GACTAGCATT AACAACAAAC CAATTAGAAT CGCATTATCT AGCAGOAGGA 
AATOTTGACA OAGT^TTCA CGCTAATATI GCTGCACAAC GTgcTGACAT TOATCTTCCT 
^CGAACGIG CTOCTGCAAT TGaCCTTGC. GGACGTGACG TA^GAAGC GGTTCAAATG 
T^rrAATC CTAAAGTCAT TGAAACACCA TTXATCGCAG GTGTAGCAAT G.ACGGTATT 
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AGTAAGCATC ATACAGAAGT ACTTGAAAAC CCAGATAATA TTTCTAAAAC AGTTTTAAGC 16 80 

AAAGGTTTAG ATTCAGGTAC TGCATTTGAA ATTTTATCAA TTGATATTGC TGACGTTGAT 174 0 

KTlKQTKf<M< ATATTGGTGC AGACTTACaA ACTGAACAAG CATTAGCAGA CAAAAATATT 18 0 0 

GCACAAGCAA AAGCTGAAGA ACGTAGAGCT ATGGCTGTAG CAACTGAGCA AGPJ^TGMiA 186 0 

GCGCGTGTAC AAGAAATGCA TGCTAAAGTA GTTGAAGCCG AATCTGAAGT ACCATTAGCT 192 0 

ATGGCTGAAG CATTACGTTC AGGTAATATC AGTGTTAAAG ATTATTATAA TTTGAAAAAT 1980 

ATCGAAGCTG ATACAGGCAT GAGAAATGCA ATTAATAAAC GAACTGATCA AAGTGATGAT 204 0 

GAGTCACCTG AACATTAAGT CGAGAGGTGA TTAAATGAGT GTCGGTATTC TAATTTTTGT 2100 

CATATCAGTG ATCATTTCTA TCATTACTAC TATGCGCGAA AATAGTCATA AAGATAGACA 216 0 

AAATCAAAAG CCACCTCAAA AAACATCTAC CGATAATGAA CCAAAAAAAG GTGGCTTTTT 222 0 

TGAAGAAATT GAGCGAACGT TTAAAGAAAT AAGTGAAGAA TTAAATGAAG AAGAAAAGAA 22 8 0 

ATCATCGAAA CGAAAATATG ATGATACGTT ACCACCTTTA TTCGATGAAC TTCCAAAGGA 234 0 

AGAGCCTAAA TCGAAACCTG TTGTAGAACC TATGGCACCT AAAAAACAAC AAGAAACAAA 24 00 

ACCGATGACA GAGAAACCAA TCACAGTGCC TAAAGCAGAA CCGGTGGAGC AGAAACATAG 24 60 

ACCTTCTAGA CAAGATAATT CTGACGAAAT TAGACGTCAA TTAGAAAAAT CACTTAGAGA 2 52 0 

TGATATTAAA ACGATTCGTA CTGACATTGA TAGAGAAAAA GAAAAGCAAA TTGCTAAAAT 25 8 0 

GGAAAAACGT GCTAGAGATA TTATTGAGGA TAAATACTTA TCTGAACGTA CAAAACGTTT 2 64 0 

GAAATTAAAG CAGCTGCTTA ATTCTCAAAA TGTCGAAAAA GATTTGACTA AATCAGCGTT 270 0 

CCAATTTGAT AAAGATGAAG TAATCAATGG TATGATATGG TCAGAAATTT TAGCTAAACC 2 76 0 

AAAACAATTA TAAAATTTTT TGAAAACAAG CACTATCGTA ATGGTAGTTG CTTGTTTTTT 282 0 

TACGTTAAGG AAAATTAAAA AACAAAGAGA ATTTTTCGAG AAATATTAGT TATTTAAATT 288 0 

ACAGCAAAAA ATTGATTAGT CTAAAATTGA ATCTGCTTTT ATGACAAGGT GAAAAGTATA 2940

AATGATTATT TTAAATTAAA GAAAAATGAG TAAGTCAATG CAAAGATGTT TAAATCAATC 3000 

AATTGCATGA TATAATTAAG TAGATATTAA AGCATCATAG AATGAATATA AATGATATAT 30 6 0 

GAAAAGGAGC GCGTGTATGC CTGGAATTAT ACAAATAGAC GATATGAACC AATCTCAAGC 3120 

TTTAATTGGA AATAATGATG AACATTTAAA AGCAATTGAA GAGAGTTTCG ATGTTGTCAT 318 0 

CCATGCAAGA GGACAAGaAG TTGCCGTTAA AGGTACAAAA ATAGAAAACG TAGAAAAAGC 324 0 

GGAATCAGTA TTAATCAATT TGCTGAAGGT TATTGATTTA GGTAATAATA TTACAATTAA 33 0 0 

AGATGTTGAA GCAGCTATTA AAATGGCGCA TAATAACACA ATTCAACATC TGTTAGATTT 33 6 0 
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GCAACGTATA TATGTTAATG CCATCAAAAA TAATGATTTA GTATTTGGTA TAGGTCCTGC 3480

TGGTACAGGT AAGACATTCT TAGCTGTAGT TTATGCAGCA AAGCAACTCC GTAAAGGTGC 3540

TGTTAAACGT ATIGTATTAA CAAGACCTGC TGTTGAAGCA GGAGAGTCAC TTGGATTrTT 3600

ACCAGGAGAT TTGAAAGAAA AGGTAGATCC ATATTTAAGA CCriTATATG ATGGTCTATA 3660

TACTGrrCTT GGGCGTGAAC AAACAGAGCG ATTTATTGAA AGAgGCATTA TCGAAATAGC 3720 

GCCACTTGCA TATATGCGCG GACGAACATT AGAAGATGCA TrTGTAATTC TTGATGAGGC 3780 

GCAGAATACG ACACATGCGC AAATGAAAAT GTTTTTAACA AGACTAGGTT TTGGCTCAAA 3840 

AATGGTAGTT ACTGGTX3ACC AAACTCAAAT CGATrTACCT AAAGGTGTTA AAAGTGGACT 3 900 

TAAGGAAGCG GTCAGTAGGT TACACAACGT TAAAOGTATA AGTATATTGA AATTAGATCA 3960

GAGCGATGTA GTCAgACATC CATTGGTAAG TAAGATCATT GAACATTATG AAGGAGAGAA 4020

TTAAATGTTT ACGATAGATT TTAGCGATCA CACAGGCTTA GTTAAAGATG CTTGGTATAA 4080

ACAAATTGAA GATTTATTAG AATTTGCTAA AAAAGAAGAG CATATAGAAG ACGATGCTGA 4140

GCTTTCTGTT ACATTIGTAG ATAAACAAGA AATACAAGAA ATTAATCGAA CATATAGAGA 4200

TAArGwTAAr GTTmCaGATG tAaTCyCaTT tGCTTTAGrA GrAGATGAGC CmGaGATkGA 4260

TtTTAGTGGT CTTGATATAC CACGTGTTTT AGGGGATATA ATTATCTGtA CGGATGTAGC 4320

GCAAGrACAA GCAAACAATT ACGGACAITC TTITGAACGA GAATTAGGAT TTTTAGCATT 4380

ACATGGATTT TTGCATCTAT TAGGTTATGA TCATATGACT GAAGCGGATG AAAAGGAAAT 4440

GTTTGGTCGA CAAGATACAA TArXAAACGC ATATGGATTA ACACGAGACT AATTATGAAA 4500

AGGTTTAAAT ATGCACTTGA TGGGCTGAAA ATCTTAATTC AAAAAGACTA TAAATTTCrr 4560

TTACATGTGT TTGCAATGAT TCrTGCTATT GTCTTTGGTC TCGTACTAAA TATTAATCGG 4620

ATTGAGTCGA TATTTATACT CATTGCTATT GCATTAGTTC TCACTGTTGA AGCTTTAAAC 4680

ACTGCTATTG AATATGTTGT CGATTTAGTG ACCGTTGAAT ATCATGATTT AGCTAAATAC 4740

GCTAAAGATA TTGCGGCTTT TAGTGTACTT ATAGTTTCAA TATTAGCATT TATTATAGGT 4800

TTAATAGTAT TTTTACCACA TTTTATAGCG TTATTrTAGG GAGGCATATA TGAGTTATCA 4860

ACCTCATTAT TTTCAAGAAG TTAGAAAAGC ACAACAAGAA TCATATTCGC CATACAGTCA 4920

ATTTAAAGTA GGGGCTTATT TAAAAmCGAA AGACgGTAGA ACTTnTATG GTACCAATGT 4980

AGAAAATGCT TCTTATCCAT TATCGATATG TGCTGAACGA GCTAGTTTGG TATCGGCAAT 5040

nCTCAAGGA TACAGACCAG GTGATTTTCA ATCAAtAACT GTAACCGTAG ATGCAGATAA 5100



.„„^^^^^^ j^^^.-^.y^r-^^C, GAATTATGTG ATGATGATAT 5160 
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ACCATTTGGA TTTTCAGGAA AGGATTTAGA ATAAATGACA GAACATAAAT CAGGATTTGT 52 8 0 

TTCAATTATA GGTAGACCAA ATGTAGGAAA GTCAACATTT GTTAATAGAG TGATCGGCCA 534 0 

TAAAATAGCA ATCATGTCCG ATAAAGCTCA AACAACTAGA AATAAAATTC AAGGTGTTAT 5400 

GACAAGAGAT GACGCGCAAA TTATATTCAT TGATACGCCA GGTATTCATA AACCTAAACA 54 60 

CAAATTAGGT GACTATATGA TGAAAGTCGC TAAAAATACA TTATCTGAGA TAGATGCAAT 5520 

CATGTTTATG GTTAATGCCA ATGAGGAmAT TGGACGAGGC GATGAATATA TTATAGAAAT 55 80 

GTTGAAAAAT GTTAAGACAC CAGTATTTTT AGTATTAAAT AAAATAGATT TAGTGCATCC 564 0 

AGATGAATTA ATGCCAAAGA TTGAAGAATA TCAAAGTTAT ATGGACTTTA CAGAGATTGT 57 00 

ACCTATTTCA GCATTAGAAG GGCTAAATGT CGATCATTTT ATTGATGTTT TAAAGACGTA 57 60 

TTTACCCGAA GnACCTAAAT ATTATCCAGA TGATCAAATT TCAGACCATC CTGAACAATT 582 0 

TGTAGTGGGT GAAATCATTC GTGAAAAAAT CCTTCATCTT ACAAGTGAAG AAATCCCTCA 58 80 

TGCGATTGGT GTTAATGTGG ACCGTATGGT TAAAGAAAGC GAAGATCGTG TTCATATCGA 5 94 0 

AGCAACTATA TATGTTGAAA GAsGTTCGCA AAAAGGAATT GTCATTGGAA AAGGCGGTAA 6 0 00 

AAAGTTAAAA GAAGTAGGaA AAcGTGCGAG ACGTGAtATA GaAATGCtTC TAGGCTCTAA 6 060 

AGTTTACTTA GAATTATGGG TCAAAGTTCA AAGAGACTGG CGAAACAAAG TTAACTTTAT 6120 

TCGCCAAATT GGTTATGTTG AAGACCAAGA TTAATCTTAA AAGTGGTGAA GATAATTGTT 6180 

AATGCGCCAA AAAGGGATTA TCATCAAAGC AGTTGATTAT GGTGAATCTG ATAAAATTAT 6240 

CACGATTTTA AATG 6254 
(2) INFORMATION FOR SEQ ID NO: 405: 
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(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 3710 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 405: 

GTTGTTCTAA ATGTTTCTTG nATGAAACGA GTCAATGTAA ACTGATATTG CTGTATTTGT 60 

TGCAGCAATT CATATTGGTC TGGTGTTGCA ATAACAGCAG CTTGAGTTGG AGTCGCAGCT 120 

CTGATGTCTG CAGCAAAATC ACTTAATGTA AAGTCTGTTT CATGACCAAC TGCTGATATA 180 

ATCGGTGTCT TACAATTATA TATTGCACGG ACGACAGCTT CTTCGTTGAA ATTCCATAAA 24 0 

TCTTCTATGG ATCCACCGCC TCGACCTACA ATAATGGTAT CTACACCTAA ACTATCTGCA 3 00 
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TGTATTTGTT CAGCTAATGG AAAACGACTA TTTATCGTTG AATGGATATC TCGAATTGCG 420
GCACCTGTAC TCGCTGTTAA AACTGCAATT TTTTTAGGAA ACTTAGGTAT TGATTTCTTA 480
TTCGCTTTAT CAAAACAACC TTCTTCAGTT AATTTTTTCT TTAATGCTTC TAATTTTTGA 540
TATAAGTTCC CTATACCATC TAATTGCATT TTATTTACAT AAATTTGATA GTTTCCACGA 600
CGTTCAAAAA CAGAAACACG TGCTTCTAAT AAGACTTCAT CTCCTTCTTT AGGTTCGAAG 660
TTTAATTTAG AAGCACTACC TTTGAACATC ATGGCACTTA TAACGCTTTC TTTATCTTTC 720
ACATTAAAGT ATAAATGaCC ACTTGaATGc TTTTTGAAAT TTGAAAGCTC ACCTTTAATC 780
AATACAGATT GGAGATGTGG ATCTTGATCA AATTTATATT TAATATATTT CGTTAAAGCT 840
GAAACACTTA AATAATCTGA CATATAACAT CACTCAATTT TATTTTTTTA TATTACTCAA 900
TACACCATTT ATAAATTTAT AATGATCATC ATCACTGAAT TGTTTTGTTA ATTCAACTGC 960
TTCATTCATT ACGACTTTAG CAGGTGTATC ACTGTGTAAT ATTTCATATG TTGCCATTCT 1020
TAAAATAATA CGATCCGTTT TTAATAAACG TGCAATAGTC CAATCmTA AATAAGGACT 1080
AATTGTCTCG TCTAATACAG GTTCGTGATC TTTAACGCCA GAAACTAGCC AATGAATAAA 1140
TTCGAAGTCT AAATCTGGAT TATCGTCTTT AATAAAGCTT ATCGCTTCAT TTATCGTTAA 1200
ATCACTGTCC TTCATTTCTA ATTGAAATAA AGTTTGAAAA GCTTGCACTC GGGATTCTTT 1260
ACGACTCATT TTTAACTCCT TCAAACGTTT GTATTTTTCT TTATTTAATT ACTGAATTAG 1320
GTATGACATT ACTTTTCAAT AACGATTTGT GTAATGTGAA TATTAATTTG CTTAGGTTCT 1380
ATCGCTGTCA TATTAGAAAT TGAATTAAAA ATTGACGTTT GAATTTTGTT TGCAGTTTTT 1440
GAAATATTAA CACCATGTTT TAATGCACAA TATACATCTA TATATATGCC ATCTTCTTTA 1500
CTCTCGATTT TTAAATCACG GCTTAAATTT TTACGACTAA CTTTTTCTAA ATTTGTTTCT 1560
TTTAATTCAG CAAAATGGCC AGTGATGCCT TCGACTTCCG AAOTAGCTAT ACTTGCAATA 1620
ACAGATAGCA CTTCTGGCGC TATTTCTACT TTACCTAATT TTGAATTTGA ATAATCAGTT 1680
ACTTTGACCA TGGATTGACC TCCTATTAAC CTTCATCATT CATAATGCTA TTTTGCTCTA 1740
AAAAGTTTGT ATTAAATTTA CCGCTTCTAA ATATATCGTT ATTCAATAAT TTAATATGGA 1800
ATGGAATAGT TGTATCAATA CCAAGAACCA CAAATTCACT TAGTGCACGA ATGCCAGCCA 1860
TAATCGCTTC ATCTCGTGTC GGTTCATGTA TGATTAATTT CGCTACCATC GAATCATAAT 1920
ATGGCGGTAT CGTATAATTA GTATAACATG CTGACTCTAT TCGAACACCA TATCCACCTG 1980
GTGCAAGATA TTGCTCmATT TTACCTGGTG ATGGCATAAA GTTCTTGTAA GGATTTTCAG 2040
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CAGTTACAGG ATGTTCTACT TGAATACGTG TATTCATTTC CATAAAATAA AATTTATTAT 22 2 0 

CATTTAAATC ATATATAAAC TCAATTGTTC CCGCATTTTC ATAATTTACA GCTTTCGCTG 22 8 0 

CACGAACTGC GGCATTTCCC ATTTCACGAC GTGTTTCATC ATCTAAAATT GGGGAAGGTG 2340

CTTCTTCCAC TAATTTCTGC ATACGTCTTT GAATTGTACA ATCACX3TTCT CCTAAATGAA 24 0 0 

TTACATTACC ATAGCTGTCC CCAACAATTT GGATTTCAAT ATGGCGGAAG TTTTCGATGA 24 6 0 

ATTTCTCCAT ATAAAGTCCA CCATTACCAA ATGCAGTTTG AGCTTCTTGT TCTGTCATTC 252 0 

GGAAGCCAGT TTCAAGTTCT TTTTCATCAC GAGCAACACG GATACCTTTT CCGCCACCGC 25 80 

CAGCAGTAGC TTTAATGATG ACCGGATAGC CAATTTTTTT GGCGATTTTC TTAGCTTCTG 264 0 

AGACGTCTTT CATTAAACCG TCACTACCAG GAACAACTGG AACATTGGCT TTGATCATTT 27 0 0 

CTGCCTTAGC AACATCTTTG ATACCCATTT TTTGGATAGA TTGATAACTT GGTCCAATGA 276 0 

ACTTCPiATTG GcATgctTCG CATAATTCTG CAAAATCAGC ATTTTCAGCT AAAAAGCCAT 2820

AACCCGGATG AACGCCATCA CAACCTGTAG AAGTTGCAAT AGATAAGATG TTCGGAATAT 28 8 0 

TTAAATATGA ATCTTTAGAC AAAGTGGGAC CTACGCAATA TGCTTCATCA GCAATTTGAG 294 0 

TATGTAGCGC ATCTTTATCC CCTTCAGAAT AGATTGCAAC AGTTTGGATG CCTAAATCAC 300 0 

GACAAGCGCG AATAATCCTA ACTGCGATTT CACCGCGGTT TGCAATTAAA ACCTTTTTCA 3060 

TTATTTCACC TTAAATAACG GTTGGCCATA CTCTACCATT TGTCCGTCTT CTACTAAGAT 3120 

TTCAACAATT TCACCTGAAA TTTCTGCTTG AATTTCATTA AATAGTTTCA TTGCCTCTAA 318 0 

AATACACACT GTTGTTTCAT TTGAAACAGT GTCCCCAACT TGCACATATG CTTCTTCGTC 3 24 0 

TGGAGATGGC GATTTGTAAA ATGTACCTAC CATAGGTGCA TTAATTGTTT TGTGATTATC 3300

TGAAGTTGGC TTTGGAGCTT CAGTTTTATT GCTATCAGTT GATTGTGCTT GAGGCATAGG 3 360 

CATTGCCGCA GCTTCAACTG GCATTTGTGA GATTTGTGGC GTGATAATCT CAGTTTCTTT 342 0 

TTCTTTCTTA AGCGTCACTT TGCCTTTAGT ATCTTCAATA TTGATTTCCG TTAAAGTTGA 34 8 0 

TTTATCCAGA ATTTCAATTA ATTCTTTGAT TTCTTTAAAG TTCATTATTA CTGACTCCTT 3 54 0 

CAGTTTGTTT TCATCTACCC GTCTATTTTA CTTGAGACAA CTCTTCAATT CAAGCATGTT 36 00 

CATATTGCTG GCGACATTAT AAGTCTATCC CAAAGTTATA ATAAAACCAC ATTTTAAATT 3660 

AAAAACACTT GTGTATTTAT TACTTAACAT TGACTCATCT TAACTCTTGA 3710 
(2) INFORMATION FOR SEQ ID NO: 406:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1705 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 406: 



GCTGATGTTC GTTGCCTTTn TCCACCAGAC AATTCAGAGG GATATTTATC ACTAATATCC 60
AATATATTTA ATGCTTCTGC TACTTTTTCA TAACGATTTA ACATATGTTC TTTATCTAAC 120
TTCTGTACCG TTAGTGGTAA CATTATGTTT TCTTTAACAG TCAATGTATG CAGTAAATTA 180
TACTCTTGAA AAATAAAACC AATATCATGC TTGCGTATAT CAGATwATTC CTTGTTTGAA 240
AGCTTTTCTA ArrrriTTCC TTTTAATGTA ATAGAACCTT GTGAAATATA ATCAATTGAA 300
CTTAAAACAT TTAATAATGT CGTTTTCCCA GATCCAGAGG GACCCATAAT AGCAATAAAC 360
TCGCCTTCTT CAATAGACAT ATTGATATCT CGCAACACTT CTTGTGCCAT rrrnTAGTT 420
CCATATATTT TTGTTAATTG TTTTACTTCT AAAATTGCCA CTTTAACACT CCTATAATTT 480
ATCTTAACTT CATTTCCTTT AGGCTTTGGC ACTTGTATCT TCAATTTAAC ATATGACTAA 540
CATCTATCTT ATTATAACGT TGAAGCTGCA TTGATGTATC AATTCTAAGT AACAAAACGC 600
ATGTTTAAAA TGACAAATTT GTCACTTCCG ACATGCGTTC AACAATTTCA TTTTGTAATG 660
GGAAAATCAA TCTGACAGTT GTCCCCTTAC CAACAGTCGA CGTGACTTGC AGGTGAATAC 720
CTAATTGATC CTTTACACTA TTTACTAAAT ATAGACCCAT ACCTGAAGAC GTCGTTTCAT 780
TTCTGTTAGC CGTTGACGTA AATCCTCGTT CAAATATTCQ CGGCATATCT TTTTTACTAA 840
TACCTCTGCC ATAGTCTTTA ATATATAACG AAACATGTTG ATCATTTAAT TCTGTCCCAA 900
TTTCAATATT AAAATTCTCA CTATATTTCA ATGCGTTTGA CAAAATTTGT CTAATAATCA 960
TACGACACCA TTTTATATCT GTATAAACAT AATCATCCAC TTTAAAGTCA ACATCAAAAC 1020
CAATACCTTT AACCTGACTA ATATGTCTTG TTAATTGTAT TTCATCAATG ACCATGCGTT 1080
TAAGTGACAC GTAATCAAAA TACATATCTT TACGTTGAGA TTCTAATCTA GTAATATACA 1140
GCTGTGTATC TAGCATCGAG TTTATACGAG ACCATTCATA TAGTAATGCT TGTwTTCTTT 1200
CTTGATTTTT TTCTTGATCA ATTAATAATT TCATAGCTGT CACAGGtGTT TTTATGTCGT 1260
GCACAAATTC TGTAATGGTT TGTTCATGCA TGTTCAATTG CAACTGTTGC TCAACAACCT 1320
TTTCTTTGTG CGCTGAGATT TGACGATATA AATAATCAAC TGTATGACGT TGAAATGGCG 1380
TTTCCGCTAA ATCTTTATGT TTAATTTCTT CTATTTCTTT ATCTTTGTCA AAATGCTTAT 1440
ATAATTTTAC TTCTTTAAAA TATGTCAATA AAAGAAAAAT CATTGTTAAA CTTAAATTCA 1500
AAGAAACAAT ATAAAATAAA CTGTCTATTG GAAAATCATA ATCGATTAGA CTArTGCCTA 1560
AATATCCAAA AtATCCAGTT CATGCGAGAT TTCAAAAAAT 1620



GCACTATCCA TACTAATTTC AGATA 1705

(2) INFORMATION FOR SEQ ID NO: 407:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1722 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 407:

TCAATATATC TTAAAATTCA ATGATTAAAT CnATTATCAC TAGACATnAA ATACATAAAT 60

CCTATTCCCC ATTTTCATTT nTTAATTCAT AAATGAATCA ATAACCACCT AATAACAAAT 12 0 

CATATTATAC ACCTTTGTTC TCTATTTTTC TAAGGTTTAA AAAATATTTT TAGGTAAACC 180 

TAAAAATAGA TGTAATAAAA ACGCCTCCTC AGATATTTAT ATATCTATGA AGACGTTTAA 24 0 

ATACATTATA GATGGTCTGG TTCTGGGTGA ACGTATACTG AGGAAAtACC TTTTTTGTGC 3 00 

AAATGATGTT CGACATTGTC ACAAATTTGA TGCGCTTCTA CTAAGGAAAG GTTAGCATCT 3 60 

ACAACAATTG TGACATCAAT AAACACACTA CTTCCATGGT AACGCCCTTT AATACTTTTA 42 0 

ACTTCTTGTA CTTCATCAAC TTCTAAAATA TCATTGCGAT ACGCTTCTAA TTCAGTTTCA 4 80 

TTGAAACCAT CACTCAACAT AAAAATTGCT TCTTTAAAAA TACCAAAACC AGTATAAACG 54 0 

ATTAGTAAGC CTAGTAATGT TGCTAAAATA ATATCGACAA TTGGGAAACC GATTTGCGTA 6 00 

AAAATTAATC CTATCGCTGT TCCAATGCTG ACTAAACTAT CCGATAAATT ATCTTTGGCA 660 

GCCGAATTTA AAGAACTACT TTTCGTTCTT TTCGCTAGTC TTTGATTGAC TGCAAATACA 720

ATCAACATTA CAAGACCACT GATTAAGCTG ACGATAATTG TTATTGCGTT AGGTACAACG 78 0 

TCATCTTCTT TGAACAAACG AGGTGCATTT TGAATAACTA CTTGGATACC TACAAACATA 84 0 

ATGACAAATG ACACCAATAA TGAAGAAATA TTTTCAGACT TCAAATGGCC ATAAGGATGA 900 

TTTCGATCGG CAGGTTTAAT TGAAATTTTC AATCCAATAA TAACAGCTAA AGAAACGATA 960 

ATATCTGTCA TATTGTTTAA TGCATCGGCT CTTACAGCTG CAGAGTTAAA GACAAAACCC 1020 

GTGACATACT TAACAATAGA TAAGATTATA TATACAATTA AACTCAAATA AGCACCGCGT 10 80 

TGCGCCAATT TAAGATTTTC ATTATGAGAC ATGCGTTGAA CCACCTTGAA TTAGTATAGT 114 0 

AACAATATTA TGAATGATTC ATTTTAATTT TACAACGTTT TTAATTTTTA TAAATTTTTA 12 0 0 

TAAAATTAAA CTAATTTATT CATTCGCAAC CCCTAAAAAT AATTTTTAGC CTTTCTGCGA 126 0 

ATTTTATGAG CTAGAAAGGC GCCCAACTCT CCCTGTTTGT TAACTTTCGC CTCGAAAGTT 132 0 
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CGAAwwTTAT GAGCTAGAAA GGCTTATGCA GTTGACGTTT TACGTCCAAC TGCGTTCCTC 144 0 

CGTCTTCTTC AAATTTATTT GTnAGAAAGG CACCCAACTC TCCCTGTTTG TTAACTTTCG 1500

CCTCGAAAGT TTCTATGTTA GAACCCTATG CATGAGTTGC GAAnTATCTA ATGTCGTGAA 1560

CTAATTATAT AGAAGAAAAA GTGCATCAAT GACAAATTAA ATGAGATTTC TACTCTACCA 1620

AACTCTCTTC GAAAGACAAT TTTCTCCTCT ATTTATTAGC AACTATTGCA TTTCTCCATA 1680

TAGTACTTCC TTACTTAAAA TACGCTGAAT GTCTGAATTA AA 1722



(2) INFORMATION FOR SEQ ID NO: 408:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 5521 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 408:
GCGAGACCCC CTGAGGGAGC AGTGCCCAGT CGAAGACCCG AGGCTGAGAC GGCACCCTAG 60

GAAAGCGAcC ATTyCAATAC GAaTTgTGAt AAATAGAGAA CAGCAGTAAG ATATTTTCTA 120

ATTGAAAATT ATCTTACTGC TGTTTTnTA GGGATTTATG TCCCAGCCTG TTTTTTGTGA 180

TTTTTAATAA TTTGAATATG GAAAATGTAT TATTCTCTCA TTTGTATAGA TTGTATTTAA 240

TAAGTTAATG TAATCCTTGA GCTCACGATT AATAAAATTC TATAACCTTA ATTATTTTCT 300

CGATACAAAG GGTTATTAAC TTTAATATAA GTATAATGAT GTGCCTCATC TTCAAGACGC 360

ATTGTTGTAA CACACTTATA ATCTATAAAT GGCGCGAACA TGGTATCTTT AATTTCaTTT 420

ATGCGATTCT CATTTACTTG ATTAGATTTG TGTGTCGAAA GTACAAGTTG ATCaAAAATG 480

TTATCTAGTA CATCACGAAC GATATACCAC ATATGTCTTT CTAAGTTTGA ATCATTTGAT 540

GCTTTAGAGA TTGTAAGAAT TAATTCGCCT AAATGGTTTT GAACGGTAGA ATAAAAGGCT 600

TTGTTAAACA CAGACGTTTT TGAATCAGTA AGAATTCTTG ATTTTTCATG GAAATGAGAT 660

GTACTGTATA CCATTTCATT TAGTTGTGCT TTATCAATTC TTAAACCTTC AAAGTCTCTA 720

ATATACATCG TGTCCAATAG ACCATCTTTC CGAAATGTTG CAATAGCATT TTGCAAGTGA 780

GCCTCTAATG CAATGCCATA TTTAGTAACT AGTGGGATTA CGAGACCAAG CAATGCTTTA 840

CTATAAGTTT CAACCCACGA TTTCGCCGAT GATTCAAAAT CAGACAATGA TGCAGCTGAT 900

TGATAACGTT TAATCAATGT CACGATAGGT GATTCATTGT TAAATGGGTA GGTTGCAACT 960

.^.^^ ^^^^^^^^^ ^^^^^^^^^^ ^^^^^^^^^^ 1020 
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AAAAATGAAT 


ACCAGCAACT 


TCATCAATAA 


TTGTTGATGC 


ATAGGACTTA 


AATATGACAT 


1140 


CTTTCTCCAA AATATCATTT AAAATACGTG TCATTAGTGG ACCATTGTGC GTCGTTTGTT 1200

CTGATAATGT ACGAATCTCA CCTGTAATAT GAACGTTTGT CGACAATTTG ATGTGTGGCG 1260

ACATAGCTGG GTATTTAGGA ACTAATGTTC TGAAAGATAA ACCAGCATAA TAATCCaACG 1320

TATGTTTTGC TTCAATGATT AATTCTTTAT CTACTTCTGC TTGATAATCA GAATGTAATA 1380

CGTCATCTAA TTGCCATGGA TGAACAATCA TAATGTGATA ATCATTAAAG TTAAACTTTG 1440

GCGTAAATTC ATTTTCTAAT TGTTTAATTA AGTCCGGAAA TAGTTGATGA ACAGTTGTAT 1500

CATAATCTTT AGACAGTGAC ATAGTACGGC TTAATTTACT GTGAATCAGT ACTATTTTCA 1560

ACTTAATAGG TTGATTGAAT TCTGAAGAAT ATAGGAATGT TTGTAATGCA TTTAAACCTT 1620

TACGTAATTT AGCCCCAGGA TGTAGCGGAT GACCTTCAAT AACGGCTTGC TCTGAACGCA 1680

AGTAACTATC TTCGCTATTT TCGATAATAT TAAATAAAGG TGCAGAATCA TGTTGCATTG 1740

ACAGTGCTTG ATAGCTAATT GCAAATGTCA TATtAGTtGC ACTGTTTATT AAATCTTGCT 1800

GAAATTGATC ACTAGCAGCA TTTTTTAAAT CTGGTGCTTC AATTAAAATA CACTCAAGAA 1860

TTTCATTTGG ATGGTGTACT CGTGTAATCG TATTTGTAAT GTCATCTTTA ATGTAGAAAG 1920

GGCCTTCAAC ATCAATTCGA TCAAAGGCGT GTTCTCCAGT GATAGGAGCA TATAATGTTT 1980

GCTTAGCTTG TGGGAACTGG ATTTCTAGTA TATGAGTCGT TGAGATATCT AACATAATCA 2040

AATCACGACT CAATATTTTC TTACTTTGAG TGCGCGCTTT AACTAAGTTT TCGCGATGCA 2100

TTGATGTGAC CAATCTCTGA GTGACTTTAT CTCTTCCTTG TAAAATCATC TCTTTAAAAA 2160

TATTAGCCCA ATCGCTATTA TGTTGTTGTA AAAATAAATA TGTTTCTTGT TCTTCTTTGT 2220

TAAATTTTAA TGTCTGTTCT TTAAAAATTA AGTTCAAGTT CATAATTCAC CTCTATGAAA 2280

TATTTTACAA AAGCAAGATA GATTTGTATA ATCCATATTA ATGATAATGA yTCTTATTAT 2340

CAACAGAATG CGGGTGTAAG TTTTATGACA AAATATTTTT TTAGCAGTTC TTTTCTACTA 2400

TTTCTAGGTA ATTGGATTGG ACAAATAGGG CTAAATTGGT TTGTACTTAC CACTTATCAT 2460

AACGCAGTTT ATCTGGGGAT TGTCAATTTT TGCAGACTTG TACCAATATT ATTACTAAGT 2520

GTGTGGGCAG GGGCAATTGC CGATAAATAT GATAAAGGGC GATTGCTGAG AATTACAATT 2580

TCATCATCAT TTTTAGTAAC TGCAATTTTA TGTGTGCTCA CGTATAGTTc ACTGCAATTC 2640

CAATTAGCGT CATTATTATA TATGCGACAT TAAGAGGGAT TTTAAGTGCG GTTGAAACAC 2700

CTTTAAGACA AGCAATCTTA CCAGATTTAT CAGATAAAAT ATCTACTACA CAAGCTGTmw 2760

CATTTCATTC ATTCATCATT AATATTTGTC GTTCAATAGG GCCTGCCATT GCTGGTGTCA 2820






CAGTTTTATT 
GaTACATGCC 
TATTTATAAC 
TTTTGACAAA 
GTGCCATTGG 
TGGTAAATAT 
TTCACAATAT 
CACGTACGAC 
TACTGAGTAT 
ATATGTGCAG 
TGCATTACAA 
AATCATGGTA 
CGCCATTATT 
AATTGAAATT 
AGAACGCTAT 
TATAACTTTA 
GCGACTACGC 
GTCTCACAGA 
TAACTTTTTT 
TTCTGAGAGT 
CTTAACTATG 
AATTATGATG 
TATTATTGAT 
AGGTTTGAAA 
GATAGGGAAA 
AATACTTCCA 
GTTAAgTTGC 
ACTGTAGATG 



ATGCTTACCA 
GTTAAAAGTT 
ATCATTATTG 
CAAAGTATTT 
TGGTATTATT 
GTATTATTTA 
TGTCATCATG 
AAATCGCGTT 
TaTTATGATG 
ATGTGTTTGG 
TGGTATTCTA 
TATCTTGAAT 
AAAGAACGTA 
CAGTATCATA 
GAGTTTACAG 
GAAGGTTTAT 
GAAGAGTTAA 
CAAACGTTAA 
TCakGGTtAC 
TTGGTACCAG 
GAAGAAGTAC 
ATTGAAAAAG 
GAAATAATTC 
AGTGAAGACT 
TATTTTGAAG 
AAAGCaACTT 
CCgTCGATGC 
GACCAAAGTT 



TTACATTTTA 
ATTATAGATT 
ATTATGGCGA 
CCGGGGAAAT 
GCAACGCTAG 
AGTTCATTTT 
TTCATTTGTA 
TATTTTCAAA 
GgATAGAGGT 
CATTGTTAGA 
TTTTATAAAT 
GGGCAAAGGC 
TTTACCCCGA 
TGCATGTGTT 
GTGATATTAC 
TAGATGTGTT 
TACACAGTCG 
TAAGTCmAAG 
AACATGTAAA 
AGGGGCATCC 
GAGCATATGC 
ACCATGTTGT 
CCGAATACTA 
ATAGAGCGAT 
CATGGnTTGC 
aTCatTTaGG 
aCAAGCAACA 
AAGTTATGCT 



AAGTAACTAA 
ACTTCAAATT 
CAGGTTTTTC 
CTGAAATATT 
TTTTACCTAA 
TATTTGGCAT 
TTACATTGAT 
ATAATGTTAA 
ATGaTTCCAT 
ACTTTTTCAA 
AGAAAGTTGA 
AGATAGAAAT 
GCAAACATTT 
GACTATTGAA 
TTATTTAAAT 
GAATCATGAC 
AGATAGTTTA 
TTTTAAATTT 
AGATAGTGAT 
AACACACCCT 
ACCTGAGTTT 
GTGCACAGCT 
CAATCAGATT 
TTTAGTACAT 
TAAAAAAATA 
ACGATGTCTT 
AGTGCCGTTA 
TTACAAAACA 



TATGCAAATH 



AATACCTGAA 
ACATATGGAA 
ATATACGACA 
TGGTATCGCT 
AGTACTTAAA 
TGCTTTGTTA 
TGGGTTATTT 
AGATTATGAA 
kGGGAAGTCn 
TAATGGGAAT 
AGTTAAAGTT 
ATTCAATATC 
ATTTCGCAAA 
GTTGTTAGAA 
AAAGGTGAAA 
TTTGATATCC 
GTTGAAACAT 
TCAAGGTTAC 
AAGACAGATG 
TTAACCAAAA 
GAAAAAGAAA 
ATGGATGGTA 
CGTGTGTTTT 
CCTTGGCAAT 
TTAATTCCAA 
TAATTGATAA 
GAACAGTCTC 
TGTTGAATCa 
TTGATAAAGA 



GaTGCATCAA 
GGTCGACAAA 
CTTTTACCAG 
ATGACGATGT 
TATATTGGTA 
GGTGTGGTAT 
AGTCAATGGG 
CGTGGTAAAG 
ATTAATGAGT 
AAGTACTATA 
GGAGGAAAGT 

gtgtaattaa 

AAGGATCTTT 
AAAGTGCATT 
CGTCATTAAT 
CTATTTCAGA 
ATAAGCAAAT 
CACAAGATAT 
ATTTAACTTA 
CGAAATTGCC 
TCCCTTTGCA 
ATGATCAATT 
TAAAGAGTTT 
ATGATCATAC 
CGCCGTTTAC 
ACCATACCAT 
AACTGTGACT 
ATATCCaGGA 
TAGGGCACGT 



2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

-V /• r* 
_> o w w 

3660 

3720 

3780 

3840 

3900 

3960 

4020 

4080 

4140 

4200 

4260 

4320 

4380 

4440 

4500 

4560 

4620 
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AGTGCAAGTC TAGTTAATAA AAATCCAATA GATCAAAAAG TTATCGTGGA TAGTTACTTA 4 74 0 

GAGTGGTTAA ATCAAGGAAT TACTAAAGAA AGTATTACGA CATTTATTGA ACGATACGCT 4800

CAAGCATTAA TCCCGCCTTT AATTGCTTTT ATTCAAAATT ATGGAATTGC TTTAGAAGCA 4860

CACATGCAAA ATACAGTAGT GAACTTGGGG CCACATTTTG ACaTTCAATT TTTAGTGAGA 4920

GATTTAGGTG GTTCTAGAAT TGATTTAGAA ACATTACAAC ATCGTGTATC AGATATTAAA 4980

ATTACAAATG ATAGTTTAAT AGCTGATTCT ATAGATGCAG TGATTGCAAA ATTCCAACAT 5040

GCTGTTATTC AAAATCAAAT GGCAGAATTA ATCCATCATT TTAATCAGTA TGATTGTGTT 5100

GAAGAAACCG AATTATTTAA CATAGTACAG CAAGTAGTAG CGCATGCCAT TAACCCAACA 5160

CTACCACATG CAAATGAGTT AAAAGATATT TTGTTTGGAC CAACAATTAC TGTCAAAGCG 5220

TTGTTAAATA TGAGAATGGA AAATAAAGTA AAGCAATATT TAAATATTGA GTTAGATAAT 5280

CCGATAAAAA AAGAGGTGTA GTACTACATG GCACACGTTA ACATAAATAT ATCGAAGATT 5340

AAaTATAACG CCAAAGTACT TCAAACAGTT TTTCAAAGTA AAAATATGCA ATTCACACCA 5400

GTAATTAAGT GCATAGCTGG TGACCGTACA ATTGTAGAAA GCTTAAAAGC GTTAGGTATC 5460

AATCATGTTG CAGAATCCAG ATTGGATAAC ATAATTAGTA TTGCAGATAC AGGATTTAAC 5520

A 5521
(2) INFORMATION FOR SEQ ID NO: 409:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1261 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 409:

AGGTCGTCTr GCaGmAGmGm TATTTGACGA TCTTGCTTTC CCAAAACACG ATGATGATTT 60

TAACATACTG TCTGATTATA TTGAGACACA TGGTGATTTC aCATTGCCAA TGTCTGTATT 120

TGATGATTTA TATGAAGAAT ATACGGAATG GCTAAAATTT TAATATAATT TTTAATAATA 180

ATAGTTAGAA CCAGGGTGAT GCAATTCGTT ATCCTGGTTT TAATTTAAAA TAAACTAAGT 240

TGTGACTAAA AATTAATCAA TTATAGTGAA ATATGGTGCG CTATCTTGCA TAAATTGATA 300

TGATTAACTA CACAGAATTT AAAAGTACAT AATACATAAT AAGGAAGTGA TACAATGGAT 360

GATfiAGCAAC ACACATCTTC ATCCGATGAT GAACGCGCTG AAATTGCAAC AAGCAATCAA 420

GACCAAGAAA CTAATTCATC GAAACGCGTT CACTTAAAAC GTTGGCAATT CATATCAATA 480
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CAAAAAATAA 


GTGGTTTAAA 


CAAAACTGAT 


CAAGCAAACT 


T a a Jl THIV 2V AT 




600 




TATAAAATCT 


TAAATAGTGA 


TWT" IS T^r* R ^ JS IS IV. 

i lAI 1 ALAAA 


AAACAGGACT 


r*m & r* A AfJTT 


AAGT AAAG CT 


660 


5 


GCAATTGATG 


G CAToGTCAA 


ACxAAl lAAAA 


GAT C CTT ATT 


1 1 /\ i 1 ± 


AAPAAAAGAA 


720 




CAAACCaAAAr 


L L 1 I 1 AA 1 oA 


a nr"TV2'T*T*Tr" Zi 
AoO i vj i 1 i w\ 


vjvj IviA 1 1 1 1 Lt 




TGCAGAAATG 


780 




CAAAAGAAAA 


ATGATCAAA i 




AiG J. s- V. 1 A A\iA 


A'J*^-5A 1 k.- -i. 


A r;PA ryvACGT 


84 0 


/O 


GCTGGCATTC 


G TC CTAAAUA 


1 G 1 vJA 1 i AV. i 


a & a r*T a & a Tfi 
AAAG 1 AAA 1 


a A ATP A AT 


TAAuAGGTAAA 


900 




GCATTAGATG 


AAGTTG iLAA 


AGA i \j 1 1 Uvj i 


AAAoAAA 


apnPTrzAAnT 


PAPTTTAACT 


960 


15 


GTTCAACGAG 


GTAOTGAAGA 


AAAAGAv^sji i 1 


a ar* a'mi ^ z^r* 

AAsjAl lAAAU. 


nTrtAAAAAAT 


T P ATCJTT AAA 
X L^in X ^ X X T^f^d^ 


1020 


AGTGTTGAGT 


ATAAGAAAAA 


TV (^o'p js a a^'PT* 
AGGTAAAG i 1 


GGAGl lAi lA 


p^aTm a T A A 


AT^TPPaOAAT 


1080 




GATACATCAG 


GTGAATTGAA 


AGATGCAGTT 


CTAAAAGCTC 


ACAAAGATGG 


TTTGAAAAAG 


1140 


20 


ATTGTTTTAG 


ATTTAAGAAA 


TAATCCAGGT 


GGACTACTAG 


ATGAAGCTGT 


TAAAATGGCA 


1200 




AATATTTTTA 


TCGATAAAGG 


AAAAACTGTT 


GTTAAAerAG 


ailAAAGGTAA 




1260 



1261 



(2) INFORMATION FOR SEQ ID NO: 410:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2488 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 410:

PJkKTATATTG AAnAGAnAAT TACTAAGATT AAATCnTCTT AAAATATCCC TGAAATAACG 60

TCCTAAAGAT TAAAGGAAAG AGGTTATAAG TTATGCCAAA ATTAATTTTA TGTCGTCATG 120

GACAAAGCGA GTGGAATGCT AAAAACTTAT TTACTGOATG GGAAGATGTT AATTTATCTG 180

AACAAGGTAT TAATGAAGCG ACTAGAGCAG GTGAAAAAGT AAGAGAAAAT AACATTGCCA 240

TCGATGTAGC TTTTACATCG TTATTAACAC GTGCTTTAGA TACAACGCAT TATATTTTAA 300

CTGAATCTAA ACAACAATGG ATTCCTGTAT ATAAAAGCTG GCGTTTAAAT GAACGCCACT 360

ATGGTGGATT GCAAGGCTTA AATAAAGATG ATGCTAGAAA AGAATTTGGA GAAGAACAAG 420

TACATATTTG GCGTCGTTCT TATGATGTGA AACCACCTGC TGAAACCGAA GAACAACGTG 480

AAGCTTACTT AGCTGATCGT CGATATAATC ATTTAGATAA ACGTATGATG CCTTATTCTG 540

AAA^^^^^.Aj\ AGATACTTTA GTTCGAGTGA TACCATTTTG GACAGATCAT ATTTCACAAT 600
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TTAAATATCT TGAAGATGTG TCAGATGAAG ATATCATTAA TTATGAAATT AAAACAGGTG 720 



CACCGCTTGT TTATGAATTA ACGGATGATT TAGAAGTTAT AGATAAATAC TACTTATAAA 780

AaAAGAGCTG CATGTACACA AGGAGTGAGT GTATATGcAG CTCTTAAAtA TGTGAAGTAA 840

TGTAAGGAAA TAGTTAAGTA TAGAGTTTAT ATTAACGAGC TAGGGATACT CGAAAATATA 900

GTTAGACATA CAATATAGTC AAATTAAAAC AATTATTTCG CTCTTTTATG TTGCTTAATA 960

ATCTTTAAAG CACGCTTTCT TGTTTTAATG TTAGGGCTAT TTAAATTACG ACGAGCAGTC 1020

TGTAAATCTA ATTTCATCTC TATCCCTCCT TGTAAATATA TTATGACCGA TAACTACTCA 1080

TATGTAAATA GTAATGATTA CGTTTTAAAG AAATTGTAAT AAAGTCGTGC TAATTTTTTG 1140

GAAAATGGGT ATAATTACCG GATATCTAAA AATGTGTGTC GTTTTTTAGA TGGTGAGGGG 1200

GAAGCTTTAA ATGTCGAAGA AACAAAAATT AACQATGATT ATTACTATGC TGATGGGTGG 1260

ATTTTTTGGA TTATTAAATG AAACACTATT AGTGACGGCT TTACCAAGTA TTATGAAAGA 1320

TTTTGAAATT TCATATACAC AAGTTCAATG GCTGACAACA GCTTTTTTAT TGACTAATGG 1380

GATTGTTATT CCTTTGTCCG CGCTTGTTAT ACAACGTTAT ACAACAAGAC AAGTGTTTTT 1440

AGTGGGTATT TCTATCTTTT TCTTAGGTAC ATTACTCGGC GGCTTGAGTC CGCACTTTGC 1500

AACATTATTA GTTGCTAGAA TTATTCAGGC GTTAGGCGCA GGTATTATGA TGCCATTGAT 1560

GATGACAACG ATTTTGGATG TTTTCCAACC ACATGAACGC GGTAAATATA TGGGGATATT 1620

TGGTTTGGTA ATTGGTTTAG CACCAGCTAT TGGACCTACT CTTTCAGGTT ACCTTGTTGA 1680

ATATTTTAAC TGGAGATCGC TTTTCCATGT TGTCGCTCCA ATTGCAGCTG TGACATTTTT 1740

AATTGGaTTT AAAAcGATAA AAAATGTTGG AACTACAATT AAAgTACCTA TTGATTTTAT 1800

TTCTGTCATT TTTTCTGTAC TAGGTTTCGG CGGGTTATTG tATGGAACGA GTTCaATTTc 1860

AGAAAAAGGT TTTGATAATC CTAcGtATTA GTATCTATGA TTGGAGGCGT TGTTTTAGTC 1920

GCATTATTTG TAwTACGTCA ATATCGGCTA TCAACACCAT TATTaAATTT TGCTGTATTT 1980

AAAAATAAAC AATTTACAGT TGGTATCATT ATTATGGGTG TCaCAATGGT ATCGATGATT 2040

GGTTCGGAAA CGATTTTACC TATCTTTGTG CAAAATTTAT TGCATCGTTC AGCTTTAGAT 2100

TCTGGATTAA CTTTATTACC AGGAGCAATT GTTATGGCAT TTATGTCGAT GACTTCGGGT 2160

GCTTTATATG AAAAGTTTGG TCCTAGAAAT CTTGCTTTAG TAGGTATGGC GATTGTTGTT 2220

ATTACTACGG CTTATTTTGT TGTAATGGAT GAACAAACAT CAACAATCAT GTTGGCAACA 2280

GTTTATGCGA TTCGAATGGT TGGTATCGCG TTAGGATTAA TTCCAGTAAT GACCCATACG 2340

ATGAATCAAT TAAAGCCAGA AATGAATGCA CATGGTTCAT CTATGACAAA CACAGTACAA 2400
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15 






AACTTTTCTC CAACTATGTC AGACTATA 2488

(2) INFORMATION FOR SEQ ID NO: 411:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1105 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 411:

TTTTACAATT TCAGATATCT CTAAAACACA TACCGATCCA ATGGCCAAAA TTATTCGTCA 60 

AAAATTGAAA AAACTAGGAA TTCGTAAAGG GATTCCAGTT GTATTTTCAG ATGAAAGTCC 120 

AATTGTCATA AGAGAAGATG TAAAAGATAT AGTTGGAGAT AAAAATGCTA TCAATCGAAA 180 

AGGACAAATG CCACCTTCTT CAAATGCCTT TGTGCCAAGT GTTGTTGGAT TAATTTGTGC 240

AAGTTATGTG GTGAATGACG TATTAAAAGA TATTCCAGTT CGTCGCATTA AAGACAAAGG 300 

GCAATAATTC ATTTTGAAAG GGATAATTTT CAACGTAAGG CAAGTGTAAC CACACCATAA 360 

AACTAAAACT GACTAGTTCG CAAGCATAGT AGAATATGCC TCGTGTACTA GTCAGTTTTG 420 

ATTTGATTAT AGCTAAAATA TCAAAGGTAA TCGATTTAAT CTTATTTTTA CGGCCTTTGT 480 

TTAAGTAAGT TGTTATATAT TTCTTTGAGT TGTTGTTCAC TTTTAGATGT CGTTTTTGGT 540

TCGTAATAAA TTTTGTTTTT TAGTTTATCA GGTAAATATT GCTGTGAAAC ATAGCCATTA 600 

ACATATTGGT GTGGATATTT GTAACCAATA GATCGGCCTA GATCTTTAGC ACCTTGATAA 660 

TGTCCATCTT TTAAATGGTT TGGAATTTGG CCCACATGAC CGTTTCTAAT ATCGGACAAT 720

GCACTATCAA TGGCACTCAT TGCTGAATTT GACTTAGGTG ATAAGCATAG TTCGATTACT 780 

GCTTGGCTTA GTGGAATTCT AGCTTCTGGT AAACCTAGAC GTTCTGCTGA TTCAATAGCA 840 

GCAAGTGTTC TCTQACCAGC ATTAGGTGAG GCTAAGCCTA TATCCTCATA GCTAATTACA 900 

AGTAATCGTC GAACTATTGT AGGTAAATCT CCAGCTTCAA TTAATCGTGC TAAATAATGT 960 

AAAGCGGCAT TGACGTCGCT ACCACGGATA GATTTTTGGA AAGCGCTCAT AACATCGTAA 1020 

TGCATGTCAC CATCCTTGTC ACTTACAAAT GCACCTTTTT GTAAACAGTC TTTAGCATCT 1080

TGCAATGTAA CATGTCGATA ACCGT 1105 

(2) INFORMATION FOR SEQ ID NO: 412:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 579 base pairs

(B) TYPE: nucleic acid
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 412: 

TAACGTCGAA CTTGAGCTGT TACGTTATGA CTCATAATTA TTTTAGCATA GTCATTTAAA 60

TAAACTTCTG TTCTGTCTGT TGGATAAGCA AATTCAAGCA ATTGACTGTA ACTATCATTA 120

ATGACTTCTT GATCAACATG ACTATCAAAA TATACAGCAT AATAATAAGT ACCATCAACC 180

ATATATAACA AATCTTCAAA CTCTGTAGTT ATTGGATTGC TATGATATGC ATAATTAATA 240

ACATCTTCTA AATCGTTAAA TTTCACAATG ATTGTTCTTG TATTTTTACG TGCTGAAGAC 300

TTTTGACGTT TAGAACCTTG AGCTTCTTTT TCTTTTGTTC GTTGCTCGAA TAATTCTTCT 360

AATTGATCTT CACCTTCTAA TGTTTGAGCT AACAATTCTT GAACTTGTTC ATCAAATkGA 420

TCAGTTGCAT CATCATCAGA CATATTCATC ATATCTTCAT TTTTAGATTT AGAAATTGTG 480

ACTTCGACAC CTTTTTCAAA GGCATGTACT TGAATCCATA ATGGACCTnC TACAACAAAA 540

TCTTCTACTT CGTTAATTTC ATCCATCATT GAnCAAAAG 579



(2) INFORMATION FOR SEQ ID NO: 413:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1342 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 413:



GTGGTGAAAC TTCTTGCTTT GTAATTTTAT AAAGTGGATC AATATCACTC TGACTTACAT 60

CAGGTTGAAC TGTCATCTTT TTAGTTACTT TGTTTTCAAG CGTAATATTT GCTAATTCTA 120

GGtTTTACAG AATAATGAAT GTCATCAGCT AmCCCTTTmni CtTGATACTy CTCTTGATAT 180

TTTCCAGCTT TAGCATTTGA TAAATCAATC ACTACTCTTA AATCTTCTGG ATTTTCAATT 240

TTTATTATCT TTGATTGTGG TCCTGAAATA GTCACATTAA CTGTTTCAGG CGCTTTTGTT 300

AAATGCAAAT CTTTAGTGTT ATAAAGAATT TCAACGGGTA CATCTTGAAT CGTTTTACTA 360

GACTTTTGAC CAAGATTACC AGTGTTAAAG ATATTTCCAA AAACATTGTT AACAGATAAA 420

AAGAAAAACA ATGCCAAAAG AAAGGCAATA AATCTCAAGC CCCATTTACT TTCTAGCATA 480

TTATTTCACA CCTTTCTTTT GAAAGCGTGT GCCAAACCAA TGTTCAGCAA GCAACTCTTC 540

AAAAATTTCG TTTGAAATGT CTCGTCGTAA TTTTCCATCA AATGTTACCG AAATATCACC 600

AGTTTCTTCA GATACAATAA CGGTAAATGC ATCAGATACT TCTGAAATAC CAACCGCAGC 660
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15 






TGCTGCTGCT GCAATCTTCG TGCCTTGAAT AATCATTGCA CCATCATGTA AAGGTGTGTT 780

AGGTATAAAG ACATTAATTA AAAGTTCTTG CGAAATATTT GAATCCATTG CAATACCTGT 840

TTCAATATAA TCTTGAAGAC CTGTTTCTTT TTCAAAGACA ATTAATGCAC CTATACGTCT 900

TTTAGCCATA TATTGCACAG CCTTTGAAAC CGATTGAATC AATTTCTCTT CATCTTTACT 960

ATACGTATTA GAAGTATAGC GTTTrAAAAA GCTACCTCTA CCAAGTTGTT CTA-ACGCACG 1020

TCTAATTTCT GGTTGGAATA TTACTArrAA AGCTAATACC CCCCATTGAA TAACGATATC 1080

GAATAATTTA GATGITGCAG TCAAGTTCAA TATCATACTT ATCTGCTGAC CAATAACAAT 1140

TACTAATATC CCnTAAGTA ATTGTATCGC TTTAGTTCCC TTAAAGACCG TGATGAGAAG 1200

ATAAAGTACA TACCAAACTA TCAGTAAATC AAGGATACTC GTTACAATTT TTAACGTACT 1260

GAGGTTrrGA AAAAAGTTGG AAAAATCCAT AACATCTCCT CCGGGTAATA TTTTTCCATA 1320

ATACCCATTA TACCAATCAT TT 1342

(2) INFORMATION FOR SEQ ID NO: 414:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1073 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 414:

CTGTATAAAG ATGGAGGTGT mGCATGGT AAAACGTACT' TATCAACCAA ATAAACGTA-A 60

ACATAGTAAA GTTCAIX^TT TCAGAAAACG CATCAGCACA AAAAATGGCC GTAAAGTTTT 120

AGCGCGCCGT CGTCGTAAAG GCCGTAAAGT TTTATCTGCA TAAGATCACT GACTCATCAG 180

TX^AtCTlcTTT rTTCGrTTAA ATTAAGAATA AATAGAAATT TATGTTATAA GCTCAATAGA 240

AGTTTAAATA TAGCTTCAnA TAAAAACGAT AnATAAGCGA GTGATGTTAT TGGAAAAAGC 300

TTACCGAATT AAAAAGAATG CAGArTTTCA GAGAATATAT AAAAAAGGTC ATTCTGTAGC 360

CAACAGACAA TTTGTTGTAT ACACriGTAA TAATAAAGAA ATAGACCATT TTCGCTTAGG 420

TATTAGTOrr TCTAAAAAAC TAGGTAATGC AGTGTTAAGA AACAAGATTA AnAGAGCAAT 480

ACGTGAAAAT TTCAAAGTAC ATAAGTCGCA TATATTGGCC AAAGATATTA TTGTAATAGC 540

AAGACAGCO. GCTTAAAGATA TGACGACTTT ACAAATACAG AATAGTCTTG AGCACGTACT 600

TAAAATTGCC AAAGTTTTTA ATAAAnAGAT TAAGTAAGGn TAGGGTAGGG GAAGGAAAAC 660

A^AACCACT CAACACATCC CGAAGTCTTA CCTCAGACAA ACGTAAGACT GACCTTAGGG 720
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TTTAGATACA ATTACGAGTA TTTCAACACC AATGGGTGAA GGGGCAATTG GAATTGTTCG 84 0 

ATTGTCTGGA CCGCAAGCCG TTGAAATTGC TGACAAATTA TATAAAGGAA AACATCTTTT 900 

AAATGATGTT CCATCACATA CGATTAATTA CGGTCATATT ATTGATCCAG tCTAAAGAAG 960 

TGGTTGAAGA AGTTATGGTG TCTGTGTtAA GAGCGCCAAr AACATTTACA CGCGArGATA 1020

TTATAGAgAT TAATTGTCAT GGTGGtATTk TAACGATTAA TAGAGTGCTG GGA 1073 
(2) INFORMATION FOR SEQ ID NO: 415:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 3176 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 415:





CTTTACCAAT GCCAAATCCG AAGTAAAGTA TAGCAATAAA GATTACTAAT ACAATTCTGT 60

AAATGGCAAA TGGAATTAGT TTGATTTTGT TAATTAGATG CAAGAATGTT TTGATTGCAA 120

TTAGTCCAAC AGTAAATGCA GCTAAAAAGC CTAAAATATA AAAAGGTATA TCAGCAATCT 180




GAATATCTTG 


ATAATGTTTT 


AATAAAGATA 


AACCACTAGC 






A n 




CCATAATAAA TGTAAAGTCC GATGCTGCTT TATGATTTAA TTTCATTAAT ACCCCAGTTG 300

AAATTGTTGA GCCTGAACGG CTGAAACCAG GCCACATAGC TACTGCTTGA gAAATACCAA 360

TTACAAATGC TTGGAAATAA CTGATTTGAT CTACTGTTTG TGGGTTTTTA ACTTTAGCTG 420

AGTATTTATC AGCAATAATC ATATAGATAG CACCTACGAA TAAGCCAATC ATAACAGTTG 480

GCACACTAAA TAAATGTTCT TCGATGAAAT CATCAAATAG TAAGCCTAAA ATACCTGCTG 540

GCACCATACC CACTAATACA TGTAATAAAT TTAAACGTCT TGGCTTTGAA CGTCTTTGTT 600

GATCGTTATC TCCTTCAACA TGTTTGTGTT TACCAATATG TAAAATCTCT AAGAAGCGTT 660

CGCGGAACAC CCATGCTGCT GCAAAGACGG ATCCTAATTG GATGACGATT TTAAATGTAA 720

ATGCTGACTG AGAACCTAAA AATTCAGATG ATTTTAACCA CATATCATCA ACTAGGATCA 780

TATGTCCAGT AGAGGAAACA GGTGCAAATT CTGTTAATCC TTCGACGACC CCTAAGATAA 840

TACCTTTTaT TAATTCAATG ATAAACATAA TGTACCCACT TTCATTACTC AATTTAATTT 900

ATTTAAATAT CAAAATTACC ATATCATGAT AGCATATTCA TTTAAAGACA TGCTAGTTAT 960

AGTTATAATA CTAGACTAAA GATGTATATA TTCATTTTCT TTTACATGTA AAACTACAAT 1020

ATTTTATTGA GCTATTTAAT TTGATTTTAA GGAAAACCTT TTATAATAGG TTTAGGTGAT 1080
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15 






TTCTTGGTCA 
TTTTTAGCTA 
TTAGGTGTTT 
TTAGCATTTA 
CCAATCGGTG 
AAGAGTTATT 
ATGTTTTTCA 



TTGTTTTATA 
TTGAATCAAT 
TTTAATCGTA 
TTAACAATGC 
ATGTTAGGTA 
ATTGATTTTA 
AAGGACTTAG 
TTTGAGTTTT 
CAAAAGCCAT 
GTATTAAATG 
AGCGGGGCAG 
GGTACTATAA 
ATTTTCAGTG 
ATTGAAGAAG 
ATTAACACAA 
GAACTTTGCC 
GGTTTAGATA 
ACAACGATGA 
TATATAGAAA 
AATGGTGATG 
CTATAGTTGT 
ATATGGTGAC 



GTACTGGTCT 
AAATTATAAG 
TACTTTTAAG 
AAGTTAAGCA 
AACAAATGAC 
TGCCTCAAGT 
TCCATTTCAA 
TTATTTTCGG 
TTAGTCAACG 
CAGAGCAAAC 
GCATTTTaCG 
TTGGATTGGT 
AAACTGCGGC 
GGCAAGCGTT 
TAGAACAACC 
TTATTCAGTT 
ATTTAAATTT 
GTAAATCCAC 
GTACAAACCA 
CTTCTATAAA 
TGCTAGACGA 
TAATAGGTGA 
GTCTTTTAGT 
TTCAAACAGA 
TTGTCATTGC 
ATGGAAGACT 
ACTTATGAAA 
TGGTGTTTGT 
ACAAAGTGCA 



ATGGTCTAGC 



CGGCATAATC GTTATAACGC AAAATATTTT 
ACATCAATTT CAAGGTTTAT GGATTGTATT 
AGCAACTGTG CAATTTCTAA ATCAATGGTT 
TATGCTTAGA CAGCGGGTTA TTTATAAAAA 
TATACTCACA GAAAACATTG 
GTTCAAATCA ATGATGGTTC 
TACCGCATTA ATTATGTTAA 
TTTGAAAACG CGAGATGAGT 
GmTTAAAT ATTGCTAAAG 
AGAGAAGCaT ATTTaCGACG 
CAGTGCTTTT TTATCGGGAT 
TGCATTGGAA GCAACGCTAA 
AATTGCGATT ATTTTAGCGC 
CCATACTGGA AAACAAAGTG 
GAACTATAAT AATGAATTTC 
AACAGACATA TCATTTCGAT 
GGAAATATTT AAAGGTGATC 
TTTGACACAT CTTATTGCAG 
GCGTGATTTA AATATAGGAA 
AGAGAATATT ACGATGTTTA 
AGTAGGTTTA TTAGACAAAG 
AGGAGGCGAA ATGTTATCTG 
TATGAAGCCA GATCTCGTTA 
ACACATGATT CAGAACGTTC 
ACATAGAGAT AATACAATTC 
GATTGCTGAT GATCGCAATA 
ACACGACTAA AATTTCAAGT 
GGAAGTTTAG TTGCGCTCGC 
CTTGGTGCGC CACTATACGC 



CGCTCATCAT 
TAACTGCACC 
CAAAAGATCA 
GTTTAGTGAC 
ATAGTACTCa 
TAATGCTCGA 
UL'TTAGTAGT 
CTGAATTTTA 
AAGGTGCCAG 
TATTAAAGTA 
ATGATGATTC 
AAATTGCACT 
GTGTTTATCA 
TACTTAGTCA 
AAGATATAGA 
TGCAATCTTT 
GTGGACAGAT 
TATTTGATGA 
TGTTTCAACA 
GCCATTTACA 
TTTCAGTAAA 
AGATAAGGAT 
CATGTTTTTC 
TCTGATGATT 



AATAGCAGAT 
ATTTATTTTA 
AGGTGATACA 
TAATGGTCAT 
ACCTTTTTAT 
AATCATTGCA 
ATTTATTCCT 
AATGACTTAT 
GTTAAAGCTA 
GTTTAGAACT 
ATTTATAAGT 



All J. ± ■»■ 

TAATGCAATT 
TGACGTTGTG 
TGAGGAAAAC 
TGATAGATTG 
TGTAGGTCCA 
GCCAACAATA 
ACAGCCATAT 
AAATAATACT 
CACAAAAGGC 
GAGACGCATA 
GCCTGCAACT 
TTTTAAAGAT 
ACGACGCTTG 
TATAACAGAA 
TTATTGTTAG 
TTAAGTGGTT 
TTAGTCGTTA 



1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 






ATAAAGCTAC ATTTACAATG CTACGTGATA TTCGGGTACA GTTTTTCGGT AAATTAGTAA 3000 

ATGTCATTCC TAATGTTTAC CGTAAACTGA GTTCTAGTGA TTTAATTTCA CGTATGATTA 3060

GTCGTGTTGA GGCATTACAA AATATAkATT TACGTGTTTA TTATCCACCA GTCGTCATCG 3120 

GTTTGACAGC GCTAGTTACA GTCATAGTTT TGGCGTTCAT TTCAATCGGC CATGCG 3176 



(2) INFORMATION FOR SEQ ID NO: 416:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2109 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 416:

TCTTTATTTC AATTTCCAAT TGAATTTTTT TATTATTTAC GCATAGCTCT TAAAATTAAC 60

GTTACGATTG CAATTAAGAT AATTGAACCA ATTAATGCTG GCAAGATGTA AATACtTCCT 120

AATTCAGGAC CCCATTGTCC GAATAGTTTG CCACCTACCC ATGATCCAAT AATACCTGCG 180

ATAATATTGC CTAAAATACC ACCTGGGATA TCTTTACCCA TAATAGCACC AGCAGCCCAT 240

CCAATTAAGC CACCGACAAT TAACATTCCA ATAAATCCAA ACATAATTTT CAGTCTCCTT 300

TTTCTATTTA TTTTGCGTTA TTCTAAGTAG TACCCCTTAT TTACAATTCT AAAACAAATT 360

CAAATTATTT TTATCCAAAT ATTTTTAAAA GTAGTAATTG AATATCAATT TTATTCAATG 420

TAGCTATCGT TATTTAAAGT CTCTGTACCG ATAATATCAT ATACATTTAC ATTATTTTTT 480

CTGCCGAATT CATAGCTTGA TTATTTTATG TTATAGGACT AgAATATACA CATATTATTA 540

GAGCATCTTT GAATTTTAAA TCAAGAAGCG AGGTTAATGA ACAATGAATA TGCATATTTT 600

ATATaACTTA CGAACTAAAC ATAATTTAGA AATTGACGAA TTAGCACAGC AATTAAATGA 660

GAAATATGGT ACTAAATATG AAGCACATCA AATTTGGGAA TGGGAGAATC ATCACCATGA 720

ACCTAAATTT AAAGATGCCA TGCATTTAGC TGACTtCTTT GATGCACCAT ATGAAATGTT 780

TTTAGAAAGT AAGGTTAAAG AATATCAGAA ACATTTAGAA GAAGTCGATA TTCGCATGGA 840

TAAATAGATG CAAATAAACC CTCACAACAC GTTTGGCATA TATCCTTTCA AATCTATACT 900

GGATATATTA CATTACGTTG TGAGGGTATT TTATTAATTA ATATGAATTA AGACATTTTA 960

CAAGCGTTAA TGCAACGAAT CTTTTTAGTG ATCTTGCTCA CTCTTTAATA CTTTACCGTT 1020

CTTAGCATCA ACAGTAACTT CTTGTTTTTT ATTACCTTTT TTCAAATCGA TATTGTAAAC 1080

AAGTTTGCCA TCATCTTTTT CAAGTGACCA TTCTTTAATA TCACCATCAA ATTCTTTTTG 1140
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ATrcATTGTA TcrrcmTT cAGTCTrrrr gttaatcact TTrrrArrrr tatcagcaac 
AAGTAcrrcr gactcttcac cagatttttc ttgcgtcact ttataagccc attcaccatt 

AGAATTTTCA AATGAAATTC CTTTCAACTT TTGGCCTTrG TAAGTTTCTT CAGCrTTTTr 

cacagcatct tctgggcttg ttttaacatc ttttaaagca ataacatctt ttgttgtatt 

AGTGTCrrGA TTAGTATTTG ACTCTGTTGA TTTTGrrTCA TCTTTrGGAG TATCATTGCC 



' ACATGCAGTA AGCACCACCG CTGACATTGA TAACACTGCT AATGATTTTA ATTTCATAAT 

ATCACTCTCT CTTCCTATTT TTGAAACTCA TAACAAAAGC TTATATGCTA TATAGATTGT 
ATTACCCCTT GTmTAATT TTATTCATAA TTATTACAAA TATTTTTAAA TTAATCGTCA 
TGCG-rrACTT TCGTTCGTAC TCCTTITATA AATGAACCAT GTAATATAAG CATGCTATTA 
TCGACTCAGT TTGTCTAATG CTTTCTrrGG TACTTCTTCC TTTTCAACTr CTTCAAAAGT 
TTCTACATGA TGACCTTTAT GTGTAATITT CAAGTATCTA TGCGGTTTAA CATCAAATGT 
AGCAGTATAC ATTAATIGCG TCTCTTTCCC TrCTTTATTA AAAACACriT GAu.ATAACT 
TCGGAGTTCA TCATCCATTC TAOTAGACAC CTCTGTCGrT TTAACATAAG AATCATCCTT 
CTTTACTAAT GGGTTAAATT GATCTGTTAT ACCATGGGAG TCTATTGTTT TTAAAATGAA 
TAAAGCAgCA TAAGCGCCAA TGATAGTAAG TACAAGATAt GTkATTGTTT TTAAAATCGT 
TTCAAAAGA 

(2) INFORMATION FOR SEQ ID NO: 417:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 813 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



1260 
1320 
1380 
1440 
1500 



1860
1920 
1980 
2040 
2100 
2109 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 417: 
GTTACATAAA TGAAAACAAA AAAGATAATT TTACTGCTTA TGCTACACCA GAACATAATT 60

ATCAATTTGC TGGTGCTATG ATAGAAAGTG AAAAATTAAG CGAGriACTA AAGCCAGCCA 120

ATCAGTTAAA ATCACCAGAT GATATAAAAA AAGAACTAAA TAAAAAGAAG AGTCACTAAA 180

GTTAGGAGrr ACTTTAATCT CCAAAAAACA TGTTTnATA ATTATFGGTG TCATATTGTG 240

TATATGTACA GTTTCTACGG TCATGCATTT TAAAATGAAA TATGATGAAA AAGAAAAACA 300

AAAAGCGATT TACTACAAAG AACAACAAGA ACGTATTACA CTCTATCTTA AGCATAATAC 360

TAAAGAAACG AACACGATTA AATCTGTAC^ mCACAAAC TTGGAAACAA GTCCTATGGG 420
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10 






ATCGCCTGAA CATAATTATC AATTTGGTGG CGCTATGATA AAAAGTGAAG GAGTAGATAA 540

ATTATTAAAA CCAGCACATG AAAGAAAATC ACCAGAAAAA ATCAAAGAAG AATTAGATAA 600

AAAAGAAGGC CACTAGGGTC TTCTTTATTT TTGATTTAAT CTTCCT^TAA TCTATGTCAT 660

TGCTATCGAA GGTGTTTCGC AATTAATATA AATCACTTCA TCATCACCAA TACTTCCCCA 720 

GTTTTGTACA GTACATTAAC ACAAACGAAC CACGTTAATT TAAATGGAwT AaTAGTTTGG 780 

CCATTATAAG AACAATATAT ATCGAnTAAC AAT 813 
(2) INFORMATION FOR SEQ ID NO: 418:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 640 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 418:

AAAAGCAATC GTTGGTGGTG CTAAATTCAT CGGCAACTCA TATGTAAAAG CTGGTCAAAA 60

TACACTTTAC AAAATGAGAT GGAATCCTGC ACATCCAGGA ACACACCAAT ATGCTACAGA 120

TGTAGATTGG GCTAACATCA ATGCTAAAAT CATCAAAGGC TACTATGATA AAATTGGCGA 180

AGTCGGCAAA TACTTCGACA TCCCACAATA TAAATAAGCA ACATGAACAT AGGATCAAAA 240

GTCATCCCCC ACTATCAATC ATGGGGGATG ACCTTTGATC CCTTTTTTAT ACATACACAA 300

GCAAAAATAG CGGTGATTGT TTACCATCAA TTTTAACAAT CACCGCTACT TTTGCTTGTA 360

ATTCATGATT CAATTTTTGT TGTGTGCACA ACGACACTAA ATTATGTGTT TGCTATTGTC 420

GTGTTACAAC GATATGCGTC GTTGATTTAA CTTATCAAGT AATTGATTTA AATTGTCTAA 480

TTCGACTTCC GATAAACATT GACATCTTGC TTCAATCAAT TCGCAACGTG CATTATTTAT 540

TTGTGAAATT AATGTACGTG CTTGATCAGT CAAAATTAAT TCTTTACATC TTAAATCTTC 600

GCTAGATTGT TGACTTGTGA TGTACCCnnT CAAAACTAAT 640
(2) INFORMATION FOR SEQ ID NO: 419:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1159 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 419:
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w 






rrCTTGGATA CGTAATAAAT TACCITTTGT CATAGGATTA AACATAGATC CAGGTAAAAC 120

ATAAACATTT CTATTTTGCT CTAATGCAAA ATCTATCGTG ATGTGACTGC CACTTTGTTC 180

CTTAGCCTCA GTAATTAAAA CCCCTTTTGA CAAACCGCTG ATAATTCTAT TGCGCTCAGG 240

AAATCTATAT TTAGCGATTG GTGTATGTGG TGGATATTCA GATATAACTA AACCTTTTTC 300

TTCTATTTTA TTTCTTAATG CTAATGTACT TTTGGGATAA TGTGTTTGAT C-GCCA-AAGGC 360

TAAAACTGCA ATTGTAGGGA GATrGTATTT TAAAGCTATT TGATGTGCCA TTGCATCAGC 420

TCCTTGAGCA AGGCCGGAAA CAATTGTTAA ATATTTGCTT TTATCATTTG ATAATAAAAA 480

rrCTAAAGAC TGTTGGGTAT AACTTGTAGA ATCTCTTGCA CCTACTACTG CCAAATGATG 540

CATATTATTT ATTAATTTGA TGTTCCCTTT ATAGAAAAGA AGTAATGGAA ATTGATATAT 600

TTCmTAAT AGCACTGGGT ATTCAGAATC CATATATGTA ATGTAACTTA CTTTTAATTT 660

CTGCAGTTCT TTAATAATAT CGTTATGATC AAGTnTATA AAAGCATAGT ATTTACGTAA 720

TAAATGAACA TTTTCTTCCC TATTCACCCA TTCACATAAA [illegible] TTTTTCTTCn 780

CTCCTCTTTA ArTACATTAG GATATGCCAT TAAAAATTGA TGAATTTGTT TAGTCGAAAA 840

GTGTGCCCAG TATAACTTAA GCAAAAATAG TTTAATCAAT AAATCAACTC CTTTTTGTAA 900

TCATACAAAA TCATATTCTA TTrrTGTTTT ACATTTCTAA TACAAAAACA TTGTCGATGT 960

AATG1TATTT TAAGGAGTAA AAATACTGAC TTAAAAAGTG AAAAGTATGT TGGAAAGAAT 1020

TTAAAATTTC AATTTCCAAC ATACTTTATA ATTAAACCTT ATAAATAAGT TTTGCAAATT 1080

TATATAGAAT TGGTCTTACT GGTTTGATGA AATCACCAAT TAATTCTTCA ACATGTOC-AT 1140

TAAAACCCTT TTTAAATTG 1159

(2) INFORMATION FOR SEQ ID NO: 420:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1879 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 420:

TCAATCAGCG ACTACAAGAA GTGCGGGTCT TCAAACAATT GATGTGACAA CACTAAGTGA 60

CCCCACTAAT ATTATCATGG GTATTTTAAT GTITATAGGA TCTTCGCCAA GTTCGGTTGG 120

TGGCGGTATT CGTACAACAA CTTTCGCTAT rTTAArrTTG TTrTTAATTA ACTTTAGTAA 180

TAATGCCGAT AAAACATCCA TrAAAGTTTA CAATAGAGAA GTACACATTA TCGATATTCA 240
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ATCAGCTACT 


GAAAATGGTA AGCTTACATT TTTACAAGTA TTTTTTGAAG TCATGTCTGC 360

ATTTGGAACT TGTGGACTAT CGCTTGGTGT CACAAGTGAT ATTAGTGATA TTTCTAAGGT 420

CGTACTAATG ATACTCATGT TTATAGGACG TGTTGGCTTA ATATCATTTA TCATTATGAT 480

AGCAGGACGT CGAGAACCAG ATAAATTCCA TTATCCAAAA GAACGTATTC AAATAGGATA 540

ATATAATAGC AATCTAAGTT TAGTTAATGT AGATTTTAAC TGGAACTTAG ATTGCTTTTT 600

TAGTTTGTAT TTTTAACTTA TTTTATAAGA CGATTGGTTT CGAAAATGGT AAACTAGTAA 660

CAATGAGAGG TGTAACATGA TGGAAAAAAA TGAAAACATT AATGTAGAGA TTTTAACTAC 720

GTCAGATATG CATAGTCATT TCTTAAATGG TGATTATGGT TCAAATATTT ATAGAGCTGG 780

TACTTATGTT AACCAAGTAA GAGCACAAAA TCATCGCGTC ATTTTATTAG ATAGTGGCGG 840

AAGTTTAGCT GGCTCGTTAG CGGCCTATTA TTATGCTATT GTTGCACCTT ATAAACGACA 900

TCCAATGATA AAGTTAATGA ACAGAATGCA TTATGATGCT AGCGGTGTGA GTCCAAGTGA 960

ATTCAAGTTT GGTTTATCAT TTTTAACTCG TTCAATTGCT TTGGCACGTT TTCCATGGTT 1020

ATCAGCAAAT ATTGAATACA ATGTTACTAA GGAGCCTTAT TTTTCAACTC CATATTGTAT 1080

TAAACATTTT GGTGACTTAA AAATTGCTAT CGTAGGCGTC ACAGCAGATG GTTTAATGGA 1140

AAATGAGTAT TCTGAAATGG AGCAAGATGT ATCTATTGAA AAGACATTAG TGGCATCAAA 1200

ACGTTGGATT AGATATATCC ATGAAGTTGA AGAGCCAGAT TTTTTGATTG TAATTTATCA 1260

TGGTGGATTG AATAAAATTA GTAATAGTAC GAAAAATAAA AAGGCAAGTT CGAATGAAGC 1320

TGAAAAATTA ATGGAAGAAC TCGGTGTTAT AGATTTAATG ATTACAGCTC ATCAGCATCA 1380

AACAATAGTA GGTCAAGATC ATGAAACGTA TTATGTTCAG GCTGGTCAAG ATGCCAAAGA 1440

GCTTGTACAT CTTTCGATTA ATTTTAAAAA GAGAACAACA ACTTATGATG TTGAAAGCAT 1500

TGATTCTAAA GTGATTGACT TAAATGAGTA TGAAGAGGAT CAAGAATTAT TAGATTTAAC 1560

ATTCTATGAT AGAAAAGCAG TGGCTTATTG GTCACAGGAA ATCATAAGTG ATAAAGGTTT 1620

GATGTTATCA GTAAATGGGT TACAAGATTT AGTCTGTCAA ACACATCCAT TTTCGCAATT 1680

ATTACATGAT GCAATTCACC TTGCATTTGA TAATGATATA ACATGTGTCC ACGTGCCTAT 1740

GaACGGAGAG AAGGGGTTGA GTGGACAGAT TCGAAATGAA GrTTTGTATC aTGCATACCC 1800

ATaTCCAGAT aAGCCAtGGG tATGACAATT aGTGGTCAAA ATaTCAAGGT ATnTTGGGGT 1860

ATAGTTATTC ACCATTAGG 1879


(2) INFORMATION FOR SEQ ID NO: 421: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2710 base pairs 
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(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 




(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 421: 





AATTCCCCCT TTTTCCCCAT CCATTTTnCC ATCCACTGGT CCTTACGGGA CCATATTATT 60

TnAAATTGGA nAAAATTATt TTAAAGaTTA TwACtACTCT TtAATCATTT TrGTGAATTA 120

AAAAAAGTAG TGCAAAAAGC AAAATATACT TTATACACTA CAAATCATTT ATTTATAATA 180

AAGTTTCACC AAAAAATGTT CCAACTAATG AAACCGCTTG TTCAGCAGTA TGATTATTAC 240

TGTCAATCAA TGGATTTACT TCAACTAAAT CCATTGAGGA AATTAAATGT GATTGATGCA 300

GTAATTCCAA TGCAAAATGG CTTTCTCTAT AACTAAGACC ACCCAAAACT CTAGTACCAG 360

TGCCTGGCGT TTCAAGCGGA TCTAAAGCAT CAACATCTAA AGATAAATGA ACGCCATCAA 420

CATTGCGTGA CTTCAAATAT TCTATTGTAT TTTCAATTAC TTCCTTTATC CCCAATTTAT 480

CAATATCTGA CATAGTAAAT GTTTTAATAT TATGATCTTT GATAAATTGT CTTTCACCTT 540

TATCTAAATC TCTCATACCA ATTAGTACGA TGnTTTCTGG CTTGATTACA TTACnATTTA 600

ATTCTAAAAG TTCTTTGGGA CCTTCGCCTG TCAAAATCCT TAGAGGCATA CCATGAATAT 660

TTCCACTTGG TGACTCTTCA GGTATATTTA AATCACCATG TGCATCATAC CAAATAACAC 720

CTAAATTATT ATAATGTTTA CTTATTGCTG ATACTGAACC TACCGCAATA GAATGATCTC 780

CACCAAGAAC TAGAGGAAAT CTGTTATTTT CAATTGATGC TGAAACCTCT TTATTTAATT 840

TTTGATTAAC ATCTATAATT TCATCATAAT TTCTTAATCC TTTTTGTTCA CTATGAAATT 900

TTTCAATGTT CACAGCAGGt ACCTTAATAT CCCCCTTGTC ATATACATCA AGGTCTAATT 960

GCTTTAATCT TGAAATTAAT CCAGCATATC TAATTGCTGT TGGTCCTAAA TCAACACCTA 1020

ATTTTCTTTG TCCAAATGTT GATGGTGCAC CTATAATATC AATTGCTTTT GTCTTTGTCA 1080

TAAGCGTCCC CCTTTGCTCT ATGTAATTAA AGAATAATGT ATGCGCTTAC CATTATCAAG 1140

CAATAGCTAC ACATATAATC TGTTTATCTT ATTACTTCAT AAAAAAAGGT TCTTCATCTT 1200

TTATGGTGGG AAGGTAAAAC TTCCTGCTTT TTTTAATACA CAAAAAGCGC AATTGCCTCT 1260

ATAATTTAAA GTGACCAAAC CCAAACTAAA GGAGACAAGT GCGCCTATGT GTAATGATAC 1320

CTTAGAATTA CTAAGAATAA AAGATGAAAA TATAAAATAT ATAAACCAAG AAATTGACGT 1380

CATTATCAAA GGAAAAAAAG CAACAGTGGT TAATGCTGTA CTAACGTATA AGCCTTCGGC 1440

CTGTTATTGT TGTGGAGTTA AAAATGAAGG ACAAATTCAT AAACATGGTA AGCGTGTTTC 1500






TTACTTAAAA 


CTCAAGGGTA 


TAACACATAC 


CTCAACTTAG 


CTAAACAACG 


1560 
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5 






GTGTTTTATC TCAAGATGTG TTACTCAAAA AGTTATAGAA GAAGCTACTA AAGTTAAAAC 1680

AGAGATTGAT ACTGCAGAAG ATAACTGTAT CTCTCCATCT ACTGTAAGTC GTATTAGAAC 1740

TAAAGCGGCT AATTCATTAC GAATTAAACC CTTTAATTGT TTGCCAGAAC ACATCGCTAT 1800

GGATGAATTT AAAAGCGTTA AAAATGTAAC TGGATCAATG AGTTTCATTT TTATAGATAA 1860

TGATACTCAT GATGTTATAG ATATTTTAGA AAATAGAACT ACAAGATTCT TGCGTGCCTA 1920

TTTCGAGCGA TTCGATTTAA AAAATCGACA ACAAGTTAAG ACGGTTACTA TTGACATGTA 1980

TGAACCCTAT GTCCGATTAT TTCGCGACCT ATTTCCTAAT GCAGCTATTA TTTTTGACAG 2040

ATTCCATATC GTTCAACATT TAAATAGAGA ACTTAATAAG TATCGTGTAC AAGTTATGAA 2100

TGAATACCGT AATAAAAAAG GACCTGATTA TACAATTTTT AAGAATAACT GGAAAGTCCT 2160

ATTGATGGAT ACTAGTAAAA CCATATTTAG TAAATACAGA TGGAATAAAT CTTTTAAGGC 2220

TTATAAACGC TCATCTGACA TTGTAGAATT CATGCTTTCA AAAGACGATA TACTACGACA 2280

CTCCTACGAA CTTGTCCAAG GATTACGAAA AGACCTAAGG TTATGTAATT GGCCTAAATT 2340

TATTAATCGT TTAAATTCAG TTAGTAAAAA GTCTGTGAGT AAGGGTGTAT GGAAAGTGGT 2400

TAAATATTAT AGAAAACATC AAAGGATGTT AAGAAATACA ATTTATTACC CAGCATTTAA 2460

TAATGGTGCT ATAGAAGGAA TTAATAATAA GATAAAATTA ATCAAGTGAA TTTCTTTTGG 2520

TTACAGAAAT TTCAACAACT TTAAAGCACG TATAATGATG aTTTTCAGCT TGTACAAAGG 2580

AGaAAAAAAG AAGACAACCA AGCCCAATAA TGGACTGGCC GCCTAATAwA nGGGskCTAA 2640

AAGTTgTATT TTTAAAAATA GTTCCTTTAA ATTATATACC CnCCACATTT GGGGGAGGAC 2700

CTAAAAAAGC 2710



(2) INFORMATION FOR SEQ ID NO: 422:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1027 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 422: 

CATTTTAATT GtTaAAATTC CAAAAAAtGT ArGTGGATTw AAAGrAAACC CtGTGTTTTT 60

AAAAGGtACC ATTaAAATAg TTCCGATTGT ACCATCCCAC GTGAAAtTTT TAgTAtTGCC 120

GGTGaGAGAA AATGCCAATG CAATCATCGC AGTTCATAAT CATCCATCCG GTGATGTAAC 180

GCCCTCACAA GAAGATATCA TAACAACAAT GAGGTTGAAG GAGTGTGGTT TGATTTTAGG 240
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w 






GGGTTACTTT 


GATGAAAATG 


ATTGAAGTTO 


ATAGAATTAA 


TGACGTATCT 


iry* 2V T* A ^ 'P 


"3 ^ 

J6 0 


ACCTACGAAG 


TACTTTCATT 


GGAGGAAAAA 


TAGTGACTCT 


ATTTATTATT 




42 0 


TCGTGCCAAT 


GGTTTATACC 


ATGCAGTTAA 


ATATTAAAAA 


TGAACCTGTa 




>1 o <^ 

4 o 0 


ATCi"ITTAAT 


AACATTAGCT 


TTATCTACGT 


TAGGTATTTT 


AGTAACCGCG 






TAATCGTTAC 


GAAACAAGCT 


TTTCCTTTAT 


TAAGTGTAGC 


AATTGGCTCA 


Al 1 1 i lA^lAj 


£ A 

b 0 0 


GAATCGTTTG 


GGGCCi"l"lTA 


CTAAGTGGTA 


GcTACGCGCT 


GATACGATTT 


TTA I C- 1 AAu(j 


c c r\ 
6 b U 


CATTTGGGCG 


TAAGTAATAG 


TAATCTGTTC 


ATCAAGTAGT 


ATCCGTGCTT 


GAAAAC.AAi\(- 


T T n 


TAAAACTCCT 


AATGTGGAAC 


TAGTTAAAAA 


ATTCTAAACT 


AGTACATTAG 


GLAVj i 1 1 i v> 1 i 


T Q n 


ATGCAGAATA 


AATTTAATTG 


TTAAATTGAA 


AGTGCGGTAA 


AAATCCACTA 


TTTATTTGAA 


□ /I rk 
b4 U 


AAAAATCGAG 


CAACCAAATT 


AAACCTTGAT 


ATCCTAAGTA 


AATACATAAT 


AAAACAAGl V. 


ft A r\ 


CGACATAAAT 


TAAAAATCGC 


AAAATAGAAA 


GTCCAACTCT 


AAAAAGGAGG 


ATGACTAATA 


960 


GTGCTATTAA 


AATAATTGTt 


AGTATACTa\ 


ATGCTAC-AAA 


CCTCCTArTA 


CGCTTTTAAA 


1020 


TCCATAA 












1027 



(2) INFORMATION FOR SEQ ID NO: 423: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 569 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 423: 






CGGCACATTT TTAATTTATA CATATTTTAA AACTAAGTAA CAGTTTGAAG AAATCGTAGT 60

TCAATAATGT TAATTGTGAA AATGTATATA AACATAAAAA AATCATGTAT AATATATGTT 120

GTTAATTAAA CAGTTCGAAA GCGAGATGAC ATTATGGGAC GTAAATGGAA TAACATTAAA 180


GAAAAAAAGG 




TAAAAACACA 


AGTAGAATAT 


ATGCGAAATT 


TGGTAAGGAG 


240 


ATTTATGTTG CAGCAAAATC TGGTGAACCC AATCCAGAAT CTAACCAAGC TTTAAGGTTG 300

GTGCTTGAAC GCGCTAAGAC ATATTCAGTG CCGAATCATA TTATTGAAAA AGCAATAGAT 360

AAAGCTAAGG GTGCTGGAGA CGAAAACTTT GATCACCTAA GATATGAAGG ATTTGGCCCA 420

AGCGGATCAA TGCTAATTGT TGATGCGTTA ACAAATAATG TAAATCGTAC TGCCTCTGAT 480

GTGCGAGCTG CTTTTGGTAA AAACGGCGGT AATATGGGTG TATCTGGATC AGTTGCTTAT 540

ATGTTGTCAT GTGGCCACAT TTGGTATTG
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3264 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 424: 

TGGACCTATA AAATGGATGC ATCCAAATAT GGATGnGGGG TAnGGCGGGa AAATAGGaGG 60

TTATATAGTA AGTCATTTAA GTCGAATAGC CATCCTTTTT AAAATGAAAA AAATAGAAAG 120

CTCAATAGTT TGTTAAAGCC TTTAAATAGC GTCGTTACGC AATTTTAGAA TGCTAAAAAT 180

TGTCACAAAT TTCAAATTAC GTGCTAAAGC TTGAGATATC AATATTTATT GGCGATAGAG 240

TGTAATTTGA CTCCGCTTAC ATCTCAGTTT TATGTTTGTA ATTTGGTAGC ATAATATTAT 300

AATAAAATAA AATTGTTAAT CTTTAATTTC AGTATAGATA TTTTTACGTG TAGTCACGTG 360

TAAAATAAAT TCAATTAGGT TAGGAGACAT AATTATGAAA ACATTTGGTA AAAAGGTTGT 420

ATTAATCGGA GATGGATCTG TAGGATCAAG CTATGCCTTT GCAATGGTTA CGCAAGGTGT 480

TGCTGATGAA TTTGTAATTA TTGACATTGC AAAAGACAAA GTAAAAGCAG ATGTTCAAGA 540

TTTAAACCAT GGTACAGTCC ACAGTCCTTC ACCAGTTGAT GTGAAAGCAG GTGAATACGA 600

aGaCTGTAAA GATGCAGATT TAGTTGTTAT TACAGCTGGT GCACCTCAAA AGCCAGGTGA 660

AACACGTTTA CAATTAGTTG AAAAAAATAC TAAGATTATG AAGAGCATCG TTAAGAGTGT 720

TATGGATAGT GGCTTTGATG GATATTTCTT AATCGCGGCA AACCCTGTAG ACATTTTAAC 780

AAGATTTGTA AAAGAATATA CTGGATTACC AGCAGAGCGT GTTATCGGTT CAGGTACTGT 840

ATTGGACAGT GCACGTTTAC AATATTTAAT TAGCCAAGAA CTTGGTGTTG CACCTTCAAG 900

TGTTGACGCT AGTATTATTG GCGAGCATGG TGATACTGAA CTTGCAGTTT GGTCACAAGC 960

AAATGTAGCA GGTATTTCAG TATATGACAC ATTAAAAGAA CAAACTGGTA GCGAAgcTAA 1020

AGCGGAAGAA ATTTATGTGA ATACACGTGA CGCTGCTTAT GAAATTATCC AAGCTAAAGG 1080

GTCAACATAC TATGGTATTG CATTAGCATT GATGCGCATT TCAAAAGCCA TTTTAAATAA 1140

TGAAAATAAT GTCTTAAATG TTTCTATACA ATTAGATGGT CAATATGGTG GTCACAAAGG 1200

CGTTTACCTA GGTGTACCAA CATTAGTTAA CCAACATGGC GCAGTTAAAA TTTATGAAAT 1260

GCCATTAAGT GCCGAAGAAC AAGCGTTGTT CGATAAATCT GTTAAAACAT TAGAAGATAC 1320

ATTTGATTCA ATTAAATATT TATTAGAAGA CTAAGCCTAT TTTAAGTATT AATTAGAAAT 1380

ATATTAATGG TAAGAGGATC TATTAGTGTT GCAGTAACAC GTGGCACTGA TAGGTCCATT 1440
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10 






TATGACTCGT GAAAAriTAG TGACATGAAT AAAAATGTTG AGGCGTCATT GAGTATAAAG 
GAAAGTAGTT CTGCATTAAT CACGAaGTAG AGCATGACtnA CGAGGAATAA CTATAGGGaG 
ATGGTTrTGG AATGACGAtG TCTTGTATCA ACATGGTACA TTAGGTACGT TAATGGCTGG 
CrrACTAGAA GGCACAGCTA CAATTAATGA ATTGTTAGAA CATGGGAATT TAGGGATTGC 
AACGTTAACA GGGTCTGATG GCGAAtAATA TTTTTAGACG GAAAGGCATA TCATGCTAAC 
GAGCATAAAG AATTTATAGA ATTAAAAGGC GATGAGAAAG TACCGTATGC ATCGATTACT 
AATTTTAAAG CGAGTAAGAC ATTTCCATTG CAACAATTAT CACAAGATGA TGTATTTGCA 
CAAATTAAAA ATGAAATCTT AAGTGAGAAT TTATTTTCGG CTGTTAAAAT TTATGGCACA 
TTTAAACATA TGCATGTACG AATGATGCCT GCTCAGCAAC CGCCATATAC ACGTTTGATT 
GATTCAGCAC GCAGACAACC TGAGGAAAAA AGACAAGATA TTCGTGGTGC CATTGTTGGA 
TrrrTTACAC CAGAATTATT TCATGGCGTA GGGTCTGCrro GTTTTCATAT ACATTTTGCG 

GATGATGAAA GAGCTTATGG TGGACATGT-r irnui^ui nv, ««oxo««xv«. ■ 

GAGATACAAA ACTTTCAAAC ATTCCAACAA CATTTCCCGG TAAATAACGA GACGTTTGTT 
AAAGCGAAAA TAGACTATAA AGATCTGGCA GAAGAAATTA GAGAAGCTGA ATAGTCTAAT 
ATAATTAAAA GACCTTAGCG ATATTAGGAA CAGGTGGTTC TAAATGCATC GTTAAGGTCT 
TTTTATATTA TATATGTGCT TACATATTTT GTTGATACGC GCGTAAAAAG CTATAAATAG 
TATCAAGATA TGGGGTGTCA AGAATGTGTT CACGTGCACG TCGATAAATA AAGCCTTGAA 
TCGCTTCGAC TTCAAGAGGT TGCTGATGCG CAATGTCGTA ATACATGCTC GTTCCCATTT 
CGTCGGGATA ACCTTGATAT ATAGTCATAA TAGTATCGAC AGTTTGTrCT GAAAAGTTTA 
ACCCCTCTGC CTGTGCAACA CGACAACCAT CTAGTAATAG CTGTCTACAC AGTGTACGTA 
rrrCAGGATT ATXSCATGATT GCAACGGTCT GTCTACCGAG TGCTGTGATA gaattaatgc 
CCAAGTTAAC TAACAArTTA TACCAAATAG CTTOTTCAAT ATTTGCTTCT AATACAATGT 
CGATTTGACT GTCTTGGACC AAATCTCTAA ATTGTCGAGT TAATGCATTA TCTTGTATAC 
GTAATTGATA ATCTCTGAAG TGCGTAACAA CATCGCCTTT CTTTTGACCA CTTATATAGA 
CAACTGCTTG GCATACGTrr TTAAATGAAA TATGTTCGAG TTGACCATAA CCATTTTGGG 
CT/^JU^ATGAT GAGCGTGTCC TCATGTCCAA GATGAGTTAA ATGAGGAATA ACTGCATCAA 
GTTGATGTGT TTTGACTGCA ATAATAATAA CATCAAATGT GTTTGTGACA TCTTCATAAC 
CTTTCACAAC AATATCTTGT GCAGGTGCAT GTGGTACAGT ATAATATGTT ATTGmTGG 



CGTGTCTCCC GATAAGCGTT GTATGAGGCA ATGATTGTTG TAATTCATAG GCAATAGTTG 3240 
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(2) INFORMATION FOR SEQ ID NO: 425: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 796 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 425: 
AGAACGAACC TTT/^TACGT ACGTGTCGAC CACGAACTGT CATGACAAAT TGATAAAACG 6 0 
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AATAGTTTTT CATTAGTTCA TTGTCACATC AATCACTTTT GTkTCACCTT TAATCACAGC 12 0 

ATTTTCATCA TAAATATTAA TTGAAGCTGC TTGATCAGTG TTAGTAATTA TAATTGGTGA 180 

AATTACAGAT TTAGCGTTAT TATTAATATA TTCAAGGTTG AATCTTACTA ATGGATCTCC 24 0 

GACGTTAACT TCGTCACCAC TAGACACTAA CACTTCAAAT CCTTCACCGT CTAATTGAAC 30 0 

TGTGTCTAAA CCGATATGAA CTAATAATTC TAATCCGTTA TCTGCTTTTA ACCCAATCGC 36 0 

ATGCTTAGTT GGAAAGACAT TGTCAACACG TCCTGCAATT GGAGACACAA CTTCTCCTTC 42 0 

AGTTGGATTA ATACCAAAAC CTTCGCCCAT CATTTTTTGT GCGAATACAG GATCTGGAAT 4 80 

ATCTTCAATT TTCACGAATT CTCCAGTTAA TGGTGCATAA ATTGCGATAT CTTTCTGAAC 54 0 

TTCTTTGCCT TTTCCGAATA ATTTTTTAAA CATACTTTCC ACTCCTACTT ATCAAAATGT 60 0 

GATATTAAAT CGCCATAACC CAATTCTTCT AACTTTTCAT ATGGAATAAA TTGAATTGCA 66 0 

GCGGAATTGA TACAGTATCT TAAGCCGCCA CTTTCTTTAG GTCCATCATT AAAGACATGT 720 

CCTAAATGAC TATTTGATTC TTCTGAACGC ACTTCAGTTC TCAACATACC AAATGATTTG 780 

TCGACTAATT CTATAA 796

(2) INFORMATION FOR SEQ ID NO: 426: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3393 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
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ixi) SEQUENCE DESCRIPTION: SEQ ID NO: 426: 

ATCCAGCAAA TCTTTCTTAT CACGTTCTGT AATTTTCTTA GTATCCATCT TGATTAGCTT 6 0 

TGATAACTTT TCAGCCGTAT CCAACATTTC CGATTGTGTT GTTTTTCGAC CCCTAGTATA 12 0 

TGTAATAGCC ATTTTAGAAG CATTATCAAC TAAAACTTTC CCATTTCTGT CTAAAATACG 18 0 
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ATGTGAGCCT TGTGCGATTT GTAAATAACC TAAACGTAGT ACTAGTACTG CAAAAATAAA 300 

TACAATCACA CCAAATATAA AGTTAATTCT CTTGTTAATT GTATTTTGAA CGATTTCATC 360 

ATTTGATTTT TCTTTTAGTC TTTTTAACAA AACTACCTAC CTCTATTCAA AGTCTTTCAC 42 0 

TTTAAATCAT ATATGAATTT AGAAATTATT TCTATCTTTT TGACAAAAAA ATAACGGTCT 4 80 

CATTtAAGAG ACCGAACAAG TAATCATACT TTATTTTGTT GCATTATATA ATTCGTCAAC 54 0 

TTTTTCCCAG TTAACTACAT TCCAAAATGC GCCAATGTAG TCAGGGCGTT TGTTTTGATA 600 

TTTTAGGTAA TAAGCGTGTT CCCATACGTC TAAACCTAAA ATAGGTGTTT TACCCTCAGT 660 

TAATGGATTA TCTTGGTTTG GTGTAGTCAC AATTTCTAAC TGGCCATTGT TTACGACTAA 720 

CCAAGCCCAA CCTGAACCAA AGCGTGCAGC TGCTTTGTCA GCAAATTCTT TTTTAAATTC 7 80 

TTCTAAAGAA CCCCATTGTT CTTTAATTTT TTCTACTACA GTACCTTTTT CTTCTGAGTT 84 0 

TGGTGAAAGT AACTCCCAGA ATAATGAATG GTTTAAATGT CCACCGCCAT TATTACGTAC 9 00 

AGCAGTTTGG ATGTTAGCTG GTACACTGTC TAAATTAGCA ACAATTTCTT CAATAGATTT 96 0 

AGATTCTAAA TCTGTACCTT CTACTGCAGC ATTTAATTTC GTAACATACG TGTTATGATG 1020 

TCTGTCATGG TGAATTTCCA TAGTTTCTTT GTCAAAATGT GGTTCTAATG CATCAAATGC 10 8 0 

GTATGGTAAT TTTGGTAATT CAAAAGCCAT AAATAATCAT CCTCCTAAAA TGTCTGTAAG 114 0 

TAAATAATAA CAAGCAGTGA CTGGTTCAAC AAAGAATTTG CTTAAATTCT ACTACTTATT 12 00 

ATTTTCTCTA CTCATTTAAT ATAACTCAAA TCAAAAATAA TTAAACATTT TGTATATAAA 1260 

AAGTTAACAG ATTTGCCATA A-AATCATACG AACGGAGTAT GAAATGAACC TTTATCTTCT 1320 

ATAATTTAAA AAATGAGATT TATGCATACA TCGGACCAAA TGTGCATAAA TCTCATTTCT 13 80 

TTATATTAAT CTTGGcAAGA CTCACATACA CCATAAACTT CAAGTTTGTG TTTGTGAATA 1440

TTAACACCAG GTAGTGAT7>lA TTTTATCTGA TCTATTGGAC AATAATCTAT TACCTTTGTA 1500 

TCTCCACACT TTTCACAGAT AAAATGATGA TGATGATGGT TTGTACAAGC GATTCTAAAC 1560 

TTCATTTCAC CATCAAGTTC TGTATTTTCA ATAATTCCTA AATCTTTAAA TAAGTGCAGG 1620 

TTTCTATATA TTGTGTCGAA TGAAATTCCA GGATAATTTT CATCCATAAC TTGTTGTATA 1680 

TACTTTGCGT TTATATACTT ATCTTCTTCG ACAAAAATAT CTAACATATC TTTACGTTTA 174 0 

TCTGTATATT TTAAACCGTT CTCTTTTAAA ATTTTAATAG CATCATTTGT ATTCATTGAT 18 00 

ATTAGCTCCC TTTTTAAACT TCATTCGCAT TTTCTGATAA GCCATTGTAA TCATAAGTAA 186 0 

AATAACAAGT AGAACTACAA TTACACCACC CGGAGAAATG TCCATATAGA AAGCTAGGAC 1920 


TAAGCCTAAT ATTACTGATA ATTCACCTAA AAATACACTT AGTAATATCA ATTGCTTAAA 1980 
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TATCCCTACA ACACGCATTG AGGCAGAAAT AACCATCGCT ACAATAACAA TAAATAAAAA 2100 

TTGAATCCAT TTAGGAATGC CAATGACTTT ACTATATTCC TCATCAAATG ACAATATAAA 2160 

TAATTCTTTA TAAAACAATG TAATAAACAG AACAACTATG ATGGCAATGA CAATAATCGT 2220

TGTTAAATCA CTTATATTCA CTGCGCTTAT TGAGCCAAAT AGCAATCCAA CAATTTCTTG 2280 

ATTGAACCCA TCAGCTAATG AAATGAAGAT TGCACTCAAG GCGATACCAG CACTCATTAT 2 34 0 


AATTGGAATA GCAATTTCTT GGTAAGCAGT GTATGACGTT CTTAATTTTT CAATTAGAAG 24 0 0 

CGCACCTACT ATTGCGAATA AGATTCCAAA CCACATTGGA TTAATAAATA CTAGTGTTGG 246 0 

CATAATAGTA AGTAAAAACA TACCGAAAGA TATACCACCT AAAGTTACAT GACTTAGAGC 2 52 0 


hi:QAQQi:\n:K AGTGATAGTC GTCTAACAAc GATAAAAGCA CCGaTTAGAG GCGCAATAAA 2 580 

ACCTAtCAAG ATAcCACTAA TTAAAGAGTA CCTCATAAAA TCAAAATTCA ATAATGCATC 2 64 0 

TATCAATTGT GACACGCCTT TCCATTTTAA ATAAACTCAA ATCTTTATTA ATTACAACAT 2700 


TCTCGATTAT GCTGATGATC GACAAAACGT ACAGGATGTC CATAAATTTT TGAAATTTCA 276 0 

ACTTCATCAA GTGATTTAAA CTCATCAGTT GTACCATGGA AATGCAAATG CTTATTTAAA 282 0 

CATGCTACTT CAGTAGCAGT ATCTGCTACA ACACCGATAT CATGAGTAAC TAAGATAATG 2880 


GTGATACCTT CTTGTTTTAA TTGATCTAAA GTATTATAAA ATTCACTTAC ATGTTTTGCA 294 0 

TCAATACCAT TCGTTGGTTC ATCAAGTACT AATACTGcAG GTTCTGAAAT CAATGCTCGA 3 00 0 

GCAATCATTa CACGTTGTTG TTGACCACCT GATAATTCTG CTATATTTTT ATGAATTAAA 306 0 

TCACTTATAT TCAGTCTTTC TAGTACTTTA ATCACTTTTT CATTATCTTT GCTATTAAAT 312 0 

GTTTGGAAAA GACGTTTTGT CTTTGTTAAT CCGCTTAAAA CAACTTCTTT AACACTTGCT 3180 

GGGAAACCTG AATTAAAGGC ATTTGCTTTT TGTGATACAT AGCTTAATTT AATTGATGTT 3240

TcTTATTTTT AAAATCAATA CCTTCAACAA AAATCTCACC ACTTTGTAAA GGTAATAACC 33 00 

CTAGAATCAA CTTCAATAAT GkTGATTTAC CAGCACCATT TGGkCCAACA AwTGCtAAAA 3 360 

ATTCACCTTT ATTTATTTTG AATGnnnTAT ATT 3393

(2) INFORMATION FOR SEQ ID NO: 427: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1123 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 427: 
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CCCTGTTTCT TAGCCTTTTC GTATATGCGT TCATAAGAAG AATTCTTTTT GATTTCAGAT 120 

TGTAAAGCAC TGTTTTCACT TGATTGTTTA TCTATTTTAT AATCTAAATC TGCAATCTTT 180 

CCTCGCGTAT CATACGCATC CATTTTTAAA GATAGCATAT AAATACTTAA CATAGCAATT 24 0 

ACAGTAATCA AAGTTATGTA TAAAACTTTT TCAAATTTAG TTAATTGTAC AACCACTTTT 30 0 

CTCGAAACAG TCTTCTTTTC GGGCTTAGTT TGTGGTTGTT GCTTCGGTAT ACTATTATAA 360

ACTTGTTCGT CATATGGTTG GTACACTTTT TCTACAGCCA TTATAAATTG CTCCTTATTT 42 0 

AAGTATTTCA GCTACACGTA ATTTCGCGCT TCGTGCTCTG TTATTGTCAT CTAAATCTTC 4 80 

TTCTGTAGCG GTAATCGGTT TACGATTAAC ACGCTTTAAC TTAGGTGTAT ATGCTTCTGG 54 0 

TATAACTGGT AATCCTCTTG GTACCTCTGG ACCTTTTTCA TATTCTTGGA ACACCTGTTT 600 

ACATAAACGA TCTTCTAAAG AATGGAAAGT GATTACCGAA ATCCTGCCAT CTACTTTCAC 660 

TAATTCAATC GCTTGTTCTA TTGAATCTTC AAAAGCTGAC AATTCATCGT TTACTGCAAT 720 

TCGTAGTGCT TGAAATACTC GTTTl'GCAGG ATGTCCGCCT TTTCTTCTTC CTTTTGCAGG 780 

AATACCTTCT TTTATAATGT CAACTAATTC TAATGTTGTT GTTATTGGTT GTTGTTCGCG 84 0 

ATGTGCTTCG ATTCTTCGAG CTATCTGTTT TGAAAATTTC TCCTCGCCAT AGCGATAAAA 900 

AATCTTCACT AACGCTTCAT ATGACCAATT GTTAACAATT TCATATGCTG TTAGTTCTTG 960 

TGTTTGGTCC ATACGCATGT CTAATGTTGC GTCATGGTGA TAACTGAATC CTCGTTCTGG 1020 

AATGTCGAGT TGTGGGCTTG AAACACCCAA GTCGTAATAA ATTCCATCTA CTTTTTCAAT 1080

GTTTAAGTCT TTTAATATTT GAGTTAATTC ACGaAAQTTG CTA 1123
(2) INFORMATION FOR SEQ ID NO: 428: 
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(il SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 838 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 428:

TATATCCTCC AnACCATAAA AATGGAATTA TAGTGAAGAC CGCTATGATC CAGGATGACC 6 0 

AC3TATTATA GAATCTTTCg ATAGAGAAAC GAATCaACGT ATCCgACATC aCGAAATGAA 12 0 

ATTAGAAGAT TATAAAGATG AGTTAAGAAG AGAATATCTA AAACAATCTG ACAGAATTGA 180

AGGAGATGAA TAAGCGTGGG ACTTGATTTT AGTGGTTTAC CAGATTTAGC AGTATTGGAA 24 0 

CAAATGAAGG AAAAAGAACA GATTAGTGAG GTTATTGCGC CTGAACATGT TCGTATGCAT 300 
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CATTTCAAAA AATTTGAAGA TGATTTTAAA AATGCGGCAC AAGGGGCTTG GGTGAAAAAT 42 0 

GCCACAGACG AATTAAAAGA TATTAGTAAT GATTTAGAAA AAATTCAAGA TATTAAAGTA 4 80 

TAAAAAGGTA TTAAGAAACA CAATAAGTAT ATAATCCATT TAATAATAAA TGAATTATAT 54 0 

AGTTCATAAT TTCGACTATA AGTGGCTATT AGCCATTATT TTTCGGGATC TATGTCAAAT 6 00 

CGGACTAATG AATTCAATAA TGGAAGTTAA GCAACCAATC TTTGTTTAAC TTCTITTTTA 66 0 

TTTTTTGGAA AATAAAGTTT TGAACATAAT AAAATTTGAT TATGTTTTAA CGAATTTTGA 720 

TGTTTCTTGA ACTATATATC ATCTAGTCGT CATTTACAAT TGGTAAATAT GACTTCAAAC 780

TGkATGAAGG TGATGGCGAT TAAAAGGCTC ATCCGTAGGT TCTAAAGAAC TAGAnTTT 838
(2) INFORMATION FOR SEQ ID NO: 429: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1150 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 429:

nGTGTTCAAA TAATAGAATG GCTAATACCC CAACAATAAA TAGGAATAAA TATGTTGATG 6 0 

TCGTAATTTT AGTAAATAAT GAAAATCTAA AATTAGTAAC CCTATTTTGA ATATAAGCTT 120 

TAACTTCTAA TAATACTGGG AAGCCGATTG ATCCTAAAAT TATTAAAAAT ATAACTATCG 18 0 

CTTGTACAAA ATAATCATGT GCATAAGGAA TTAATGACTT ACCTGTAATA TCTAATCCAC 24 0 

CATTGGTAGT GGCAGAAATA GAAACAAAAA CACCTTGCAT GATTGCATAT TTTAAATCTG 300 

GATTATCTCG ATAAAAGTAA AATGCTAATA ACATGGCTCC TACGAGTTCG ATTACAAATA 360 

TTGATTTTAC AATATCAATA ATCAATTTGA CGGTACCACT CATTGTGTTT TTGTTATTAT 420 

CTAACATAAT TAACTGACGT TCTCTAATTC CAATATGTTT ACCTAGTACC ACCCATAACA 4 80 

TCGTACCAAT TGCCATGACG CCAATTCCAC CAATATTTAA TATCACGAGG ATAATTAATT 54 0 

GTCCAAATGT AGAATAGGTA TCGACAATAC TTATCGGAGA CAATCCAGTA ACACTAATTC 600 

CGGATACGGC AACAAATAAT GTGTCAATTG GATTTACTTC TACACCTGGT TTATGaACAT 660 

ACGGTAAGTT TAATAATAAA AATGCAATGA CAATTGCGAC GATATAGTAC AATACAATAC 72 0 

CTTGTTGAGG GCTTGATCTT TTTAAAAACT GGCTAAAAAT TGACACGTTG TTCACCTCAA 7 80 

CATTATTTTA GTTTAATATT AATTTCTTTC TCTTTACCAT CTCTATAAAT CTTCGCTGTA 84 0 

ATTGATTTCA AGTCATCTTT ATGACTAAAT ATAATCTGCC TAAAGCGTAA ATCATCTTCT 90 0 
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AAACCATTGT TGTCAACTTG ATCTACAACA ACACCGTTCT TAACTTTTCC TGGCAATTTA 102 0 

ACTGCTTGTC TTTCAAAACT ATTTAGACTG ACAATATTCT TCATTTTAAC ACCTACATCG 1080 

GGATAGTCAA TTTTACCTTT tGTTTCTAAA TCTTTTACAA TCTTTtGTAC TTCATTAaCA 114 0 

GGtATTGCAA 1150 
(2) INFORMATION FOR SEQ ID NO: 430:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 797 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 430:

AAGAATATTT AGGATTTTAT GTTTCGCAAC ACCCAGTAGA TAAAAAGTTT GTTGCCAAAC 60

AATATTTAAC GATATTTAAA TTGAGTAACG CGCAGAATTA TAAACCTATA TTAGTACAGT 12 0 

TTGATAAAGT TAAACAAATT CGAACTAAAA ATGGTCAAAA TATGGCATTC GTCACATTAA 18 0 

ATGATGGCAT TGAAACTTTA GATGGTGTGA TTTTCCCTAA TCAGTTTAAA AAGTACGAAG 24 0 

AGTTGTTATC ACATAATGAC TTGTTTATAG TTAGCGGGAA ATTTGACCAT AGAAAGCAAC 300 

AACGTCAACT AATTATAAAT GAGATTCAGA CATTAGCCAC TTTTGAAGAA CAAAAATTAG 360 


CATTTGCCAA ACAAATTATA ATTAGAAATA AATCACAAAT AGATATGTTT GAAGAGATGA 420 

TTAAAGCTAC GAAAGAGAAT GCTAATGATG TTGTGTTATC CTTTTATGAT GAAACGATTA 4 80 

AACAAATGAC TACTTTAGGC TATATTAATC AAAAAGATAG TATGTTTAAT AATTTTATAC 54 0 

AATCCTTTAA CCCTAGTGAT ATTAGGCTTA TATAACTTTy ATCAACTAAT AAATTATGAT 600 

ATAGTAAACT GATGGTTAGA TATTTyTyAA CCATCAGTTT GCGTktATAT TAGTTTTTTA 660 

TGCTTATTAT TTTTATGAGT TTCACTTTAC ATTATTGATT AATCCATAAG AATAATTAGC 720

ACAAAAAGCA GTATACATAA ATTGAGTAAA GAATTTTGTC GATATAACCG nGCGGAAAAA 78 0 

TAATAACnTT TTGGATn 797
(2) INFORMATION FOR SEQ ID NO: 431:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1466 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
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TTTCCAAGAG AAGTTGCGGA AGTAATTAAT AAAACGCATC ATAATAAATT GGTCATTTCG 6 0 

ATGATTTCGk CACAAATCGA TGCGGATAGA ATGGATTATT TACAACGTGA TGCGTATTTC 12 0 

ACAGGTGTAT CATATGGTGC TTTTGATATG GAGCGTATTT TAAGATTAAT GCGACCTTCT 18 0 

AAAGATGAAG TACTAATCAA AGAAAGTGGT ATGCATGCAG TTGAAAACTT TATTATGAGT 24 0 

CGTTATCAAA TGTATTGGCA AATTTACTTC CACCCAGTTA GTCGTGGTGG AGAAGTGCTG 30 0 

CTTAATAATT GtTTGAAACG CGCAAAACAG CTTTATAATG AAGGCTATGA ATTTAAGTTG 3 60 

CATCCACATG ATTTTATTCC ATTTTTTGAA GAGACAGTTA CGATTGAACA ATATGTTGAA 42 0 

CTCGATGAAG CGGTAGTTAC GTATTATTTG GAAAAATGGA CAAAAGAAGA TGATGCTATT 4 80 

TTAAGTGATT TAGCAAGTCG ATTTATTAAT CGAGACTTAT TTAAATATAT TCCATTTGAT 54 0 

GGCTCAATTA TTACAATATC AGAACTGCAA GAACTGTTTG AAGCAGGTGG TATTAATCCA 600 

GATTATTATT TTGTGAGTGA AGCATTTTCT GATTTGCCAT ATGACTATGA TCGACCGGGG 660

TCAAATCGCA AACCGATTCA TTTATTAAGA CAAGATGGTA CGATTAGAGA AATAAGCAAT 72 0 

CAATCATTAG TCATTCATAG TATTACAGGC ATTAATCGCC AAGACTATAA ATTATATTAT 78 0 

CCTAGAGAAA TGGTTGCAAA GATTAAAGAT AAGACAATTA GAGAAGCTAT TGAAAATTTG 840

ATTAATGAGC TTAATTAAAC AGGGCTAAAA TTGTTATCGT TAAATATGGA GGTTATATCA 900 

TTGTCTGAGA AAAAAGGCTT TAATTTTAAT ATCATAAAAA ATGACCCTCT AGATGGTCAT 96 0 

AAAGGTACAA ATATTGGTTC AATTAGCTTA GACAATATTG CACCAGTTTT TATCGATGTT 102 0 

GCTAACAAAG AAGCATTTAT TGATATTGGA GGCATGCATG CTCGTGCCAA AGTTGAAAAA 1080 

GGTGTGAAAT GGATTACTGA TAAAGCTGCT GTTGAAGGCG ATGAAGCTAA AGaATATTGG 1140 

TTGTGTTGGG TAACAACAGA ACGTAATGAA CAAGGACCAT ATTACGCTGG TTTAACAGCG 1200 

TGCTATTTAT TAGTGAATAA AGCAATTCGT CGTGGTTATA AAAGTATGCC TGAACATGTT 1260 

AATATGATGG ATAAATCAAT GAAACATCAT ATTATCATAG ATCAAATTGG TGACGAGAAT 1320 

AAAGCTATTT TAAAAGACTT TTTAATGAAC CATGATGAAG GTATGTGGAA GCATTCTTCT 1380 

GATGCTTTAC ATCAAGCATT TAATTAAATA TTAGAAACTA AAATTTCCCA ATTAATCTAT 1440 

AAAGATATGA TTCATTTCTC AATGAC 1466
(2) INFORMATION FOR SEQ ID NO: 432:
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2304 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
(D) TOPOLOGY: linear
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 432: 
™ACGAAT ACAAACrTGC ATTAATATAG AATCATTTAT GCmTTAAT ATCACAAAAC 
AAATTGArrA AACTAGCTAA AATAGXCAAA ATTGOAATCA AGATTACATA AATATCATAA 
CCrCTAGATA rTAGTATACA TAATATAATT AGCGACCCTA TTATAATTGC TAAATTATAA 

oco^CTAA rrrTATArrr catttcaatt cccaccttaa tagcatttat taaa.attcta 

TAATGITTAC ATAGACTTCT TAACAAACTT TAACCCAGCT AAGCTAAGTA TTGAAATAAT 
XACACCTCCT AATATAAATA TATTACTTAA GCTTAGTAAA GGTATAATTA TACTTATOVG 
TCCTAAAGAC AAT^TATCCG CIX^cATAATT CGaTGtAGAT GAGATACTAA ATACTTTCCC 
CATCAAATGA TTrGGCGTrr TTATTrGAAT GGCAACTGAT CrTGTTAGTC CCTCTATAGA 
TTGTCCAAGT CCCAATAATG TTGCACCTAT ATATAATATC GCCACACTTG GAAACACATT 
;.TAATCGAT AAGCCAATTC CCCAAACTAA AACACCAATA CTAAATTTAA AGATTAATCG 
CTTTTC^AC AGCAAACCCA TAATCAATGA CATTAATAAA GATX3CTATAC CTAAACAT^A 
TGTAGCTAGT Ca.TATACAC CAACGCCCTC nTTAATATA ^GGAAATAA ACAATGGTAA 
TACAACACGC CAAAGGCCAG TATTAATCAA TATGCAGGCA AATTGGATGA TTATAATAAA 
TGGAATTTCT TTAGAT^TT TCAAGAATTC CCAAGTTTCA GAAAAATCTT crm«AG^ 
TCTATCAATC ATGrTGrTAT rTGTATAT^ TAAAAGTGCA TTAAAAATAA ATCCTAAAAA 
TAGCAATATA CTACAAATAA AAAAGACGCC AACATTACCA ACTAGTATTA CAATGACACC 
;^TTAAAGCA GGTAAAATAA TATTTGAGCC TCTT^CAAA CTATCGATTA ACGCATTACT 
TGriGCTAAA TGCTCCTCAX CAATAAmC AGGAAGAATT GCCCTAAACG C^GGATCCGT 
ATAGCAGTTA ATAATX.GTGA TAGCTGTAGA TATGGTTAGA AGCGTCAGAT AArrTAAATT 
TGATGrrATT GCAAGTAAAG GAATAATTAT TATAATCAAA CTTAGTATAA GATCAGATAG 
ATAAAGTATT rTCrrmAT TATGnTATC AGAATATGCG CCACCGAAAA TACCAAATAT 
^ATAAATGGA AGTGTTXGAC TCATAACCAT CATTGATAAT TTTAAAGATG ArTGGTrTGT 
O^XTCAACA GTAAACCAAA TTArTTGTAA CGAAAACAGC ACAAAACAAC TCCOACGTAA 
OATATTACCA ATCAATAAAT ATGT.AAAGTT TCTATTTrTC AAAACTTCTA AATACAACAT 
ATTTATCACC TCTCATAAAA ATAATTGAAT GCATCOVCCA GCmTrTAG ACCTTCrTCT 
;^CTCrCTT TATCCAAAGC GCAATTAATT CTAATATAAT TTAGTCAGn AAATATCAAT 

^^rrrCGAAA tatacatact acitgaaaca ccatacataa cccccaaaat gactacxcag 
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TCATTCTCAT CATATTCTAG GTTGTTTTTA ACAAACTAAA TATAGTGAAT GCAAATCAAC 180 0 

TATTATTTAA ATTATGAATT ATTTTAATTC TTTCTTCTAC GAGCCAATAA CATTAATCCA 1860

GCAATTCCAA TTATACTACT AAAGATCAAA CCTTTTTGCG TGCTTTCTAA ACCTGTTTTT 1920

GGTAATTCTG CTCGTTTTTT CTCTTGATTA GCTACTGATT CTTTAGCAAT TTTAGATTTT 198 0 

TTAACTTTAT CATTTTTATC CATTGAATGA ACTGGGCCAT TTGGTTTTGC TCTGTCTTTC 204 0 


GATAATCCTG GATTGTTAGG ATTTACTGGG CCACTTGGAT GAGTTGGTCT GCTCGGCTTC 2100 

TCTGGGTTTT CAGGTCCTTT TGGATCTTTT GGTTTCTCTC CACCGAACTC TACAATCTTA 216 0 

TCTACTGGTT GTTTtGTGAT CTCTTCTGTT GGTTGACCCT CGCCAACTTT TTCACCTGTT 2 22 0 


AATGGGkTCA CTGTGAwTGG TGkTGTGAwT GTCyTACTTC CTGGkTGTCC TTCTTGTTTC 228 0 

ACTCGCTCTT CACCAGGTTG TAAT 2304
(2) INFORMATION FOR SEQ ID NO: 433:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4733 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 433: 

TTTACAAGAT ACAGCCTGAC AATATTTATA TTACAAACCA ATATGGTAAA 6 0 

CTCGATTAAA AAGAATTTTA GAGCAAAGTA TACCTAATAA AGAATATATT 12 0 

AGATATCATT AATTACATTT TTTAAACCAG AATATAATAA ACAACATGTG 18 0 

TTAATTCTTT ATCTTCTTCG AAAATAAAGA ATTTAGCCAA AGAAAATGAT 24 0 

TTTATTTAAA TTTTAAGGAT GTTAATTATC AACTGaTAaG TGAAGGCAAT 3 00 

CTGCAAAACC ATATTTGCAT TGTGTGTTCA ATAAAGATAT TTTAAATATG 36 0 

ATATAGAAAA TTTTTTCAAA AGTATAGAAG TCAAATACAG TAAATAAGAT 42 0 

ATCCACCGAC CAGCAGATTT TTTTAAAAAA TTAATACCCC GTTAATACCC 480 

TTTGATGAAA ATCAATGAAA TTCAAAATTG AAGAAATCCT TTAATATCAA 54 0 

AGTCTATATA ATCATGCGAA ATTCTAAAAT TTTCtGAcGT AAAAAAACAA 600 

CAGCTCGTCG TTCACCTCAA TTCTCAAAAC GTTAATTGTC GGACGATATA 660 

ACCTCGATGT TATGTCGAGG TGTTTTTTTG CGTTTGTGTG GGGAATATGG 72 0 

AAATGAATTA ATGTCTCAGG TATAGAATTA ATTCAACTAT GAATTATTGA 780 
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GTCTTAATAT 
GAGGCTGTGT 
GATTTATCAG 
AATGAGGAGT 
GGTATTATTA 

GATAAAGTTG 
GAATTATTGA 
TTTGCTTCAA 
GGTGTACGAC 
ATTCyTAAAG 
TATACAAAAC 
AATAGAGTAT 
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rrAGCTTGCG GAGACCGTAC GATAGTGTTC TAATAATAGA TATTCAAGGG AAAGAGCTAT 
CGGATGATAA CCAGATGATA ACTCAAGAAC CATGrTTTAA GAGTGTTATC AAAATAAGGA 
GTGACTTAAT TGAAAGATTC TATATrTTGG AAGAAAGCTT TTATTyCTGT TTATTTTATT 
GTTGSGATGC TGGTGTTTCT ACrrrrTAGG TTTrATATTA AaACAGATAA CrTTTCTATA 
TATTTAATGA GTATCTTCTT AATrTGTTTA GGAACTGCTT CTATCATTTA TAACTATAAA 
ACCAATCGAT AAATATAAGT ATGAAATGTA TAAGTCACAG TTAGATCTAA GTCTTGCTGT 

GGcrrrrrAT atctccataa aacgacaaat tcaagcccga cataaaacag cattttcagc 

CCACCATAAA ACGACAATTT CAGCCCGCCA TTGACTAAAC ACCACATCCC AAAAATATCG 
TAACAATCCT CTACATCAAT CAATCCAACA TCCCTCATAA TCACAACGCA CAAAATCTAT 
TCATGCATTT rTGGAATACT TAGTATTACA AATAACGATT TTTATTCATC TAACAAAGGT 
TATATAATGT ACTGAAGGCA ATTTTTATGT ATTACAAATC TAATCGTACA TGTAAAATTT 
TGATAAACAT CATTAArrTT GCGTAACTAT CATTAGATTA CAAATCACAA AGTAATTACA 
TGTAATACAC ATCTATACAT CACATTTGAA GGGAAATGaA TATAAATGAC TGATAAAAAG 
TACACTGCAG CCGATATGGT TATTGATACT TTGAAAAATA ATGGGGTAGA ATATGTTTTT 
GGTATTCCGG GTGCAAAGAT TGACTATCTA rTTAATOCTT TAATTGATGA TGGTCCTGAA 
CTTATTGTCA CTCGTCATGA ACAAAACGCC GCAATGATGG CGCAAGGTAT TGGAAGATTA 
ACAGGCAAAC CGGGTGTAGT ACrTGTTACA AGTGGTCCTG GTGTAAGTAA TTTAACTACT 
GGATTATTAA CCGCAACATC TGAAGGGGAT CCTGTATTAG CGTTAGGTGG CCAAGTGAAa 
CGnAiATGAT TTATTACGAT TAACGGATCA AAGTATTGAT AATGCTGCGC TATTAAAATA 
TTCATCAAAA TATAGTGAAG AAGTACAAGA TCCTGAATCA TTATCAGAAG TTATCACAAA 
TGCAATTCGA ArTOCTACTr CAGGAAAAAA TGGCGCAAGT TTTATTAGTA TTCCGCAAGA 
CGTTATTTCT TCACCAGTTG AATCTAAAGC TATATCACTT TGCCAAAAAA CAAATTTAGG 
AGTACCGAGT GAACAAGATA TTAATGATGT CATTGAAGCG ATTAAAAATG CATCATTrCC 
TGTTTTATTA GCTGGTATGA GAAGTTCAAG TGCAGAAGAA ACAAATGCCA TTCGCAAATT 
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AGTTGAGCGC ACGAATTTAC CAGTTGTAGA AACATTCCAA GGTGCAGGTG TAATTAGTCG 
TGAATTAGAA AATCATTTCT TCGGTCGTGT GGGCTTATTC CGCAATCAAG TTGGTGATGA 
ATTATTACGT AAAAGTGATT TAGTTGTTAC AATCGGTTAT GATCCAATTG AATACGAAGC 
TAGTAACTGG AATAAAGAAT TAGAAACACA AATTATCAAT ATTGACGAGT TCAAGCTGAA 
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CAATTAAGAA CACATATTGA TGAAGAAACT GGTATTAAAG CGACGCATGA AGAAGGAATT 2700 

CTACATCCAG TGGAAATTAT TGAATCTATG CAAAAGGTAT TAACTGATGA TACTACTGTA 276 0 

ACAGTTGATG TTGGAAGTCA CTATATTTGG ATGGCACGTA ATTTCAGAAG TTACAATCCA 2 82 0 

AGACATTTAT TATTTAGCAA TGGTATGCAA ACGCTTGGTG TAGCATTACC GTGGGCAATT 2 88 0 

TCAGCTGCAC TTGTGCGCCC TAATACGCAA GTTGTGTCCG TTGCTGGCGA TGGTGGCTTT 2 94 0 

TTATTTTCAT CACAAGATTT AGAAACGGCC GTACGTAAAA ATTTAAATAT CATCCAGCTT 3000 

ATTTGGAATG ATGGAAAATA TAACATGGTT GAATTCCAAG AAGAAATGAA ATATAAACGT 3 060 

TCGTCAGGTG TAGACTTCGG TCCTGTAGAT TTTGTAAAAT ATGCAGAATC ATTTGGCGCG 3120 

AAAGGTTTAC GAGTTACTAA TCAAGAAGAA TTAGAAGCGG CAATTAAAGA GGGCTATGAA 3180 

ACAGATGGTC CAGTATTAAT TGATATACCT GTAAATTACA AAGATAATAT CAAACTTTCA 324 0 

ACAAATATGT TACCTGACGT ATTTAACTAA AATAAAGATA AATGTTAAAG AGGAGTGGGA 3300

GATTTTATGA CTAATGTTTT ATACCAACAT GGTACATTAG GCACATTAAT GGCAGGATTA 3360 

TTAAAAGGAA CTGCATCAAT AAATGAATTA TTGCAACATG GTGACTTAGG TATCGCTACA 3 420 

CTAACAGGTT CAAACGGTGA GGTAATCTTT TTAGATGGAA AAGCTTACCA TGCAAATGAA 34 8 0 

CATAAAGAAT TTGTAGAATT AAAAGGTGAC GAGTTAACAC CATATGCAAC TGTAACTAAA 3 54 0 

TTTGTAGCAG ATACAAGCTA TGAAACGAAA GATAAATCTT CAGAAGCAGT TTTTGCAGAA 360 0 

ATTAAGGAAA AGATGTTGAG TGAAAATTTA TTTTCAGCAG TAAAAATTTC AGGCTTATTT 3 66 0 

AAAAAAATGC ATGTACGTAT GATGCCGGCT CAAGAACCAC CTTATACACG TTTAATCGAT 3 72 0 

TCAGCTAGAA GACAACCTGA ACAAACTGAA ACGTATGTCA AAGGTTCAGT CGTTGGTTTC 37 8 0 

TTTACACCAG AATTATTCCA TGGTATCGGA TCAGCAGGAT TTCATGTACA CTTTGCGAAT 384 0 

GATGATCGTA ACTTTGGTGG ACATGTCTTA GATTTTGAAG TAGAAGATGT TAAAGTAGAA 39 00 

ATCCAAAATA TAGAAACATT TGAACAGCAT TTTCCAATTC AAGATAAAGA TTTCACTAAA 396 0 

GCAAATATTG ACTATAAAGA TATTGCAGAC GAAATTAGAG AAGCTGAATA ATGAATCCAG 402 0 

AAATATAATG ACGGTTTATG AAAATTGACT TCATAATGCG CGATTTAGAA ATGATAGTTT 408 0 

GTAAATATGA TTAACCATGA CTACAATAGA ACAAATATAT TTATAATTAC GTCTAAGTAA 414 0 

TAAAATAAAT CCCTTCACTA TTAGCAGTAG TGAGGGGATT TATTAGGTTC CAGATATTTG 420 0 

AGATTTGCTG TTATGTTTAG ATTATAAATT GTGGTACACA CTCATATAAA ATTTACTATT 426 0 

GTATAGGCCA ATCTGTTACT ACGAGAAGCA AACAACAATA ATTTACAAGT TCAATAACTA 4 320 

AAAAGACAAA CGCCAATTTT TCAGCGCTTG CCCTATAAAA CTATTTTCAA ATTATTATTT 43 8 0 

EP 0 786 519 A2



TTTCTrrAGA rTCACTACTT mTATTACC ATCATTCAAA GTAAGCGTAT AAGTTGCrGT 
rrGGGCATTA TTAATTTTTT CTGTTGTAAC ACCACGTTGA GAAGCTAATT CATTTTrrAC 
TTTACTGTCA ATTTCTTGAT AAAGAACATT TrTATTTTCT GGGAAGATAA AGTAAGTTCG 
ATGTAATGCA GTAATACCAT CTACTGAAAT TGTGTAAGGA ACAGTGTGAT AACCATCCAC 
AGTCATTTCT TTATAGCCGT TATTACTATC TGCAGATGCT TCGTGACTCG GTA 
(2) INFORMATION FOR SEQ ID NO: 434: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1284 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 434: 
AATAAATGGT AATTTTATAT CATCAACGGC TACAAATGGG GCAGmCAT ATATAAATAA 
ACACATCTAC TCAACTGnTC TTAArTAATT AATAAATGAT TTACCAGATG ACATATAACA 
GCCATTATTA GnTTAATTCT AATAGTTTAT TTAAATTTTC TTCGGTTGTC GCCCAACTGG 
TTGCGAATCT AACAACACGA TOrTGATCAT CGTATTITrC CCAAACAGCA AATTTAACTT 
nTGTrCTAA CTCTGCTATT TTCTCGTrAC TTAAAATAAA AAATTGTTGA ITGGTTGGAG 
AATCAAAGTA AAGACGATAG CCTTTATTIT TAAACCCGTC rrrCATCTTA rrTGCCATIT 
CGATAGCATG TCTGCTTATA TTAAAATATA AATTGTCCGT AAATAATTCT AAAAATTGTA 
TGCCTGTTAA CCGTCCTTTT GCTAAAAGGG CACCGTGGAT GCTTOArTCG AGTGGTAAAT 

TGrrrcGGrr cATTArrrrr cgtaaaaaca atggcttccc cgcataatgc acctatctkc 

GTACCACCTA TATAAAATAC ATCACAATAT TTAGCGrTGt CTTTAATAGT CATATCTGAT 
tggtcactca TCAATCCATA CCCTAATCGT GCACCATCCA TAAATAATGG AAGCTGATAT 
TGCTTACATA CTTTGCATAA CTCTTCCAAT TCTGaTTTAG AGTATAATGT GCCATATTCT 
GTAGGATGAG AAATATATAC CATTCCTGGG AATACCATAT GGTCCTTTTT AAAATCACIT 

ttaaatgtct ccatgtaagt ttcaacatct gaagcactaa CTrrrccrrc cttagagggt 

atagtaatta CTTTATGTCC ACTATATTCA ATTGCACCGC CCTCATGCAC agcaacatga 
ccagtgtctg CTGAAATGAC CCCTTCGTAA CTTTCTAACA TTGAATTAAT AACAACCTGA 
TTGGTTTGTG TTCCACCTaC TAAAAAACGA ATTGTAGCAT TTGGaCAGTC AATTGTATCT 
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TCGAATGAAA TCACTACATT TCCCCCTAAA ACTAATATCA ACATTTTAAT AAGATAAACC 12 0 0 

AATTTCAAAA CTAGTTCGAT ATTTAAAATG TATTATGGAT GGnTAAAGTT TGTATCGCAT 12 6 0 

TATCGCGAAG TTGnATAAAT ATAT 1284 
(2) INFORMATION FOR SEQ ID NO: 435: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1072 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 435: 

ACAGCTTTTG GTAArGGAGA AAwTCAaTAt GAAACAGTAT AArGCGTATT TaATCGATTT 60 

AGATGGCACA ATGTATATGG GAACAGATGA GATTGATGGA GCAAnACAAT TCATCGATTA 12 0 

TTTAAATGTA AAAGGCATTC CTCATTTATA CGTAACTAAT AATTCAACAA AAACACCTGA 180 

GCAAGTAACT GAAAAATTAC GTGAAATGCA CATTGATGCT AAACCAGAAG AGGTTGTAAC 24 0 

GTCAGCGTTA GCCACTGCTG ATTATATTTC AGAACAATCA CCAGGAGCAT CAGTATATAT 3 00 

GTTAGGTGGG AGTGGTTTAA ATACTGCGTT AACCGAAGCG GGACTTGTCA TTAAAAATGA 3 60 

CGAGCATGTT GATTATGTAG TTATTGGACT TGACGAACAA GTTACATATG AAAAGCTTGC 4 20 

GATTGCAACG TTAGGTGTAA GAAATGGtGC AACATTTATT TCTACAAATC CTGATGTATC 480 

AATTCCTAAA GAGCGTGGTT TATTACCTGG TAATGGTGCT ATTACAAGTG TTGTAAGTGT 54 0 

ATCGACAGGT GTATCGCCAC AATTTATTGG TAAACCAGAA CCGATTATTA TGGTTAAAGC 600 

ATTAGAAATT TTAGGATTAG ATAAATCCGA AGTTGCTATG GTAGGCGATT TGTACGATAC 6 60 

CGATATTATG TCTGGTATTA ACGTAGGTAT GGATACGATT CATGTACAAA CAGGTGTATC 720 

TACGTTAGAA GATGTGCAAA ATAAAAATGT GCCACCAACG TATTCTTTTA AAGATTTAAA 780 

TGAAGCAATA GCTGAATTAG AAAAATAGAT ATAGTCATTT TATAAAGTAG GTGAATTGAT 84 0 

TTGGTAAAAA TAGTTGTTTC GAGGAAAATT CCAGATAAAT TTTATCAACA ATTAAGTAAA 90 0 

CTTGGTGACG TTGTTATGTG GCAAAAATCA TTAGTGCCTA TGCCTAAAGA TCAATTTGTG 96 0 

ACaGCcTTCG TGACGCAGAT GCTTGTTTTA TTACATTAAG TGAACAGATC GATGCAGAAA 102 0 

TTTTAGCGCA ATCACCAAAT TTAAAAgTAA TTGCGAATAT GGCTGTAGGA TA 107 2 

(2) INFORMATION FOR SEQ ID NO: 436: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3271 base pairs 
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(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 436: 

TAAAAACTTA CTTTAACACC ATTCCTTTTT AACTTTTTTC GTGTTTCnTT TnCTTAAGTC 6 0 

CATCCATATT TTTAATGATG TCATCTGCTG TTTTATCTTT TAAATCTAAC ACTGaGTGAT 120

AmCGAATTTG TAGCACAGGA TCAAATCCTT TATGGAATCC AGTATGTTCA AATCCTAAGT 18 0 

TACTCATTTT ATCAAAGAAC CAATCATTAC CAGCATTACC TGTAATCTCG CCATCATGAT 24 0 

TCAAGTATTG ATATGGTAAA TATGGATCGA TATGTAGGTA TAGACAACGA TGTTTTTTAA 3 00 

CATATTTTGA TAATTCATTA AAGAAAAAGT GTACGAGTTC TTGATTTTCA TAATCAATCA 36 0 

CTGGACCGCG ATTTGAATAA AAATACTTGA ACACTTTCAT AACAGGTACA GCAGTAAGtA 4 20 

AGCAAGCTGC AATGACCTCG TTATTATTGT TTTTTATTCC CACTAAATGT GTTTCATAAC 4 80 

CTTCAGCAAG CTTTAACTCA TAGTGGCCAA CAGTTTGCGT GaAATGACTG TATGGCATGC 54 0 

TATCTGTAAa GGCACCAAAC TCTTTAGCTG TTAAATTTGT AAACTTCATT ATCATTACTC 6 00 

CTATTTGTCT CTCGTTAATT AATTTCATTT CCGTATTTGc AGTTTTTCTA TTTCCCCTCT 660 

GCAAATGgCA AAAATAATAA ATCTAATCTA AATAAGTATA CAATAGTTAA TGTTAAAACT 720 

AAAACATAAA CGCTTTAATT GCGTATACTT TTATAGTAAT ATTTAGATTT TTGAATACAA 780

TTTCAAAAAA AGTAATATGA ACGTTTGGCT TTGCTCATAT TACTTTTTTT GAAATTGTAT 840

TCAATTTTAT AATTCACCGT TTTTCACTTT TTCAAACAGT ATTCGCCTAA TTTTTTTAAA 9 00 

TCAAGTAAAC TTAATTATTC AATGTTTGTT GGATAGATTG TAAATATTTA ATGATTTCCT 960 

CACGCGTGTT AGATTTAAAT CGCTTAACGA TTTCGCTACC AATGACAATG CCATCTGCAA 102 0 

CCTCTTTTAT ATCTGCAACA TGTTGTGGTG TTCTTATACC AAATCCTGCG ACAACTGGCA 1080 

CATTGGCTAT CGCTTTAATT GACTCAATTT TTCGTTTTAA TTCTGGATGA AACGCACCGT 114 0 

TTTGCCCTGT TGTCGCATTC ATCGTCACAG TATAAATAAA GCCTTCCGCA TGGGATACGA 12 0 0 

TATCTTTTAT ACGTTTGTCA TCAGTAGTCA TCGCAACTAA CGATATGATT TTGACGCCAT 12 6 0 

AGTGACTAAA TTGTTGTTTT AAACGCTGCG ATAATTCATA TGGTAAATCA GGAATAATTA 132 0 

AGCCGTAGAC ACCAGTATCT CGACATTTTT CAAAAAACGC TTGTTCTCCA TAATGACAAA 13 8 0 

TAATATTATA ATACGTCATT AATACATAGT TACACTTAAT TTGATCACCA TGTTTTTCTA 14 4 0 

;r>-'^r;AA AATATAATCT ATCGTGATGC CTTGTTTAAT CGCTTGTTGA CCTGCTTCCA 1500 
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GTATAAATAA TTTAGTCATT TGCAAGACCT CGCTCTACCA TATATTGTCT AATTGTTTCC 16 8 0 

ATATCTTTAT CGCCACGTCC AGAAATAGTT ACTACAATAA TATCTTCTTT CGACATCGTA 174 0 

GGCGCTAGTC TTTCAACATA ACTCAGTGCA TGTGCACTTT CAATTGCAGG TATAATACCT 18 00 

TCATGTTTTG TAAAGTTGAT TAAAGCATTC ATTGCTTGTG TATCACTAGC ATTTTCAAAA 1860 

GTTACTCTAC CAATGTCGTG GTAATAAGAA TGTTCTGGTC CAATACCAGG ATAATCAAGT 192 0 

CCTGCTGAAA TAGAATGTGC TAGTTGCACT TGCCCATCTT CATCTTGAAT TAAATACATT 1980 

TTAGTACCAT GTAATACGCC AGGTGATCCT TTGCCAATTG CAAGTGCATG TTTATCAGTA 2 04 0 

TCATCGCCTT GACCTGCGGC TTCAACACCG TATAATGCAA CATCATCTTT AATAAATGGA 2100 

TAAAATGTAC CGATTGCATT TGAGCCACCA CCGATACATG CTACAATTGC ATCCGGAAGT 2160 

CGACCTTCTT TCTTCAATAT CTGTGATTTT ATTTCTTTAC CAATCACACT CTGAAAATCT 2220 

CTAACAATCG TTGGGAACGG GTCTGGACCT AATGCAGAAC CTAATAAATA ATGTGTATCA 2280

TCTACATGAC TTACCCAATA TTGCAATGCT TTATTAACTG CATCCGATAA AGTCCCTTGA 234 0 

CCATCTTCAA CTGCCACAAC CTTTGCACCA AGTAATTCCA TTCTAAATAC ATTAAGTTGT 24 00 

TGTCTTTTAA TATCTTCACT TCCCATAAAG ACAACAAGTT CCATATCAAA TAATGCAGCA 24 6 0 

ACCGTAGCAC TAGCTACACC ATGTTGACCC GCACCAGTTT CAGCAACAAG CTTCTTCTTG 2 520 

CCCATTCTTT TAGCAAGCAA CGCTTGACCT AACGCATTAT TAATTTTATG GGCGCCTGTA 2 58 0 

TGATTTAGAT CCTCTCGTTT CAAATATATT TTAGCGCCAC CTAGGCTTTC AGTATATGAT 2 64 0 

GCAGCATATG TAAGTGGTGT CGCGCGTCCT ACATACTCTG ATAAATAGTA TTCCAGTTCT 2 70 0 

CTTTGAAACT CTGGGTCTGC TTTTGCCTCT TTATAAGCTT TTTTCAACTC AATAATTGCT 2 76 0 

GGCATTAATG TTTCTGGAAC ATATTGCCCT CCATATTCAC CAAAGAAACC TAATTCATCT 2 820 

GCTTCTGTTT GTATTTGTTT ATTCATTGTC TCTATCTCCT TTCACAATAT TTACAATTGC 2 8 80 

TGTCATTTTT TCTATATCTT TTCGCCCATT TACTTCTATA CCTGATGCAA GATCATAACC 2940

TTGATGTGAT AATTTAAGTT GATTAACTGT TTGAATATTT TCAGAGTTAA TGCCTCCTGC 3 0 00 

TATCAAATAA GGTATGTCTT TTATGTGCTT CAAAATAGTC CAGTCATATG TTTGACCGGT 3060 

ACCACCATAC GACACTGAGG GTGTGTCGAT AATAAATAAA TCTACGAACC CTTTATATTT 3120 

ATTTATGTTT TGGATTATGT TTTCATCTnG CAGtAAAGCT TTAGTGATTT TAATGCTTGA 3180 

ATATkTCTTT TTAATTTCCT GTAtAAAAtC AATAGATTCt GtGTAACTGT ATTGTGkTAA 3 24 0 

wTGaCGwATg CtTAAwACgT GTGCCAATGG T 32 71 
(2) INFORMATION FOR SEQ ID NO: 437: 
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(A) LENGTH: 1553 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 437: 
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ATAATGAAAT AATACTGTGT TTTATCTGCG AAATGTAtCA TTTTCTAATc GTTTCACAGT 60

AAAATGAAAA GATAAAGTGT GTTTTTACTT GAATTTTGAC TAAAATTACT CTATATTTAT 120

TAATTGAGCT ATGCTTATTA TTACAATTTG ATTACAAATT TTAAATTTGT TAATTGAATG 180

ATAATATTAA ATAAAGAAAC TTACACAAGC AAATATGAGT TGTAGCCCAA AATACTTGTT 240

AAATCAAAGT TGAAAGCTAC AAATAATGAA AATTATAAAC TTGAATCTGA AAGTAATTAC 300

TATAATTATG ACAATGTTAA CTTTTAAACG CACTTATTAA TTAACTACAT AATGTTAATA 360

TCTAATTTAT TCAAGTACTT TCGCAAGATT TATTATCTAA ATAACGGGGG AAAGAATCAT 420

GAGTtCACAA AAAAgAAAAT TAGTCTTTTT GCGTTCTTCr TATTAACCGT AATAACGATT 480

ACCTTGAAGA CGTArrrrrc TTATTATGTT GATTTTTCTT TAGGTGTTAA AGGTTTAGTA 540

CAAAACTTAA TATTATTGAT GAATCCTTAT AGTTTAGTAG CACTGGTTTT AAGTGTGTTC 600

CTATTCTTTA AAGGCAAAAA AGCATTTTGG TTCATGTTCA TAGGCGGCTT CTTATTGACG 660

TTCCTATTAT ATGCCAATGT TGTGTACTTT AGATTCTTCT CTGATTTTTT AACGTTTAGT 720

ACTTTAAACC AAGTAGGTAA CGTAGAATCT ATGGGTGGTG CGGTTAGTGC ATCATTCAAA 780

TGGTATGACT TTGTTTATTT CATTGATACG TTAGTTTACT TATTCATTTT AATATTTAAA 840

ACAAAATX3GT TAGACACAAA AGCATTTAGT AAOAAATTTG TTCCTGTCGT AATGGCGGCT 900

TCAGTAGCAT TATTCTTCTT AAACTTAGCT TTTGCTGAAA CTGACAGACC AGAATTATTA 960

ACACGTACAT TTGACCATAA ATATTTAGTC AAATATTTAG GACCTTATAA CTTTACAGTA 1020

TACGaTGGTG TTAAAACTAT CGAAAATAAT CAACAAAAAg CGCTAgCATC TGAAGATGAC 1080

TTAACaAAAG TATTAAAtTA TACGAAAcAA CGTCaAACtnG AGCCTAACCC rGAwTATTAT 1140

GGGGTGGcAA rGAAGAAAAA TATTATTArG ATTCATTTAG AAAGTTTCCA AACCTTCTTA 1200

ATTAATAAAA AGGTTAATGG TAAAGAAgTA ACACCGTTTT TAAACAAATT ATCAAGTGGG 1260

AAAGAGCAAT TCACATACTT CCCTAACTTT TtCCATCAAA CAGGTCAAGG TAAAACATCT 1320

GACTCTGAAT TTACAATGGA TAACAGTTTA TACGGTTTAC CGCAAGgTTC TGCCTTTTCA 1380

           ATAATACGTA TCAGTCATTA CCAGCAATTT TAGATCAAAA GCAAGGCTAC 1440
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(2) INFORMATION FOR SEQ ID NO: 43 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1419 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 438: 
ACAAAATCAA TCAGCAAAAT GATTGGATTA AGACGCAAAT TGAGCGTTCA ATGGAAGGCG 6 0 

AAACAGTTGG CATTAATGAT CAAAATATAG AAATATATAG TGAAcGtTgA GATTTATACC 12 0 

ATACACTCGT ACCTTTAAAT CAAGAATTGC ATAAGTTGCG ACTTAAAACT CAAAACTTAA 180 

CCAATGAAAA TTATAATATT AATGATGTGA AAGTTAAAAA GATTATTGAA GATGAACGTC 24 0 

AAAGACTAGC ACGAGAACTT CACGATTCTG TTAGTCAGCA ACTTTTTGCG GCmAGTATGA 300

TGCTATCTGC TATCAAAGAA ACGAAtTAGA ACCACCATTA GACCAACAAA TTCCTATTTT 3 60 

AGAGAAAATG GTTCAAGATT CGCAGTTAGA AATGCGTGCT TTGCTGTTAC ATTTAAGACC 4 20 

GCTTGGTTTA AAAGACAAAT CTTTAGGTGA GGGTATTAAA GATTTAGTTA TTGATTTACA 4 80 

AAAAAAAGTG CCAATGAAAG TTGTGCATGA AATACAAGAT TTTAAAGTGC CTAAAGGTAT 54 0 

TGAAGATCAT TTGTTCAGAA TTACACAGGA AGCAATTTCG AATACATTGC GTCATTCAAA 6 00 

CGGTACAAAA GTGACAGTAG AATTGTTTAA TAAAGACGAT TATTTATTGT TGAGAATTCA 6 60 

AGATAATGGT AAAGGTTTTA ATGTTGATGA AAAATTAGAA CAAAGTTATG GACTTAAAAA 72 0 

TATGCGTGAA AGAGCTTTGG AAATTGGTGC AACGTTCCAT ATTGTATCAT TGCCAGATTC 780 

AGGTACACGT ATCGAGGTGA AAGCACCTTT AAATAAGGAG GATTCGTATG ACGATTAAAG 84 0 

TATTGTTTGT GGATGATCAT GAAATGGTAC GTATAGGAAT TTCAAGTTAT CTATCAACGC 900 

AAAGTGATAT TGAAGTAGTT GGTGAAGGCG CTTCTGGTAA AGAAGCAATT GCCAAAGCCC 960 

ATGAGTTGAA GCCAGATTTA ATTTTAATGG ATTTACTTAT GGATGACATG GATGGTGTAG 102 0 

AAGCGACGAC TCAGATTAAA AAAGATTTAC CGCAAATTAA AGTATTAATG TTAACTAGTT 10 80 

TTATTGAAGA TAAAGAGGTA TATCGTGCAT TAGATGCAGG TGTCGATAGT TACATTTTAA 114 0 

AAACAACAAG TGCAAAAGAT ATCGCCGATG CAGTTCGTAA AacTTCTAGA GGAGAATCTG 12 00 

TTTTTGAACC GGAAGTTTTA GTGAAAATGC GTAACCGTAT GAAAAAGCGC GCAGAGTTAT 126 0 

ATGAAATGCT TACAGAACGA GAAATGGAAA TATTATTATT GATTGCGAAA GGTTACTCAA 1320 

ATCAAGAAAT TGCTAGTGCA TCGCATATTA CTATTAAAAC GGTTAAGACA CATGTGAGTA 13 80 
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(2) INFORMATION FOR SEQ ID NO: 439: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 608 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 439: 
ATGT^GGCGG ACGTAAAGGA CATGTITATA CTGATGATCG AGCATTAGAT ATTGATATCG 
TTCCGCCTGC TCAAGCAGAT GGTAAAGCTA CTAACCCCGA ACAATTAriT GCAGCAGGTT 
ATGCATCTTG CTTCAACGGT GCTITCGACC TAATTTTAAA GCAAAACAAA GTGCGTGATG 
CTCATCCAGA AGTAACACTA ACAGTGAGAC TAGAAGATGA TTCAGACTCA GAAAGTCCTA 
AATTAAGTGT TTCAATTXIAT GCGACAATTA AAAATGTTAT ATCTCAAGAA GAAGCT^^ 
AATArriACA AATOGCTCAT GAATTTTGTC CATATTCAAA AGCGACTCAA UOAAATA. xA 
ATGTCGATTT AAATGTAAAT GTTGTAGATT AGCATTAACT TAAAGAGATT ATTCAACGTT 
ATTAATAAAA TTCACATAAA ATTCAAATTt StCrAcCAAA AATTTTTGGT TOGyTATTTT 
rrCTATTCGT GATTGAAATT TCTGGCAATA riTAACT^AA AATGATTCTA CCTTAGTCAT 
CATAAATGTG ACCGGTTCCA ATACTGGCTT GACTTCTTCG CATACCGTCT ACAAATAAAA 
GTCCAGTG 

(2) INFORMATION FOR SEQ ID NO: 440: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 682 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 440:
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ATTGGACCGT GGGTAACAAA TACTAATTGA AGTCGTGATA ACGTAAAAAT TAAAAATACT 4 80 

ATTTATAATC ATTTACATTT AAAGCATATA TTTGAATCGA tGCCTTATCT CTATGGTGAT 54 0 

GATATTAATA AACTTCAAGG TAGGCCTATC GTTGGCTTAT CTCATGCTGC CGGGTATGCA 60 0 

TGTGGCTATC ACTTGGTAAA ATACTTTTTA CAAAAAACAA ACATACCTAT TGAAGTTGCT 66 0 

ACAACACTTC CAGCACAAAA AA 6 82 



(2) INFORMATION FOR SEQ ID NO: 441: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2574 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 441:

ATCTTCTTTT AACATGACAA ATTGCAACAA AATATTAATG ACGCCAAAAG ACATTTTTTC 6 0 

ACGTTCAATT AATTCTTCAA CCATTGTCTT TTGCGATATA GTTGGTTCTG ATTCAGACCA 12 0 

AGAAGCTAAC ATATCAATTG GACTCGTTTG TTCAAGTAAC TCAAACCATT CATCACTTTG 180

TGGCTTTGGA TTCACTTCTG AAGATTTGCC CGCCGAAGAT GATGTAGCAG GAGATTTCAC 24 0 

CTGTAATTTA GGCATTTGAT TTTCGTGTTC CATTAAGTAA TACGAGCGTG CTTGTTTACG 300 

CATTTCTTCA AAGGATAACT GTTGTCCACT TGTAATTGAA TTTAAAATAA CATGCTTCAT 360 

GCCATCTGCT GTTAAACCAT ATAAAGTCGC GAGTTGTGTA ATTAAACGCT TTGCATCTTT 4 20 

GGTAACAATG TCTTGACTAA TAAAATGTTT ACCTAACATT TGTCTCAACA TTTCAAAGTC 4 80 

AAAAGATTCA TTTGATAAAT CGATACCTTG GTACGGTTCA TTAATCGGaA TATCACTTGT 54 0 

ATCGATATCT ATTTTTGTAG ACGGCACTTT AAAAACATCA GTAAATTGTC TTGTTACCTG 60 0 

TTTAAATTCA CTCAAATCAA TTTGTTGATA CTCAAAGTAT TTCTTCAACT CATGAAATCG 660

ACGATGCTCG ACTTCACTAT ATAAAAAGAT TGACAACATT GGATCATTAA AAAATAAATG 72 0 

TGCTGAAGGG GtTGaATTAA TTGGTAAACA AATTGTGTTT CTTGTTCATC ATGTTTGACA 78 0 

AACGCCTTTA ACAATCCAAT CGCTTCAAGT AAGTCCATTT GTTGTCTAAA CTCTAGTAAA 84 0 

TTAATTTTAA GTTCATTCAT AAAAATATAA TGAGAAAGAA TCAATGTTTC ATTATGACTT 900 

TCTTTAACGA ATTGAGTCAT AAAATGATAT AAACCCACTG CTTGCGTTCC AATTAGCGGT 96 0 

GTATACAGTC GATTCAATAC CTCTAAATGA TTCGTATTTA AATCAAAGTG TTGCATAACT 102 0 

TTGAATTGAT CCTTTGGTCT TAAGCCGAAT TCGAAGGCTT GTCGTCCCaT TTAAGCnATC 108 0 
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AAATTCTTTA TAGACTGATG CAAATCTAAC ATATGAAACT TGATCAACAT GCATTAACAA 
GrrCATAACG TGTTCACCTA TATCTCGTGA AGACACTTCC GTATGACCTT CATCTCGTAA 
TTGCCArrCA ACCTTGTTAG TTATGTCTTC AAGTTGTTGA TATCTAACTG GTCGTTTCTC 
ACAAGAACGC ACAAGTCCAT TAAGTATCTT TTCTCTTGAA AACTGCTCTC rTGTGCCATC 
TTTTTTCACA ACTATAAGCT GACTAACTTC GATATGTTCA AATGTAGTGA AACGTGTTCC 



ACAATTTTCA CATTCTCTTC GTCTTCGAAT GGCATTTAAT TCATCGGCAT GCCTTGAATC 
TACAACTTTA GATTGTGTAG AATTACATTT CGGGCATTTC ArTACATCAC CCTCTTTATT 
TTGArrATGC CTAATTATAC TATAAATCTA GAGATGAAAA AAGAATCCCT CAATTTAATT 
CATTTAACCA AATAATGAAA CAATAAAAAA CArTATATCG TTACrTATTA AGTAATTTGC 
ATGACAATAT TATTGTATTA AAAATAAAAA ACCTAACTCC GAAGTCAGAG TTAGGCTATA 
AATTAATTGT ATTAACTTGC ACTTACAGTT TCTTTTGATG TCAAAAGTGC TCCAATTTGC 
TCAGCAACAT CTACAACTCT ATTTGAATAA CCCCATTCAT TATCATACCA AGCAATAACT 
TTTACTTTAT TCCCTGACAT GACCATTGTT GATTTTGCAT CAATAATAGC TGAATTTGGA 
rTAGTA-TTAA AATCAACAGA CACTAGTGGT TGATGTTCGA CTTCTATGAT ACCTTCTAAA

CCTGCATTTT CAAAAGCTTG GTTTACTTCT TCTGCAGTTA CTTCTTTTrC TAAATCAACA 
ACTAAATCAA CGAGCGATAC ATTCnTGTT GGTACACGTA ATGCCATGCC GTGTAArTTA 
CCTTCTAATT CTGGTAATAC TTCTTTrAAA GCmCGCCG CACCAGTAGA AGTAGGAATA 
ATGCTTTCAT TACATGAACG TGCACGTCTT AAATCTTTAT GTGGATTATC AATATTTTTT 
TGGTCATTTG TAATAGCGTC AACAGTAGTC ATTAAACCAT TAACTATTCC AAACTGATTA 
TTTAAAACrr TTGCAACTGG ACCAATGCAA TTAGTAGTAC ATGAAGCATT ACTAAAAATG 
TCAAATGCTT CTATATCTAA TTGGTTATCA TTrACGCCTT TAACTACCAT TTGAACATGT 
CCACCTTTTG nAGGACCAGT TAACAAAAsT TTTtTOGCAC CTGCTTTAAT ATGTGCGATG 

GCTTTATCAC CATGaTTAAA TTTACcAGTT GCATCTATAG CAATATCGAT ATCTAATTCT 
TTCCATGGCA AGTTTTCAGG ATTGCGATCA GCAACCAATT TAATTTTATG ATCT 
(2) INFORMATION FOR SEQ ID NO: 442:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3326 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear
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CAAGGTACCG GTTTAGGTTT GTTCATTTGT AAAATGATTA TCGAAGAGCA TGGTGGTTCC 6 0 

ATAGATGTTA AAAGCGAATT AGGGAAAGGC ACAACATTTA TTATTAAACT ACCAAAACCA 12 0 

GAATAAAATT GAATATAGTT ATTTCAGAAC GCATGTTATT GATTAGAGAC TCTAATTTAT 180 

AGCATGCGTT TTTTGATTGA TGTGGGGAAT TTTGTATGTG GATTAGAACT TAGGGTTTTT 24 0 

GCGAATATCA ACTATTAAAT ATATTACTAA TTTATACAAA AATATAAAGT TTGATAAAGT 3 00 

TATTTATTTG ATTATAAAAA TAGGGTAAAA TATAGATATA TTGTATTAAT TAAATTATTC 360 

GAGGTGTCAT ATGAAAAAAT TCATTGGATC AGTTTTAGCT ACGACATTAA TTTTAGGGGG 42 0 

ATGTTCCACG ATGGAAAATG AATCAAAAAA AGACACAAAA ACAGAAACAA AATCTGTACC 4 80 

AGAAGAAATG GAAGCTTCAA AATATGTAGG CCAAGGCTTC CAACCGCCTG CAGAAAAAAA 54 0 

TGCGATTGAA TTTGCGAAGA AGCATCGTAA AGAATTTGAA AAAGTAGGTG AACAATTCTT 6 00 

TAAAGATAAC TTTGGACTAA AAGTTAAAGC TACAAATGTT GTAGGTAAAG ATGATGGTGT 6 60 

AGAAGTTTAT GTGCATTGTG AAGATCATGG CATTGTATTT AATGCAAGTC TACCTTTGTA 720 

CAAAGATGCC ATCCATCAAA AAGGATCAAT GCGCAGTAAT GACAATGGTg ATGATATGAG 78 0 

TATGATGGTG GGTACAGTGC TGAGTGGCTT TGAATATCGA GCGCAAAAAG AAAAGTATGA 840 

TAACTTATAT AAATTCTTCA AAGAAAATGA AAAGAAATAT CAATATACAG GCTTTACAAA 900 

AGAGGCAATT AACAAGACAC AAAATGTCGG ATATAAAAAT GAATATTTTT ATATTACATA 960 

CTCTTCTAGA AGTTTAAAAG AATATCGAAA GTATTATGAA CCACTGATTC GAAAAAATGA 102 0 

TAAAGAATTT AAAGAAGGAA TGGAACGAGC AAGAAAAGAA GTGAATTACG CTGCAAATAC 1080 

AGATGCTGTT GCTACACTTT TTTCTACTAA GAAAAACTTT ACTAAAGACA ATACAGTAGA 114 0 

TGATGTAATC GAACTAAGTG ATAAATTATA TAATTTAAAA AATAAACCAG ATAAATCTAC 12 00 

AATCACAATA CAAATAGGGA AACCCACTAT TAATACTAAG AAAGCCTTTT ATGATGATAA 12 60 

TCGTCCAATA GAATATGGGG TGCACAGTAA AGATGAATAA AATTAATGAT AGGGATTTAA 13 2 0 

CAGAATTAAG TAGTTACTGG GTTTATCAAA ATATTGATAT AAAAAAAGAA TTTAAAGTTA 13 8 0 

ATGGAAAAAG GTTTAAACAA GTAGACAGTT ATAATGATGA TAAGAATAGT AATTTGAATG 144 0 

GTGCTGCTGA TATTAAAATA TATGAGTTAT TAGATGATAA AAGTAAACCA ACTGGTCAAC 150 0 

AGACAATAAT TTATCAAGGA ACATCTAATG AGGCAATTAA TCCAAATAAT CCATTAAAAT 15 60 

CATCGGGGTT TGGAGATGAT TGGCTCCAAA ATGCTAAATT AATGAATAAT GATAATGAAA 16 20 

GCACAGATTA TTTAAAGCAA ACAGATCAAT TATCAAATCA ATATAAAATA AAGTTAGAAG 16 8 0 

ATGCAGATAG ATTATCAAAT AGTGATTTTT TAAAAAAATA TAGAATGGAA TCAAGTAACT 174 0 
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ATCAAGGAGC GAAACATCCG AATCAAAAAG TTGTTGCTAC TGACTCAGCA ATGATTCCTT 
ATGCTGCTTG GCAGAAATTT GCTAGACCAC GCTITGaTAA TATGATTAGT TTTAATAGTA 
CCAACGATTT ATTAACATGG TTACAAGATC CATTCaTCAA AGATATGCCA GGAAAACGCG 
TTAACATTAA TGATGGTGTG CCCAGGTTAG ATACTTTAAT AGACAGCCAT GTAGGITATA 
AAAGGAAGTT AAATAGAAAA GATAACACAT ACX3ATACTGT ACCACTAATC AAAATAAAGT 
CGGTAAAAGA TACAGAAATT AAAAATGGAA AAAAAGTAAA AAAGACTATT AACATAACAT 
TAGATATGGA TCGGCGAATT CCAATAAATG TTTGGACAGG AGATTCGATT GCACGTTCTG 
GAAGAGGAAC TTTAATTAAA CTTAAmAG AAAATCTTGA TGCGTTGAGT AAACTGATTA 
CTGGTGAAAC AAGTGGTATG TTAGCAGAAT GCGTAATCTT TTTAAATGAA AGTTTTAACA 
TCTCAGAAAA TGAAAATAAA AATTTTGCAG ATAGAAAGAa ACAATTATCA GAAGGATTTA 
AGGATAAGAT TAACTTATTT CAGTTAGAAG AAATGGAAAG AACTTTAATr AGTAAAATAA 
ACTCACTTGA AGAAGTTGCA GATGAAACAA TAGAAAGTAT TAGTGCTGTT AAACACT'TAT 
TACCTGATTT TGCATTGGAT GCATTAAAAG AAAGAATTAA TGAGTTGTrr AAAGGTATAA 
AATCTTTTAT AGAAAAAGTG TATGATAGTA TAGATAATGA AATTTTAGAA ATTTTCAAAA 
ATATAGATCA CGACTTCAGA GATGGAGTAT CTGAAGAAAT GATGAAACAT TTGAAAGTAG 
TGAAACAGAA TATAGACCAA ATAAAAAATC AAAATGATAT TTATGGTAGG CAAATTGCAG 



ATATTAGAAG TATTATGAAA CAACAAGATG CAACAATnT AGATGGAAAT TTTCAAATTA 
ATTGTAGCGG CGAAAATATG GTACAGGGTC TAGTTATACC TTCTAATTAT TTAGGAAGAA 
AAATGAAAAT ATTAAAAGAC CATATCGATG ATGGTATTAA AAAAATAGCA GACTATGTTC 
AAGGTATATA TGATGAATAT GCATCGAAAA TTGTCGATGT AATAAAATAT TTGATTAATA 
CAATTCCCAA AATACGTAAG AATTTAAGAC ATGCAAITGA AATGTTAAAT GTAAAAAAGA 

AAGAArrrrT gtccctgatt cctaatgtaa cttgtaatta tattaaaact aaattagaag 

AATTAGATAA TACTTTAGGC AAATGGGAGC CTTTTCrTAA TGATTTAAAA GCAGTGTCAC 

caattttaga taaccattta gatgatattg ttaagaacat gaagcctttg attgtacaaa 

TGAVATWTGA ACCATCACAT TATGaCGATA TGTTTAATTC aAGAAAAGCT TTAACGcCAG 

TGTTCTCAAG CGTTTTATAA AGGTTG 

(2) INFORMATION FOR SEQ ID NO: 443: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 5301 base pairs 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 443: 

ACCTATAGCT GGCCCGTCAA GTCCTGAATC ATATGTCGCT ATAAACCCAG TTGCATGCCC 60 

AGCAATCTCG GTTTCAATTT CAAAATCATG CTCTTTCAAA CGATCTATTA AAGTTCGAGA 12 0 

CGCAAATATT TCTTCATTAC CAAGTTCAGG ACGTTCATGA ATTCTATGAC TGATTTCGAT 180 

ATAACTATAT TTATTTGTTT CTATATAATC GAGAATTTGT TGTTTTTCAC TCATTTTTTA 24 0 

CTATCTCCCT TTTACCCTAC ACTCATTTTA ATCATCCACA ATATTTTGTT CTTTCAAAAT 3 00 

GAATTATTAC TTATTCTATC GGTTTTATCT CATGATGTCA TCTAGTTTTT CTTTATTTAC 3 60 

AAAATTTTCT AATAACTAAA aGCCTTTCAT AAATTTATAA AACAGTTTCA AATTGTAAAA 420 

CATAAGCCCT ATTGTTACAA TTTAAGTATT GTTAGAAACA CAACTTACAA CAAACTTAAG 4 80 

TTTTATAATG CAACAAATCA TAAGCGTTTT ATAGTTTTGA GTAGTTAATA GGAGGAAATT 54 0 

CAAAATGACA AAAATGAATG TTGAAAGTTT cAATTTAGAT CaTAcTAAGG TGGTTGCCCC 6 00 

ATTTATTAGA TTAGCGGGAA CGATGGAAGG ATTAAACGGA GATGTCATTC ACAAATACGA 6 60 

CATTCGTTTC AAACAACCAA ACAAAGAACA TATGGATATG cCCGGACTAC ATTCATTAGA 7 20 

ACATTTAATG GCTGAAAATA TTAGAAATCA TAGTGACAAA GTTGTTGATT TAAGTCCTAT 780 

GGGTTGCCAA ACTGGTTTCT ATGTATCATT TATTAATCAT GATAATTATG ATGATGTATT 84 0 

AAATATTGTT GAAGCAACTT TAAATGATGT GCTAAATGCT ACTGAAGTGC CTGCTTGTAA 900 

TGAAGTACAA TGTGGCTGGG CAGCAAGTCA TTCATTAGAA GGTGCTAAAA CTATCGCTCA 96 0 

AGCATTTCTA GACAAACGAA ACGAATGGCA TGATGTTTTC GGTACAGGAA AATAAATCTT 102 0 

AGTCAATCAA GTTAATCAGA AAaGCAGTCG AACAATGATT TTACAATCGC CATTGTCCAA 108 0 

CTGCTTTTTA TTATGCTTCA AAGTCAAAAA ATCGAACAAA TGAAAAAGTA AAATCTTTAA 114 0 

CATTTGTCCG ATTTATTTGA GAACCACTAT AATTTCTTAA TTAGTCCCAT TAACACGAAC 120 0 

TGCATAGGTA ACCTTAAATA TAGTTGCCAT GTTGGCAATT GTTTATCTCC TAAAGGTAAC 1260 

TTTTTAACTG CCATATAGAT ATTAGCTGGG AATACAGCTA GCAAGAATAG ATTGATTGTA 1320 

TTTTTCAAGC ATTGAGATGG TCTTTTAATT AAAAGTGCAA GTCCAAATAA TATCTCAAAG 1380 

ATTCCTGTAA CAAGAACCGC TGTTTTTCGA AGTGGCAAAC ATTTCGGTAT GATATTTCTA 144 0 

AATTGTCGTT CTCGTGTAAA ATGCAATACA CCTATTACAC TAAAACCTAT TCCTAATAAA 1500 

TATCTTAGTA TGTTCAATCA GCTTCAACTC CTATTCTGTA ATGATTTTAT GAATTAATGT 156 0 

AGGCGATACA ACATGaTCAG CAATTGTTAT GCTTGAATCT AATTTTTTAA CAACATCGTC 162 0 
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AATTTTTTTA TTTA/^CAA TACCAACCGC TAAATCAATA TCATCCTCTT TTGTrAAACG

TCCCGCTCCT AACATCATCG AAGCGACACC TATATCGriA GAGACTAATT CAGTCACATA

ACCTGATTTT TTAGCTTTAT ArrCAATTTG ATATTGAGCT TGTGGCAAAC GCTCTGGATG

GTCAATAACA GTTTCGTCGC CACCTTGGTT TTTAATAAAT GTrTTGAATT TTTCTAATGC

TGCACCTGAA TTAATTGCCT CAATTAGCAA CXXTTCTCGCT TCTTCAA.GCG rTTCAGCTTT

GTTTGCAAGT ACAACCATTT GAGAACCTAA TGTTAATACA AGTTCTGrTA AATCTTTCGG

ACCTTGTCCT TTCAACGTAT CAATTGCTTC rTGTAACTCA AGCGCATTGC CAATCGCACG

TCCAAGTGGC TGATTCATAT CAGAAATAAT CGCCATCGTA TTACGTCCCA CATTATTACC

AATACGTACC ATTGCGTGCG CTAATGCTTC AGCATCTTCT AATGTTTTCA TAAATGCACC

GCTACCAGTT TTTACATCTA ATACAATTGC ATCTCCACCA GCAGCAATCT TTTTACTCAT

AATTGAAGAG GCAATTAATG GTATTGAATT GACAGTACCA GTAACATCCC TTAAGGCATA

TAATTTTTTG TCTGCAGGAG TTAAATTTCC TGATTGTCCT ACAACTGCCA CTrrATTTTC

ATTAACCAAT TTCACAAATG TTGCrrCATC TATTTCAACA TGAAAACCAT CAATTGCTTC

TAATTTATCA ATCGTACCAC CTGTATGACC TAATCCACGC CCACTCATTT TTGCAACAGG

AACATCTACA GCTGCTACTA ATGGTGCTAA AACCAATGTA GTTGTATCTC CTACACCACC

TGTTGAGTGC TTATCTACTT TGACACcrrr aatatcactc aaatctatca tatcaccaga

ATTAACCATA GCCATCGTTA ATGCTCCACG CTCATCATCA nCATATCTT GGAAATAAAT

CGCCATTGCT AAACTTGATG cttggtaatc aggaatatcc cctttaacat agccgccaat

AAAGAAATTA ATTTCTTCCG TTGTTAGTGT ATGACCGTCA CGCTmTCT CAATAATGTC

TATCATTCTC ATTTTTATCA TCCmTCTT AAAAAOCrTA GGACAAAGCA TCTGCGCTTT

CTCTAGTCCA TTTTTAAAAG CACAAGCGAA AATTATTATA GCAAGCTATC GATTTCATTT

TTAATATCAC AATTTCAATG CGATGTTATT ATTCTTAAAT AGATrGGTTA TAACGTTAAA

GTCCCTATTA AATTATCTTA GAATCATCAT GGCATTTATG ATGTCTTAAA GCTGATATCG

ACATACTTAT ATATGGTTAC GATGTCCCAT GCTTACATAT TTTTATAAAA TTAGTAATCT

GAATCTGCTT CTAAACCTTG CATAATTTCa ACGCCTGCGC TCGCACCAAT ACGTGTCGCA

CCTGCTTCAA CCATTTTATT GAAATCTTCT AAATTACGTA CGCCACCTGA TGCTTTTACT

TCTATATCAG CACCTACTGT ATCTTTCATT AATTTAACGT CTTCTGCAGT CGCACCGCCA

CCTGCAAAAC CTGTTGAAGT TTTAACGAAG TCCGCACCAG CCGCTrTTGT TAATTCACTC
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TTTAATGCGC CGATGTTGAT GACCATGTCA ATTTCATCTG CACCATTTTG AATCGCATCT 3540

TCTGTTTCAA ATGCTTTCGT CGCAGTTGTC GATGCACCTA ATGGGAATCC TATTACCGTA 3600

CAAACTAACA CCTCTGAATC AGCTAGTCGC TCTGCTGCAT ATTTAACATG CGTTGGATTC 3660

ACACATACAG ATTTAAAATG GTATGCTTTC GCTTCATCGA TGATTTGATC GATTTGCGTA 3720

CGTGTTGACT CAGGCTTCAA TAAAGTGTGA TCAATCAATT TTGCACTATT CATTTTCTAT 3780

CTCCTCCTTT ATGGTTGATT ATAAAAATAC GGTTGTAAAT TAGTTGATTG AGCGTCAGGT 3840

TCATTTAAAT ATCAGGTTAG ATGTTCGCTT TTTATGTAAC CGCATACATA TACTATTACA 3900

TTAATTCATT TCCCATAAAC AAACAATACA ATTGAACGTG ATATCTTCAT TATGAACGAT 3960

GACTTGACAA CAAGCTAATC AGGATTATAT TTTTATAATT CTTTAATTCT ATAGTACAAA 4020

AATTCGCAAA AAAGGGAAAC AAATGTTATC TTAAAATTAT TAATGAATAT TAAGGAGAAG 4080

ATAACAAATG ACAAAAGGTA CACCACATAT TCAACCAAAT GGAGTAAAAA TTGCTAAAAC 4140

AGTATTAATG CCTGGCGATC CGCTACGTGC AAAATATATT GCTGATAATT TTTTAGAAAA 4200

TGTTGAACAA TTTAACGATG TACGTAACAT GTTTGGTTAC ACTGGTACAT ATAAAGGTAA 4260

AGAAGTTTCT GTAATGGGTT CTGGTATGGG TATTCCAAGT ATTGGTATTT ACTCATATGA 4320

GTTATACAAC TTCTTTGATG TAGATACAAT CATTCGTATC GGTTCTTGTG GCGCATTACA 4380

AGAAAATGTT AACTTATACG ATGTTATTAT TGCACAAGCT GCATCAACTA ATTCAAATTA 4440

TGTAGATCAA TACAATATTC CAGGTCATTT CGCGCCTATC GCTGACTTCG AGTTAGTAAC 4500

TAAAGCTAAA AATGTCGCTG ACCAAATCGG TGCTACTACA CACGTAGGTA ACGTATTATC 4560

TTCTGATACA TTTTACAATG CCGATCCAAC ATTCAATGAT GCTTGGAAAA AAATGGGTAT 4620

TTTAGGTATC GAAATGGAAT CAGCTGGTTT ATATTTAAAT GCGATTCATG CTGGTAAAAA 4680

AGCACTTGGT ATTTTCACAG TAAGTGATCA TATTTTACXJT GACGAAGCTA CTACACCTGA 4740

AGAACGTCAA AATTCATTTA CACAAATGAT GGAAATCGCT TTAGAAATCG CAGAGTAACT 4800

TATTTAAATT GACTTTAATT GCTCTTTAAC AATGCGATTA AACTCAAAAA GCCAACACAT 4860

TCTGGGCGTA TCCCCATTTA TGTGTTGGCT TTTATTTATA TTATTACTTA TCTGTAGATT 4920

AGCTTAAGTA AGATTTAAAC ATCCAATTAT GTTTATCTAC TGATGTTTGC ATACCTATAA 4980

ACATATCTTC TGATACATCA TCGCCAGCAT TACCAGCAAT TTCGATTGCG TTTTCTAATT 5040

GTTTTGAGAT ATTTGTGAAG TCTTGTGATA ATTCTTCAAC CATTTGTTCT GCAGAGTAAC 5100

CTTTCGCAGC TTCTTTAACA ATTGATTGCT CTAAGCATTC AGTTAATGTA CCTACAGGGT 5160

TTCCTCCTAC CGCTAAAATT CTTTCAGCTA ATTCGTCTAC ATATTGGCTT GCTTCATTAT 5220
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AATTGTGTAG CTTTGTGTAA G 5301

(2) INFORMATION FOR SEQ ID NO: 444:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: X1466 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 444: 

GAAGATGAAA GCTATATTGA AGATGATGTT ACTAAAAAGG AAGCTATTTT AAGTATGCAA 60 

ATTCCTAAAG GTTTCTCTCA AAAATTAAAA GAGAACCGTT TAAAAGAAAC GATACAGTTA 120 

TATGGTAGAG ATGACTTTAT AGGTGGTATT GCTGTAGAAA TTGTTAGTAG TTCATTATAT 180 

GAGCAGCAAA TTCCTAACAT TATTTATGAA CACCTTGAGG ATATGAAACA GCATCAATCC 240 

ATCGATGCTA TCAACAAGTC CTATCATAAA CATACACCTG AATCTAAAAT CAAATTTGTG 300 

TCGCTTACTA AACAAGCACA ACACTCTATT TCAATTAGCT TAATCTTTGC GGTGATTTTA 360 

TTTGTTAGCG CTGTTCAAGT AGTACTTCAT TATCGTTTAA ACCAACAAGC AGCATTGCAA 420 

CGATTATCAC AATATCATTT AAGCCGnTC AAACTATATA GTACTTATGT AATGACACAT 480 

ACGA-mrGT TATTGTTGGT ACTATTGGCA GTTAGTCTAT ATTTGTCTCA ACCACTCAGC 540 

TTAATATTTT ACTTAAAATC ACTGTTACTT ATATTGAlTr ATGAGATAGG TATCGTCTTT 600 

ATCTTATTCC ATATTCAAAC AATAAGTCAT CGATTATTCA TGACATTTAT ATATGCACTT 660 

GCTATGGGTA TCGTATACTT GArTATTTTC ATGTAAAGGA GCGTAACTGA TGATAGAAAT 720 

TAATAACCTT TCAAAGCGTT ACCGTAACAA ACAGATTTTC AATCATTTAA CTATGTCCTT 780 

TGATAGTAAT CGTTTAACCG TATTACTTCG TGATAATGGT GCTGGAAAAT CAACATTACT 840 

TCGTATGATT GCTGGTATTG AAAAAGCTAA TGATGGAACT ATCAACTATT TCGGCGAAAA 900 

ATGGAATCAA AGACaAATAC aAAATCACAT CGGTTATGTG CCACAAGACA TTGCGTTATT 960 

TGAACACATG ACAGKJGCTG AAAACATTAA ATTTTTTAAA TCACTTTGTA AAAATCCAAT 1020 

TAACGATACA ACTATCAACG AATATTTACA GCAATTAAAC TTTGATGATA CGTCTGCCAA 1080 

AGTATCTACA TTGTCCGGTG GGAATAAACG TAAAATTAAT ATATTAGTAG GTTTACTAGG 1140

TCAACCTCGA ATTCTCATTT TAGATGAACC GACAGTTGGT ATTGATTTAA AATCTAGACA 1200

TGACATCCAC CAACTACTTA ACATCATGAA ATCTAAATGT TTAATTATAT TAACTACCCA 1260
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CCCAAGCTGC GTATGATATC GCAACTTGGG 
ACGTTGGGTC TACTGCATAT TGATTACCGA 
GAGGCGCTGT TGAATTACCC GTACTACCTG
TATCACCAGC TGAAACAGTT AAACGATTAT 
TCGCTTCTTT AATCGTTACT TGATTGCCGC 


CTGTACCATC AGTTAATGAG TAAACTGGTG 
AATGCGCACC ACCACCGTGA TATTGTCCAT 

AGCTTGCGTC TTTCGCATGA CCACTAGcTG
TAGATGTGCC ATTAGGATAA TTGACCTTTC 
TGTTGTTACT ATGTGAATAA TAGCTCGCGT 

GATTAATATG CTGTTGGCTT TGACTCGCTG
GATTCACATA TGTTTGGCCG CTTCCATTGG 
AATAATGCGT ATGTCCTTGA GCATCTACGA 
TTGCTTGATT CCAGTTACCA TCAGGTGTGT 
AATAAGTACC ATAAGATACG TCTTGTGATT 
TGTTTGTCGT TTCTGCTGCA TCTGCTTGAT 


TGCAATCGCT GCTGCTGTTA ATTTTTTCAT 
TAGTTTTCAA TTAAACTGTA CTGCACATAC 
TAATTCTCAT TTTCCAACTA TTTAATATTC
TCATTACAAA ATAGCCATAC ATTGATATTA 
CTTTTATAAA TTTTCAAAAT TAACAACTAA 
ATGCATGATA GTAATCATCC ATCAAATCAT 
GATTGTCATT CAACTTTCTA ATAGTGATAT 

TTCTATTAAT TAAATCATCT GTGAGCTTTA 


TAATATATAA CGTTCTTGTA GTTAAATTTA 

CAGCATCATG CGGACGTACA ATTCCAATCG 

CATTCATTAA ATCCATTTCT TTATCATTGT 


TTCTCACAAA GCGCTCAGTT GAAGTAAATC 
TGCCTAAAGG TTCAATCGTT ACACCTTCCA 




ATTTTCTGTA TTATCTACTT TGCAAGTATG 144 0 

TGCCACCAGA CATACGTTGG AAGTGTACGT 150 0 

AATATGCAAT TTGGTCACCA GCTTTGACTT 1560 

TATGCATATA CCATTGGTAG TTATTACTGT 1620 

CACCATAGTT ACTCCAACCT GCTTGTACTA 16 8 0 

AATTTTCAGG CATTGCATAG TCGACACCGT 174 0 

ATGGTTGTAG TTGTTTACGA CTTGTTAACC 1800 

TCGCTTTACT TGCTGATCCA CCATTTTGAT 1860 

CATTACCATC ATGGCTGTTA TACGCTTGGT 192 0 

CTGGACCTAC ATTTGATTGA TAACCATATT 1980 

TGTAGTCATT GTTATCTCCT GCTGTTGCTG 2 04 0 

CATTTGCATT TTTTGGATAA CAGTTATAAA 210 0 

ATGTATAGCT ATATTCTTTA TTATCAAACA 2160 

GATGATAATC CCCATTAGAA TCAATTGTAT 2 22 0 

GTGTTGACAT TTGTGTATGT GCTTGTTGGG 228 0 

GCGCCATTGT AAATGTAGCG AAsCCATCGT 234 0 

GTATAAAACA TCCTCCATTA AAGTTAAAGT 24 00 

TAAAAGAATT AGACAACTGA GTAAAGGATT 24 6 0 

CCGAAATGTT TTACTAAACT CATTACATTG 2520 

AAATGACATC TCTCACTGCA TTCGTTTAAC 2 580 

TCGTTCGTCC ATGTTCGCGA TTCAACGCTA 264 0 

ATCAACCAAA TTCCATTATC AATCGCTATT 27 0 0 

GCTTCTCAGG CTTAAAAATC GTCATATCTT 27 6 0 

ATGCTACTAA TTCATTGCTG CCATAATACT 282 0 

TTACGGTCTG ATACATCGTA TAGTGATTGT 28 8 0 

GTATATTTAC CGCATCTAAT AAATAAAATG 2 94 0 

TTTGAGCAAT GTTTGCTTTC ATAAATGCCA 30 0 0 

CACCTGGCAA TCCAAATGTA CCTGCTTCAT 306 0 

ATAAATTTGC TGTTGCTGGA TAAGGAGAAA 312 0 
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CACCAATAGG ATTATCTTTT ATAACCACTT CACCCTCTTT AAATGAAACT TCGACTGTAT 324 0 

GTCCAGTTGC ATCGGAAACA TGATAATGCA ATGGCOGAAC TTCACCGATG TCATTTAAAT 3 300 

ATACAGCTAC AACATGTATT TGGGATGCTT GTTGTTTCAT ATCTTCAATG CTTGTTGTAT 3360

ATCCCAAAAT CCATGTCACA ATTTCATTTT GCGTAATATT CATCGCGTCC GCTTTGTGTG 3420 

TTGATCCATA TGAACTATAA CCTCGGAAAT ATTGTGTTGA AATGGCAACG CcATGTtCAT 34 80 


TAACACCATC ACCATAAATA AAACCTTCCA TATCTGTTCC TGTGCCAATA AAGCCATATT 3 54 0 

GCGTTTGGCC TGTCGTGCCA GTGCAAGATT TCCAACGATA ATTTCTAGGC GTCACTGCTG 3 600 

GCGAACCATC TAATGGATAA TCATAATCCA TCGTGCGTCC AAGAAGTACT TGATTATTTA 366 0 


AAGTTTGTAT TGTGAATCCT GTGCACATTG TTCTCACTCC TCTGTACCTT CATTTACTTT 372 0 

AATCACTTTC AAATAAAGCT GTTTCACTTA AACATACTAT AAAAAATCAA TTATACAAGC 3 780 

AATTAATTGA TATTCATTCT CAATAACTGT GGTATGATAT GTAAGGAAAT CATGACTTAT 3840

GTGTGAGTGA ACGATCATCT ATACATCCGT TCACTTCATC TCATGACTTT CTATATTTAA 3 900 

TTTTTACAAG GAGTGACATC TGTGAATAAC ACACAATCTT CACCACGCAG TAATATTATT 3 960 

ATTGCGATTA TGTTGTCTGC ATTAACATAT TGGTTGTTTG CACAATCATT TATTAATATA 4 020 

GGACCTCTCG TTGGTCJkAAC ATATCAAACC TCTCCTGCCG TGTTAAATTT ATCTATTAGT 4 080 

TTAACTTCCT TCGCCACAGG TATCTTCATG GTGGCTGCAG GTGATATTGC TGATAAAATA 414 0 


GGACAACTGA GAATGACATA CATGGGTCTC ATAATCAGTA TGTTTGCATC TCTTCTATTA 42 00 

ATTATATCGG ACATCACTGC ACTGCTCATC ATCGGTAGAA TTTTACAAGG TCTATCAGCA 426 0 

GCTATCTTGT TACCTTCAAC AGTTGGCGTG TTAAATAATC AATTTAAAGG AGAACATTTA 4320 


AGACGAGCGA TTAGTTATCT AATGATTAGT ACTGTTGGTG GCATCGGCCT AGCTGGTGTT 4 380 

ATCGGCGGTT TAATTGCCTC AAATTTCGGA TGGCAAACGA ATTTCATCAT TAGTATAGTC 444 0 

ATTGCTTTCA TTGCCATATT GCTTCTAAAA GGCACACCTG AAAAAGTAAG TCAACATAGC 4500

CACCGTCATC CATTCGATTA CAAAGGTATG TCGATTTTCG CTGTTATGAT TGGTAGCTTT 4 560 

ACATTATTGT TAACACAAGG ATTCGAACAA GGTTGGTTTA GTACATTTTC AATCATTTGT 4 62 0 

CTGAGCATTT TTATCaTCAc TACGTTGATA TTCATCATCA TCGAACGTCG ACATGAAGTA 4680

CCTTTTATTG ATTTCTCAGT ATTACGCAAC CGTCCGTTCA TTGGTGCATT TTTAAATAAC 4 74 0 

TTTGTTTTAA ATAGCGGTCT AGGCGTAACA GTGGTCTTTT TCATATATGC TCAAACACAC 4 8 00 


CTTGGTTTAt CAgcTGCGCA ATCTGGACTT GTTACATTGC CATATGCCAT TGTGGCAGTT 4 860 

• ^^j^^^^rj^r^^^ ^^^^^^^^^^^ 4920 
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TCACAATATG TCATTGCAGT TATCATTGGT TTCGTCATAT GTGCGATAGG TAATGGTTTA 5 04 0 

GTCGCAACAC CTGGACTTAC GATTGCAATT TTCAGTATGC CTAATGAAAA AGTTGGTTTA 510 0 

GCTACAGGAT TATATAAAAT GAGTGGTACA TTAGGTGGCT CCTTTGGTAT AGCACTAAGT 5160

ACTACAGTTT TCAGTATGTT ACAACTAAAC TATGCACCAA GTGTAGCTGC AACCGTAACA 522 0 

TTTATAGTCA GCATTGTATT GATGATCCTT GGCTCATTGT CTGCATACAT GATCATTCCA 5280 


AAAACAGTTA AATCTTAAAT ATAATAGAAG AATTATGTTT CGAAATATCT TTATCACTTT 534 0 

AAAATGATAT ACAAGAAATC CAAGAAAAAT AAGCGAACTG AATAAATAAA GATTCAATTA 54 0 0 

ACGCATCAGT ATTAGGATTC ACTCTAAAAC GATTAATAGT TTTATAAGAA GGTGTTTGAT 54 6 0 

CTTGAGCTAA CCACATCATT CGAATACTGT CATGAAGTAA TTTCTCTATT CTACGACCAG 552 0 

AAAATACAGA TTGAGTATAT GCATATAAGA TGATTTTTAA CATCATCTTT GGATGATAGG 558 0 

ATGTTGCGcC ACGATGATGT CTGAATTCAT CGAATTTGCT ATCAGGTATC GTTTCAACAA 5640

TTTCATTAAC ATGTCGCGAA ATATCATTTT GAGGAATTCT AACAGAAGTT TTTATTGGTA 570 0 

GTGTAAGTTG GGCAAAGTGT CTTATTTTTT TAAAGTATTT CAAAGTAAAA TTACATGTTA 576 0 

ATACGTAGTA TTAATGGCGA GACTCCTGAG GGAGCAGTGC CAGTCGAAGA CCGAGGCTGA 582 0 

GACGGCACCC TAGGAAAGCG AAGCCATTCA ATACGAAGTA TTGTATAAAT AGAGAACAGC 5880 

AGTAAGATAT TTTCTAATTG AAAATTATCT TACTGCTGTT TTTTAGGGAT TTATGTCCCA 594 0 


GCCTCTTACT CTAATTATAT TCACTATCAA TTAGACAAAA TGGCCATTTT CAAATATCAC 6000 

GCGTTGTTTC TGACCTTGAA TATATTTATT ATAATTCTCT TTTTGAAAAT CAGTTAACAT 6060 

TAATTTAGAT GTACCGTATT TTAACACTTT TTGCATTGTT TCTATTCTCA TTTTTCTAAA 6120 

TAACCATCCA TCTTTTAACA CAATACGATT AACAGCATCA TATGATAATT CTACTGTTTC 6180 

TTTAATTTCA AATGTCTTGA ATGAAATAAT CGTGCACATT AAAAACGTAT CACCAAAGTA 624 0 

ATAAACATCT AAATCATCAC GTTTATGTTG TCCAACTAAC AAACGACCAT ATTCGAACTC 6300

TTTTTCTGGA TATTTCAATT CTAAAAAACT AATAATCTCT TCTTCTTTTA ATTTGAATTG 636 0 

CATTTAAAAA CATCCTCTCT TAAGTTTTAA CAAGCCTTAA TTAAAAATTT TTTCAATCAC 6420 


ATAGTTCAAT ATACATCATT TCGTTATGTT TTTTAATACT TTGTTCAAAA ACAAATATTT 64 80 

TATTCTTTAA AATAATGACT TTTGTATTTT TAATCACAAT AAACATTTTA AAATTCTTGT 6 54 0 

TATCATAATC ATTAAAAGGT ATTAACCTTA ATAATATATT CTCTCGTCTC AACCTTAATC 6 60 0 


GTATACTTCA GACGTCTGTT TGTAGACAAT AAAAGTCATT CACGTCTTCA TATGTCATCA 666 0 

AATGTTTATC ATGATATGAT GAATATAATA ATCGGGTATA TAACTGTATG ATTAATTACA 6 720 
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TGGTGTATCA 
AACAACGATA 
TGCAAAACAA 
AAATCCGTTT 
TTTATGCTTT 
TCGAGGTATT 
CTATCTAAAG 
TACAGATTTT 
AGAACTCGGT 
GACTCGTGTC 
CGTCCATACT 
ATCTGCCACT 
AATTGAACAT 
TTATGCAGCG 
CAAAACATCT 
GATTGAATTA 
ACTACGTATC 
AGGGTTTTTA 
ATATGATTTA 
ACGATGGCAC 
TGTTGCATTA 
TTCAAAGTTT 
CTTCGCTATT 
ATTCGCCTTA 
CGAATTAGTC 
ATCTTTATGG 
ATTTACAGCT 
AGAAGCAAGT 



GTTACAACTG 
AAAAACGTAC 
TTGCGCGGCA 
TTCTCAGCAC 
TTAACATCTA 
GACGGATTAA 
AAACATCATG 
GTTCGGACAA 
CACAACCATA 
GCTGGATTTG 
GAATTATCTA 
GCAATCTTCG 
GGCATCAGTA 
TACGTCTCGC 
TTAACCTTAT 
CCAACGACTT 
TTCCGAAATA 
ATCATAATTA 
AACGTTTTCA 
TGGTAAAACG 
CTTATCGGGT 
GGTGGTAAAC 
TTATTAGCCG 
TTATCAGGTG 
GGCTCATCTC 
GGTGTCTTTG 
TTAGTCGTTA 
AACGCCAAAA 



TGTCACATAT 
ATGCTGTTTC 
GTAAAATTCA 
TGATGCAAAG 
CAGCAACTGA 
TTATCGCACA 
TACCTTATGT 
ATGAATATCA 
TGATAATTGT 
TCGATACTTT 
AGCGCGGTGG 
CTATTAACGA 
TCCCGAAAGA 
CACCTTTAAC 
TACTTCAACG 
TAAAAATTCG 
TACTCATCAT 
TTTTACTAAG 
GTTTTTATGA 
TTTTACTAAA 
TAGGCCCCTT 
CTGTACATCA 
TAGTCACATC 
CAGGATGGGG 
GTGCCATGCC 
CATTAGGAAA 
TTCTAATTGG 
ATTTACGTCG 



TTTAAATCAT 
AGAACGTTTA 
AACTATTGGC 
TATTCATGAC 
TTTGTATGAC 
ATACATATCA 
CGTACTGGAT 
AGGTGGACAA 
TGCACCATAT 
GCGCGCGAAT 
GCTAACCATT 
TGAACTCGCT 
TATCTCATTA 
TACTGTGGCA 
ATTACAGCAC 
TGCAACAACT 
TGTTAGGCCC 
AAATTAAAAT 
AAACGCATGC 
AACAAATCAT 
AATTGGCTGG 
AATTATCGGT 
AAGTGGCTTC 
ATTCGGACAA 
AGTCACAACA 
TTGGCCAGGC 
AGCGCGTATG 
TGCAGTGGTA 



AATGATAGTC 
GGCTATGCCC 
GTCATTTTGC 
CATAAACCAT 
AATATTAAAC 
TCCCCGGACG 
CAAAATGACC 
CTTGCAGCAC 
GACATGATGG 
CAATTGCCAG 
GTTGATGACA 
ATTGGCATTT 
ATAGGTTATG 
CAACCTATAA 
TTAGATAAAT 
GGCTATCATC 
TTAGCGTTGC 
AATAATGTAT 
ATTTTACAAA 
GAGGTGTATA 
GGCTTCTTCC 
GCTACTGTAG 
CCTACTGGAA 
ATCATTACAT 
GCATTCCAAT 
ATTGGTCATA 
ACAGTTTGGA 
CTTCTGTTAA 



GTTTTTCCGC 
CTAATAAACA 
CTAGCTTAAC 
CTGATGTTGA 
ATTTAATTGA 
CCCTAAATAA 
ATCAAGGCTA 
AACATTTAAT 
CGAATATGTC 
AACCACAAAT 
TCATGGTTCA 
TACGAGGACT 
ACGACATTGA 
CTGATATTGG 
CCATTGATAT 
TTTCAAACTA 
TTTAATGCTG 
GAATTTTTAA 
TAAAAATGGT 
ACATGAGCAT 
CAACAGTCGC 
GTACGTTAAT 
CCAATTTGCT 
TTAAAGCGTT 
TATTAGGCGC 
AAATCATTGG 
GTGAACGCAA 
TTGGTGAATT 
TAACTGCCTT 
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7560 
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AGCAGAGAAT CCATTCCGTA ATAAAATTAC GTGGTTACAA ATTATTTCAG GTTTCTTCTT 864 0 

TGCATTTGGT GCTTTAACAT ATCTTATTTC AGCACAACCT AATATGAATG GTTTAGCAAC 8700 


TGGATTTATT CTTTCTCAAA CATCCGTTGT GCTTGCTACA TTAACTGGTA TTTATTTCTT 8760 

AAAACAACAT AAAACGTCAA AAGAAATGGT TATTACAATC ATCGGCTTAG TACTCATTTT 8 820 

AGTAGCCGCT TCTGTTACAG TATTTATAAA ATAAGGAGTG TAGATGTCAT GAAAAAATCA 8880 


GCTGXTTTAA ATGAACATAT TTCAAAAGCA ATCGCGACAA TTGGTCATTT TGATTTATTA 894 0 

ACGATTAATG ACGCTGGCAT GCCAATTCCA AATGATCATC GTCGTATCGA CCTAGCTGTA 9000 

ACTAAAAACT TACCACGCTT TATTGATGTC TTAGCTACAG TGTTAGAAGA AATGGAAATC 9060

CAAAAAATAT ACTTAGCAGA AGAAATAAAA GAACATAACC CTACACAATT GCAACAAATT 9120 

AAACAATTGA TTTCATCGGA AATGGAAATC ATTTTCATTC CTCACGAAGA AATGAAAAGT 918 0 

AACTTAGCTC ACCCATTAAA TAAAGGTAAT ATTCGTACTG GTGAAaCAAC GCCCTACTCT 924 0 

AATATTGCAT TAGAATCGAA TGTTACTTTT TAAAAGTTAT AACTTGAAAG GAGCGTACAC 9 3 00 

ATGACCAACA AAGTTGTTAT TTTAGGTTCA ACGAATGTCG ATCAATTTTT AACAGTTGAA 936 0 


AGATATGCAC AACCAGGCGA AACATTACAT GTTGAAGAAG CACAAAAAGC ATTCGGCGGA 94 2 0 

GGTAAAGGTG CCAACCAGGC TATTGCCACT GCACGCATGC AAGCAGACAC AACATTTATT 94 80 

ACTAAAATTG GCACTGATGG CGTTGCTGAT TTCATCTTAG AAGATTTTAA AGTAGCTCAT 954 0 


ATTGATACAT CATATATTAT CAAAACAGCT GAAGCAAAAA CGGGCCAAGC CTTTATCACT 960 0 

GTGAATGCAG AAGGACAAAA CACCATCTAT GTTTATGGTG GTGCGAATAT GACGATGACA 966 0 

CCTGAAGATG TTATTAACGC AAAAGACGCT ATAATCAATG CAGACTTTGT CGtTGCACAA 9720

TTAGAAGTAC CCATCCCGGC TATTATATCT GCATTTGAAA TTGCCAAGGC ACATGGTGTG 97 8 0 

ACGACAGTAT TAAATCCTGC ACCAGCGAAA GCATTACCTA ATGAATTATT ATCATTAATC 984 0 

GATATTATTG TGCCAAACGA AACAGAAGCC GAATTGTTAT CTGGGATTAA AGTAACTAAT 990 0 

GAACAATCTA TGAAAGACAA TGCCAATTAC TTTTTATCTA TAGGCATTAA GACTGTTTTG 9960 

ATTACGCTAG GTAAGCAAGG TACATATTTT GCTACTAAAA ATCAAAGCCA ACACATCGAA 10020 


GCTTATAAAG TAAATGCGAT TGATACAACT GCTGCAGGCG ACACATTTAT TGGTGCATTT 10 080 

GTCAGTCGCT TAAACAAGTC GCAAGATAAC TTAGCAGATG CTATTGATTT TGGTAATAAA 1014 0 

GCGAGCTCAC TCACTGTACA AAAACACGGC GCGCAAGCAT CTATTCCTCT ACTAGAAGAA 102 0 0 

GTAAATCAAG TTTAAATGAA TCAAACACAG CTATGATATG AAGGTTTAGC ATATAACATG 1026 0 

CAACATTCGT ATATCATGGC TCTGCTTTTT TATCTTTATA AAACATCATC TATTAGAAAT 1032 0 
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TITGTAATCT TTTTAACTrC CAAATTaTCG CATATAAATA TGCTATATTA ATGATAATAA 
TTATCAATTA AAAGGAGGTT ATGCTATGTC TAAAGAAGCT GGTCATACAT TTTTAGCTAA 
ATTAGGAAAA ACTCGTCTAC GCCCCGGTGG TAAAGAAGCA ACAGATTGGT TAATACAACA 
AGGGGCATTT TCACAAGATA AACAAGTGTT AGAAGTGGCA TCTAATATGT GCACAACATC 
TATTTATCTA GCTCATACAT ATCGCTGTCA CATTCAAGGC GTTGATATAA ATAAGAAAGC 
ATTAGAAAAA GCACAGGAAA ACATTTCAGC AGCAOGTCTT GAATCATATA TTCAAGTTCA 
ACAAGCGAAT GCTGTTAAAT TGCCCTTTGA TGACAATCAA TTCGATATCG TTTTAAATGA 
AGCAATGTTA ACAATGTTAC CCATCGCCAT AAAGGAAAAA GCATTACGCG AGTACTACCG 
AGTCTTAAAG CCTGGGGGTA TCTTGrTAAC ACATGATATT GTCATCGTTA ATGAATCACA 
TGCCACACAT GTTGTTAAAT CATTATCTGC AGCAATTAAT GTCAATGTCT CACCGCAGAC 
GAAACTTGGC TGGTTAGATr TATATAATCA AGCTGGTTTr AATCATGTGC ATTATCATAC 
TGGTCCAATG AGITTAATOA CACCAAAAGG TTrAATTTAT GACQAAGGTA TTGriGGAAC 
TATAAAGATT ATCAACAATG CTTTGAAAAA AGAAAATCGA CCAATGTTTT GTAAAATGTT 
TAAAACGATG ACTAAATTGC GTAAAGATAT GAATTATATT ACTTTIGTCG CTAAAAAAGA 
GCACIAAATA TAATGCCACT AACTGTACTT TGTATCTATG rTTGACTATC ACTTTAATTT 
CTTTGTGACA CTAATCATCT ACITAACAAT ATCGTTATCG ITGATTAGTA AGTCATCAAT 



TTTGCTTAAA GACTTTCATA AACACTCAAA CATTAACACT ATACATAGrT AGTCGcATTA 
TrrrTTyCTn AAAATTTTAA CmTCmCGGGr TtGGGAmCrG AAaTGrtAwT TcGCrraAAtT 

TAWTCT 

(2) INFORMATION FOR SEQ ID NO: 445: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2176 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 445:
TTACATAGTT AACACTAGTT AATCTAITAG rTAACATTAG TTAATAATTA GTTAATTTCC 
ATTTGTArrC TCATGTGATA AATTCTAAAA GCATACaATA AATTTAATAT GTAAAAAGAA 
AGGGAATACA CATGAAAAAT AAATATATCT CGAAGTTGCr AGTTGGGGCA GCAACAATTA 
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AAGTATTACA TCTAAAAGGT ATCACAGAAG AACAACGTAA CCAATACATC AAAACATTAC 3 60 

GCGAACACCC AGAACGTGCA CAAGAAGTAT TCTCTGAATC ACTTAAAGAC AGCAAGAnCC 42 0 

CAGACCGACG TGTTGCACAA CAAAACGCTT TTTACAATGT TCTTAAAAAT GATAACTTAA 480 

CTGAACAAGA AAAAAATAAT TACATTGCAC AAATTAAAGA AAACCCTGAT AGaAGCCAAC 54 0 

AAGTTTGGGT AGAATCAGTA CAATCTTCTA AAGCTAAAGA ACGTCAAAAT ATTGAAAATG 60 0 

CGGATAAAGC AATTAAAGAT TTCCAAGATA ACAAAGCACC ACACGATAAA TCAGCAGCAT 660 

ATGAAGCTAA CTCAAAATTA CnTAAAGATT TACGTGATAA AAACAACCGC TTTGTAGAAA 72 0 

AAGTTTCAAT TGAAAArGCA ATCGTTCGTC ATGATGAGCG TGTGAAATCA GCAAATGATG 78 0 

CAATCTCAAA ATTAAATGAA AAAGATTCAA TTGAAAACAG ACGTTTAGCA CAACGTGAAG 84 0 

TTAACAAAGC ACCTATGGAT GTAAAAGAGC ATTTACAGAA ACAATTAGAC GCATTAGTTG 900 

CTCAAAAAGA TGCTGAAAAG AAAGTGCCGC CAAAAGTTGA GGCTCCTCAA ATTCAATCAC 960

CACAAATTGA AAAACCTAAA GTAGAATCAC CAAAAGTTGA AGTCCCTCAA ATTCAATCAC 102 0 

CAAAAGTTGA GGTTCCTCAA TCTAAATTAT TAGGTTACTA CCAATCATTA AAAGATTCAT 10 80 

TTAACTATGG TTACAAGTAT TTAACAGATA CTTATAAAAG CTATAAAGAa AAATATGATA 114 0 

CAGCAAAGTA CTACTATAAT ACGTACTATA AATACCAAGG TGCGATTGAT CAAACAGTAT 12 00 

TAACAGTACT AGGTAGTGGT TCTAAATCTT ACATCCAACC ATTGAAAGTT GATGATAAAA 126 0 

ACGGCTACTT AGCTAAATCA TATGCACAAG TAAGAAACTA TGTAACTGAG TCAATCAATA 132 0 

CTGGTAAAGT ATTATATACT TTCTACCAAA ACCCAACATT AGTAAAAACA GCTATTAAAG 13 80 

CTCAAGAAAC TGCATCATCA ATCAAAAATA CATTAAGTAA TTTATTATCA TTCTGGAAAT 144 0 

AATCAATCAA AAATATCTTC TCTAGTTTTA CATCATTTTT TAAATAATTT TCGTAACAAA 1500 

CCGTGATTAA AAAGAACCGT TGATTCTCAA TCGAATCTAC GGTTCTTTTT TCATTTTCCA 1560 

TCAATTAAAT GCTTCTTCGC TATTTGTCAG CCCACTTTTT TACCTGCAAC TTGTTAAATA 1620

ATCCTTACAT CGTTAACGAA TAGTTCATCA TTTAGTTGAA TCAGCTCAAC TTTATTAACT 1680 

TCATATTTTC ACAAACTATT GCGCAATCCA TTCCTTTTCC ACTACAAGCA CCATAATTAA 1740 

ACAACAATTC AATAAAATAA GACTTGCAAA GCATAGTTAT GTAGCTATAT AAACGCCTGC 1800 

GACCAATAAA TCTTTTAAAC ATAACATAAT GCAAAAACAT CATTTAACAA TGCTAAAAAT 186 0 

GTCTCTTCAA TACATGTTGA TAGTAATTAA CTTTTAACGA ACAGTTAATT CGAAAACGCT 192 0 

TACAAATGGA TTATTATATA TATGAACTTA AAATTAAATA GAAAGAAAGT GATTTCTATG 19 80 

ATTAAAAAtA AAATATTAAC AGCAACTTTA GCAGTTGGTT TAATAGCCCC TTTAGCCAAT 2 04 0 
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OnXyCAAATG ACACaGACAT TACTAGCCAA CGATTTAGCT ATnACTCCAA ACCrrCCATT 

GGATTTGGTA AAGGnT 

(2) INFORMATION FOR SEQ ID NO: 446: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1557 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 446: 
;^CATGG CTTAAATCAA GTACGCTATA ACAAATTACA AGAACATCCT ATTGTTATGC 
ATCCGGCACC TGTCAATAGA GGAGTAGAAA TACAAAGCGA TTTAGTAGAA GCTTCAAAAT 
CAAGAATTTT TAAGCAAATG GAAAATGGCG TTTACTTAAG AATCGCAGTC ATTGATGAAT 
TATTAAAATA GGTAAGGGGA CGAAAATGAT GAAATTAATT aAAAACGGTA AAGTATTAC-. 
AAATGGCGAA TTACAACAAG CAGATATTTT AATTGATGGT AAGGTAATTA AACAAATTCC 
ACCTGCAATT GAACCAAGCA ATGGTGTTGA CATCATAGAT GCGAAAGGTC ACTTTGTGTC 
ACCTGGATTT GTCGATGTTC ATGTTCATTT ACGTGAACCT GGTGGTGAAT ATAAAGAGAC 
AATTGAAACT GGTACTAAAG CTGCTGCTAG AGGCGGATTT ACAACTGTAT GTCCAATGCC 
TAACACAAGA CCGGTACCAG ATTCTGTAGA ACATTrTGAA GCTTTACAAA AATTAATCGA 
TGACAATGCT CAAGTACGTG TAlTACCrTA TGCTTCAATT ACAACACGTC AATTAGGTAA 
AGAArrGGTT GATTTCCCAG CACTAGTAAA AGAAGGTCCC rTTGCGTrTA CAGATGACGG 
TGTAGGAGTA CAAACTGCAA GCATGATGTA TGAAGGCATG ATTGAAGCTG CAAAAGTAAA 
CAAAGCCATC GTAGCACACT GTGAAGATAA rTCATTAATC TATGGTGGTG CAATGCATGA 
AGGGAAACGC AGTAAAGAGT TAGGTATACC AGGTATTCCA AACATTTCTG AATCTGTTCA 
AATCGCAAGA GATGTACTAT TaGCTGAAGC AGCAGGTTGT CATTATCATG TATGTCATGT 
TTCTACTAAA GAAAGTGTTA GAGTCATTCG TGACGCTAAA CGCGCAGGCA TTCATGrTAC 
AGCTGAAGTT ACACCACACC ATrTATTGTT AACAGAAGAT GATAnCCTG GTAATAATGC 
CATTTATAAA ATGAATCCAC CAriGAGAAG TACTGAAGAT AGAGAGGCTT TGTTAGAAGG 
GTTACTAGAC GGTACAATTG ACTGTATCGC AACAGAcCAT GCACCACATG CACGTGATGA 
AAAAGCACAA CCAATGGAAA AAGCaCCATT CGGAATTGTr GGTAGTGAAA CAGCATTCCC 
ATTATTATAT ACGCATTTTG TAAAAAATGG TGATTGGACA TTACAACAAT TAGTAGATTA 



2160 
2176 



60 

120 

180 

240 

300 

360 

420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 

1080

1140 
1200 
1260 
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TTATGCAGAT TTAACAATCA TTGATTTAGA TAGTGAACAA GAAATTAAAG GAGAAGATTT 1380 

CTTATCAAAA GCAGATAATA CACCATTTAT CGGCTATAAA GTTTATGGAA ATCCGATCTT 144 0 

AACAATGGTT GAAGGCGAAG TTAAATTTGa GGGGGATAAa TArTATGCAA gCAAACGTTA 1500 

TCTAGTGTTA GAAGACGGTC TTTTACGAGG CTACCGTTAG GTCTGATAAC TTACTGA 1557 



(2) INFORMATION FOR SEQ ID NO: 447: 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1799 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 




(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 447:





GCTAGAAATm TTGmATGaCA ATACAACTCT GTTAAAaTGA TGGACGTAGA CAAATATGCG 60

TATTGACGCT TTATTTTAAA AATTaACATG CTTATAACAT GTTTATAGAA GGAGATTAAC 120

CTATGAACTA TCaAGTTCTT TTATATTATA AATATATGAC GATTGATGAC CtGAACAGTT 180

TGCTCAGGAT CACTTAGCCT TTTGTAAAGC ACACCATTTA AAAGGTAGAA TTCTTGTTTC 240

TACAGAAGGT ATTAACGGCA CATTATCTGG TACAAAAGAA GAAACCGAAC AATATATGGC 300

ACATATGCAT GCCGATGAAC GATTCAAAGA TATGGTGTTT AAAATTGATG AAGCTGAAGG 360

ACATGCTTTT AAGAAAATGC ATGTACGTCC TCGAAAAGAA ATCGTTGCTT TAGATTTAGA 420

AGATGACGTC GATCCAAGAC ACACAACTGG CCAATATTTA TCACCTGTAG AATTTAGAAA 480

AGCTCTTGAA GATGATGACA CAGTCATTAT TGATGCACGT AATGATTATG AATTTGATTT 540

AGGTCATTTC CGAGGTGCAA TTCGTCCAAA TATCACACGT TTTAGAGATT TGCCTGACTG 600

GATTAAAGAG AATAAAGCGT TATTTGCAGA TAAAAAAGTG GTTACGTACT GTACTGGTGG 660

CATTCGATGC GAAAAATTTT CTGGATGGCT TTTAAAAGAA GGTTTCGAAG ATGTAGCTCA 720

ACTTCATGGC GGTATTGCTA CATATGGTAA AGATCCTGAA ACAAAAGGTG AATATTGGGA 780

CGGTAAAATG TACGTATTTG ATGACCGTAT CAGTGTTGAT ATCAACCAAG TTGAAAAnAC 840

AATTATTGGT AAGGATTGGT TTGATGGCAA ACCATGTGAA CGTTATATTA ATTGCGCTAA 900

CCCAGAATGT AATAAACAAA TATTAGTTTC TGAAGAAAAC GAAACTAAAT ATTTAGGTGC 960

ATGCTCTTAT GAATGTGCTA AACATGAGCG TAATCGTTAT GTTCAAGCAA ATAATATTAG 1020

TGATAATGAG TGGCAACAAC GTTTAACAAA CTTTGATGAT TTACATCAAC ATGCTTAGTT 1080

TTAATTAAAT ACCTTTCAAA ACACGCTTTG AAAATCCGAT TTATAAAGGT TTTTCAAGGC 1140
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TAAATTTTAA TACTGCGGGG TGTCTTAAAA TGCACATTTT AGTAACAGGG TTTGCGCCTT 1260 

TTGACAATCA AAATATCAAT CCCTCATGGG AAGCTGTGAC TCAACTAGAA GATATTATTG 132 0 

GCACACATAC AATCGATAAA TTAAAACTAC CAACCTCTTT TAAGAAAGTA GATAATATTA 13 80 

TAAATAAAAC GTTGGCATCT AATCATTATG ATGTTGTACT AGCTATAGGA CAAGCTGGTG 144 0 

GTAGAAATGC CATTACCCCA GAACGTGTCG CCATTAATAT TGATGATGCA CGTATTCCAG 1500 

ATAATGATGA TTTTCAACCT ATTGATCAAG CCATTCACTT AGACGGTGCG CCAGCTTATT 156 0 

TTTCAAATTT ACCaGTTAAA GCAATGACTC AAAGTATTAT TAATCAAGGA CTTCCTGGAG 1620 

CACTTTCAAA TAGCGCAGGT ACATTTGTTT GTAATCACAC ACTTTATcAC TTAGGTTATT 168 0 

TACAAGATAA GCATTACCCT CACCTACGAT TCGGATTTAT TCaTGTGCCA TACATACCAG 174 0 

AGCAGGTcAT TGGTAAACCC GATACACCAT CTCATGnCCA TTGAGGAAAA GATnAGTTG 17 99 
(2) INFORMATION FOR SEQ ID NO: 448: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1341 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 448: 

ACTTGGTTTT TTATTGTTTA TAAATAAAAC TCACTTAATA ATGTTTTCAT AATCTTCTTC 60 

GACTACTTAA TTCTTTAAGA TATTCGTGAA AAGAGACATT ACACTAGTTA ATTTTCAAAC 120 

AATACAAAAA GCGTCTACCT CCTACATATA ATTGTAGCGG AGATAGACGC TTAATATTTA 180 

TTTAAAAATT ATTTTAAACC ACCGAATGTC ATAACATCAC GGGCAATCAT ACTTTCTTCA 240 

TCTGTTGGAA TAACGACAAC TTTAACTGGT GAATGAGGAT AGTTAATAAA TCCTTCTTTA 300 

CCACGTAGTA AGTTTTCATT TTTCTTAGGA TCCCAGTAAA CACCCATAAA TTCTAAGCCT 36 0 

TCAAGAACTT TCGCACGAAT TTCTACTGAG TTTTCACCGA TACCTGCTGT AAATACGATA 42 0 

ACATCAACAC CATGCATTCT CGCAGCATAT GATCCAATAT ATTTGTGAAT TTTAGAAGCA 4 80 

AATACATCTA AAGCCATTTG TGAACGTGCT TTACCTGATT CAGCTTCTTC TGATAAGTCA 54 0 

CGTAAATCAC TAGATGTACC TGATAATCCT AATAAACCTG ATTCTTTGTT TAAGATTTCC 6 00 

AATACTTGTT CAGCAGTTTT ACCTGTTTTT TCCATAATAA ATGGAATTAA AGCAGGGTCA 66 0 

ATATTACCAG AACGAGTACC CATTGTTACA CCAGCAAGTG GTGTGAAGCC aTTGATGTAT 720 

CAATAGATTT ACCGCCATCG ATAGCTGCAA TTGATGCTCC ATTACCAATG TGACATGAAA 78 0 
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TATGGCTTGT ACCATGGAAA CCATACTTAC GAATGCCATA ATCTTTATAA TAATGATATG 9 00 

GCAAGCTATA TAGATATGCT TTTTCAGGCA TTGTTTGATG GAATGCTGTA TCAAAAATTG 96 0 

CCACATGAGG GATATTTGGT AATAATTTAC GGAAAGCACG AATACCCAtC AAGTTaGCTG 1020

GGTTGTGaAG CGGTGCTAAT TcGCTTAATT CTTCAATTTC CTTTTCAACC TCATCAGTAA 1080 

TAGCTACTGA TTCAGGGAAT TTTTCACCAC CATGTACAAC ACGGTGACCT GTTCCATCGA 114 0 


TATCGTTAAT ATCATTAATA ATATTGTGCG CTTTAAAAGC ATCCAACATG ATATCAACTG 120 0 

CCTCAACGTG ATCCTTGATA TCTTGTACTG TTTTAACTTT TTCCCCGTTG ACTTCAATTG 126 0 

TAAAAATTGA ATCCTTCAAT CCGATTCTTT CTACTAAACC TTTTGTTACT AATTCCTCTT 13 20 


CAGGCATTCT AATTAATTGA A 1341 
(2) INFORMATION FOR SEQ ID NO: 449: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1529 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 449: 

TTTTGAAGAT ACTACCGATG AAAATAGACA AAAGATTTTT CAATATTTAT CACCTGAAgA 6 0 

AGTTGCAAAT TTCTTTGATC AATTAGATAT TGATGACGAT GAATATGAGT TGCTATTTGA 120

TAAGATGAAT GCGACATACG CAAGTCACAT ATTAGAAGAA ATGTCATACG ACAATGCAGT 180 

AGATATTTTA AATGAGTTGA CTAAACCAAA AGTTGCTAGT CTTTTAACAT TGATGAATAA 24 0 


AGATGACGCG AATGAAATCA AAGCATTACT TCACTATGAT GAGGATACGG CCGGCGGTAT 3 00 

TATGACGACG GAgTATTTAT CACTTAAAGC GCATACGCCT GTTAAAGAAG CATTATTATT 36 0 

GGTCAAAGCG CAaGCACCAG ACGCAGAAAC AATATATGTT ATATTTGTCG TTGATGATGA 4 20 


TGGTAAATTA GTAGGTGTTT TATCGCTAAG AGATTTAATT GTAGCTGAAA ATGATGCTTA 4 80 

TATTGAAGAT ATTATGAATG AACGTGTCAT TAGTGTGAAT GTAGCAGACG ACCAAGAAGA 54 0 

TGTTGCTCAA GTTATGAGAG ACTATGATTT CATGGCTGTA CCTGTTATAG ATTACCAAGA 6 00 

ACATTTGCTT GGTATCATCA CGATTGATGA TATTTTAGAC GTTATGGATG AAGAGGCTAG 6 60 

TGAAGACTAC TCTCGTTTAG CCGGGGTATC AGATATCGAT TCGACTAATG ATTCAATCAT 720 

TAAAACAGCA TTAAAACGTT TACCATGGTT GATTATTTTA ACATTTTTAG GAATGATTAC 780

TGCGACAATT TTAGGGAGAT TCGAAAAAAC ATTAGAAAAT GTAGCGCTAC TCGCAGCGTT 84 0 
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TCGTAACATT ACGAcAGGGG AAATTAATGA GCAAAGTAAA TTTAGAATTG CATTAAGAGA 960 

AGCAGGAAGT GGTGTATTAT CGGGTGTTGT ATGTTCAACA ATATTATTTA CAATTATTGT 1020 

TGCAATATAT CATCAGCCAC TTTTAGCATT AATCGTTGCA GGAAGTTTAA CTTGTGCGAT 1080 

GACGGTGGGG ACGTTTGTAG GTTCGATGAT TCCATTATTG ATGAATAAAT TAAATATCGA 114 0 

TCCAGCAGTG GCTAGTGGAC CATTTATTAC AACAATTAAT GATATTATTA GTATGTTGAT 1200 

TTATTTTGGT TTAGCTACAT CATTTATGGC TTACTTAATT TAAGGAGGAG TTATGGAGTT 12 60 

TTTATCTTTA GTTATTGTTG TTTTAGCAGC GTTTTTAACT CCAATAATTG TCAATCGATT 1320 

AAATATTAAT TTCTTGCCAG TTGTTGTTGC AGAAATTTTG ATGGGGATTG TGATTGGAAA 1380 

TTCATTTCTA AATATAGTAG AAAGGGATTC AATTCTAAAT ATTTTATCAA CGTTAGGCTT 144 0 

TATCTTTTTA ATGTTTTTAA GTGGTTTAGA AATTGATTTT AAAGCTTTTA AAAAAGATAA 150 0 

ACGCGCACGT CAAGGACAAA ATGATGATG 1529 
(2) INFORMATION FOR SEQ ID NO: 450: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1827 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 450: 

TTCTGGAAAC CAAAGTATTG TCATCTTCTA CTAGTAGTAT nGGO^.TCCAT ATCACCCAAT 6 0 

ATCATTTAGT ATATTTCATA TTTTCTCCTG ATTTTAATCG ACTTTGAAAA TCTTTAATCC 120 

GGCAGTCAAC TTCAAAGCCA TGAATCATCA TTTTTTGCAT TGGTGCTTGT ATAAAGTAAT 180 

AAATCGGCCA AAATAATCGA GGGATATAAT CGTATAGATG TATATAAACG ACTGCCGACT 24 0 

CTTTGATTAA TCTAAATTCT AACTTCCCTT GATTAACTGT ATATTTTTTC ACTAAACTTC 3 00 

CACTCAATAA AATTAAAGTT ATTATTCCAT CAGCTGTTTC TTCTATTTTA AATACTGCGA 3 60 

GCGGTCGCAC CTTATTCTTA ATATATATCT TAAATTGATC ATGTGATTTT TCTGTTTTCA 4 20 

CAAAAGTTCC TTTAGTGTAA CCCATCCATG CAATAAAATG GTTTACAACG TTCTTTAATG 4 80 

TCCATCCCTT TGGTAAAACT ACCTTCATCG TTGATCTAAC ATCATCATAC TTTGAAACTT 54 0 

GTAATTCTAC ATTAACTAAA GAACGTTTAA AAACTAAATT TGTTGTTTCT ACAGGTGTAC 6 00 

CATATGCACC TAGGCGTTCA ATTGTTTCAT TATCATAACG ACTCCCAGGT ATGTAGATTA 6 60 

CTTTTTTTAC TTGATTGATA GCCGCTGCTC GACCAAAATT ATCTGCTGCG ATTAATGTTA 72 0 
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CAGCAATATC TATTTGATTC ATTGCTGCAA CAACCTGTTC GTAATGAAAT ATATCACACT 84 0 

GAATCCAAGT CATTTCAACA TCATCTGTTT kTTTATTGTC TGGATATTTT GATATAGCAA 900 

AAAGTTCAGC ATCATTTTCA ATCACTTCAC TTAAATACTT ACCAATATAT CCTGTTCCAC 960 

CTGCTAATAA AACTTTAGGT TTCATCTAAA ATACTCCTTT AAACTGTAAC CAAAAAACAT 1020 

ATTACTCCAC CTTTTAGTTA CATATATATT ATAATAGTAG CAAATGTTTT AAAATTTCAA 1080 

AATACTGGAG GCTTTTTATG GCCCATATTA TACGTAGAGT TAGTATCAAA GATGTAGAAA 114 0 

ATTTCATTTC AATGTTAGCG AACATATACG ACGAATCTCC GTATATGTTC TACACACCAG 12 00 

GAGAATATGA TCCTAGCGTC ACATCGGCTA GTAAACAATT AGAAGAATAT ATCACTTCTC 12 60 

CGCATAAAGT CATCTTCGTT GCTGAAAGTG ATGAACAACT CGTTGGCTTT GCCTTTGTTA 1320 

ATACGACACC ATTTCAACGC ATTAAACATG TTGCTAAAAT TGATTTAGGT GTAAAGAAAT 13 80 

TATATCAACA TCGTGGAATT GGCCAAGCAC TTCTTGATGC CATTATGGCT TGGTGTTTAA 1440 

ACAATCAAAT ACACCGAATT GAAGCAAATG TACCACTCAA TAACCAACCT GCCCTCGAGC 1500 

TTTTTAAAAG TGCCGATTTT CAAATCGAAG GCGTTTTAAA AGATAAGTTA TTTATCGATG 1560 

GTAAATATTA TGATGACTAT ATGATGGCTA AAATTCTTAA TTAAAGCTAT TTTATCATAA 1620

TCTTGTATCA GAATCGTATA ACAACGAATT TAATGGTTAC CTAATACATT ACTCATACTT 16 8 0 

ATCAATGTTA TCTAATCTCA AATAAATACG TACACTCTTA TTCATTTATC AAATTTAAAT 174 0 

TCAAAATAnA ACACCACTAA TGTGTAATTG ATTAACTATC AACTACGATT AGTGGTGCTT 1800

TATATATGTG GTTAGTTTTC CTnACTA 1827 
(2) INFORMATION FOR SEQ ID NO: 451: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 616 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 451:

ATATAGATTA ATGTTTGTTA TTTGTACTGT GTTTGCCAGT GACATTTATT ATAAAACATA 60 

TGACGTTATT CTATGJcTGTA CTCATTtTCT ATATTGTAGG TTTATTAACG ATTAGAAGTA 120 

TTATTAAAAA GTTGAAATAT CAGGAAACAT TATTACGAGA CTAAAAAACT TCCATTGGCA 180 

TGTATGTTGT AAAGGTGCAT GTAATGTTGA ACGCCAAATG ATACGGCGTT CAGATTACAT 240

TAGCATCTAT ACGTTAACAG CATAACCAAT GGAAGTTTTC TTCGAATCTA TTCTTTTATT 3 00 
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AATGCGTCTT TTGAAAAATG GTCATTAAAG GCATCAGATT GCTTAAAGTC TTCGTATGCA 42 0 

TGTCGATCAG CAAATCCGAA ATAAATTTTG TATGTTGTAC CTTTAGCAGG TCTTAACAAA 4 80 

CGATAGCTTT TAAAGCCACC AAAGTTTCTG AAATTATCGT CTACACTAAT CAGTTTCTTT 54 0 

TCAAGTTGAT ATGCATGATC TTCTGTTGAT GGaATGAAgA TTGCACaATA GAAATGaTGT 6 00 

TCACtGAATT CACCAA 616 




(2) INFORMATION FOR SEQ ID NO: 452: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 944 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 452:
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GCACGAGTGA 


TTAAACGGTT AATCAATGAA ACATTTGATG CAAA i i ACAT TGAAGTTATT 60

GAGGGAGGAA TTGAAGAAAC GCAAACGTTA ATTCACTTAC CTTTTGACTA TGTCTTCTTT 120

ACAGGAAGTG nAAATGTAGG CAAAATCGTT TATCAAGCTG CCAGCGAAAA TTTAGTTCCT 180

GTGACATTAG AAATGGGCgG TAAAtCTCCa GTCaTCGtTG ATGAAACAGC GAATATTAAA 240

gTTGCTAGTG AGCGCATTTG TTTTGGGAAA TTCACTAATG CCgGCCaAAC ATGTGTTGCa 300

CCAGATTACA TTTTAGTACA CGAATCTGTA AAAGATGATT TAATCACAGC CCTATCAAAA 360

ACGTTGCGTG AATTTTATGG TCAAAATATA CAACAAAGTC CAGATTATGG CCGCATTGTA 420

AACCTTAAAC ATTATCATCG TCTGACTTCA TTACTTAACA GTGCACAAAT GAATATTGTA 480

TTTGGTGGTC ATAGTGATGA GGATGAACGT TATATAGAAC CAACATTGTT AGATCACGTT 540

ACAAGTGATT CAGCAATTAT GCAAGAAGAA ATTTTTGGTC CTATCTTACC GATTTTAACG 600

TATCAGTCAT TGGATGAAGC AATAGCCTTT ATTCACCAAA GACCAAAACC TTTGAGTTTA 660

TATTTATTTA GCGAAGATGA AAATGCTACA CAACGTGTAA TAAACGAGCT ATCATTTGGC 720

GGCGGCGCTA TTAATGATAC ATTGATGCAC CTAGCGAATC CTAAATTACC ATTTGGTGGT 780

GTTGGTGCCT CAGGTATGGG ACGCTATCAT GGTAAATATT CATTCGACAC TTTTACACAT 840

GAAAAAAGCT ACATTTTCAA ATCnACACGA TTAGAATCAG GTGTCCATTT ACCACCATAT 900

AAAGGTAAAT TTAAATrCAT CAAAGCTTtC tTTAAAnATT AATT 944



(2) INFORMATION FOR SEQ ID NO: 453: 
(i) SEQUENCE CHARACTERISTICS:
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(C) STRANDEDNESS : double 
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 453:

TCTCCAGTAG ACCTTGTGTA TGAACAGTTT CTTTCATATG AATGAACATC GTTTTTAAAG 6 0 

TTTGTTTCAA CTCAGCCTTA TCAGGATAAT ATCTAGAGAC AGTCGTCTCT GGCATCTCCA 12 0 

TTGTATGATA TTTAACCTTA TGCAGCGACC CATGATTTTC GTTATAAACA AATGTATGAT 180 

TCACTTCATC GAAATCATGA TCTTCTCCTG CAATCCAAAA AACTGGTACT ACTTGTTGCT 24 0 

TATGTGTATC CGTTAATTCC TTAGATAAAG TAATGATTGa AAATATTTTA TGGAATGTAT 3 00 

ACAATGGTCC CCCGAAAAGC CCTGCTTGTT GTCCACCAAT CACAACTTTT GAACCATTAG 360 

CTAAATGTTG TATGTTTAAT TCTTGTTCAC TTGAAAGCTT GAAAGCTTTA AATCACTCAT 42 0 

ATATTCACGA ATAACATTCG CTAATGCCGC TTCTCTTCCA TTATTTTCTT TAGACATCCT 480 

TTTTTCAAAA CTAGTTTGTT GAGCTGCATC ATATTGAAAT AATCCTGTTA TTACAGGGTC 54 0 

ACTGCTCTTT ATTTTTGGTA TAAACTGATC TTTTTCATTT AAACTAACTA CTTTACAGTC 6 00 

CATGTTTTTT CTCCTTAAGT ACGCGATTAC AAACTATAGT ATAAAGTCTA TACCGGTGAT 66 0 

TGACAATTTT ACGGCTTGAA AATCAATTTA ATCATGGAAA ATTTATAATA TTCATTGTTT 72 0 

TACATTTTCA AATCAATGAA AAACACAAGT GGTTTAATGT ATAATAATAG TAGTAAACAA 78 0 

ATAAGGGGTA GATAAATATG AGTGAAATCA AACGTCTTGA AATTAATTAC AAAACTGACG 84 0 

AATTATTCGA AAACTTTAGA GCGTTTGGCA ACAAAGACTT ATACATGGTC AATGAGTTAA 90 0 

ACGGTCAAAT GATTGATGCA AGTTCAGATT CACCATTTTA TGGCATATTT GTCGaGATCA 96 0 

ATTAGGAGCT AGAATGGCAT TACTAAAAAA AGGTGATGTC GAAGAAATCT ACTTCCCAGA 1020 

TTTTGAAGAT TATATATTAT TATGGAAGTT AGAAGTATTA CCAAAATATC AAAACAGAGG 108 0 

GTACGCTTCA GAATTGATTG ATTTTGCAAA GAGTTTCAAT ATGCCAATTA AAGCCATTGG 114 0 

CAGAAATGAT TCTAAGGATT TCTTTTTACA TCATGGATTT ACAGATGTGG AAGCTAAAAA 12 0 0 

TATAGAGGGA CATGATGTCT TATTGTGGAA ACCATAAGAT AATAATATTC GACACTACGA 126 0 

GCATGAAAAT GCATCTTTTC GTAGTGTCTT TTTTACAATT ACTTTCTTAA GCTAATATAA 1320 

GTaAATCATT TTCAAATTAT TTGTCTTAAC GTACAATATC ATTTAGTTGT TTCCATGrAT 13 8 0 

TAATTTCATA ATCAGGTATA ATTCCTGGAT TATGATCAAA TCCTCTAAAa TTAAACCAGC 14 4 0 

AAGTAGCTAT ACCcGCATTG ATTCCACCTA GAATGTCAGA TGTTAnAGAA TCTCCaACTA 15 0 0 

TAATCGAGTG CTGtCTTTCA TCCTCACCAA TATCATTAAA AACATAATTA AAAAATTCCG 156 0 
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ACGGCGTCTG ATTTAACCTT CTCTTTTGCG TTTCGGTTAC ACCATTAGTA ACAATATATA 168 0 

AATCATGTCT TTTCGATAAT TCGACAATTG TTTCTAATGT TTGATCAAAG TATTTAACTT 174 0 

TAGCTTCTGC TAATCCATTT CTAAATAACA CATCTGCACG ATGCCCATCA ACTTCCATTT 18 0 0 

GATGATGTTT GAAGTAATTC ACAAATCGTT CTGATAATAC TTCAGACTTC GkTAATTTAT 186 0 

TTTGkTGAAA AGCTTCCCAA TGTTGGTGaT TGaTTTTTTT AAATGkTAAA AAATCATCCy 192 0 

TTGTTGCTTT ATGATTAAAA ACATTCGCCA TATAGTGAAn CGCCCATTCT TCTGcATCAT 198 0 

AAAAATCAAC AATTGTATCA TCAAAGTCTA TCAAAATATT TTTATATCCC AATTTCCCCA 204 0 

TCTCCTATAT TGTCTATGTA TCTAAATCTT AACAGAGGCT CAAATTTCTG CAAATAAAAT 2100 

AAACTGAGTG CATAACATTA AAGTATGCTC ACCCAGTTTA TTTTAAAGAA TATTAGTTAT 2160 

TATATTAGAA TCCAAATAAT TTACCTAGTA AACCCACACC GTTAGCAACG ATGTCTACGA 2220 

TACTTGTGCC TAATTTCACA CTATCATGTT GTTGTGCAGC TTGCACAGTA TTTGCGATTG 228 0 

CTTCTGCTAG TCCAGTCATT TAAATCTCTC CCTCACCTTT GAAATAATAC TGATTACTTA 234 0 

CATAACATAT TGAAATTAGA ATCCGAATAA TTTACCTAAT AAACCTACGC CATTTTCT^C 24 0 0 

GATGCTCACA ATGCTTGTGC CTAATTTTGC GCCATCATTA TTAATTGCTG CAGTTACGGT 2460

ATCTTTAATT GCGTTAAATA AACCTTCCAT TGAAAACACT CCTTAAAATT TAAATTTGAA 252 0 

GATAACAAAA ACGTGCGTAg yTTTTAAATC ACCGAAATGT TATTCGCTTA ACGTTTTGTT 2580 

GTTGTTATTT TAAAATAAAT TTGATGCAAT TAGTTTGTTT ATCCGCACAA CATCTTATAA 2640

TGTACTTAAC TGTATTTTAA AGAGAAAAGA AATACAGTTA GGCATTCAAA ACTGTATTTA 2700 

ACACAATTAA GTTGCCTGAA TTCGTATTTA AGTCTTATTG AACCTTTTTA GATAAATAGC 276 0 

35 TQIhTPAThG TGAAM^ATAl AAACATTTTT TATTTACAAG GTATTGCTAA TTTAAGTTCA 2820 

TTTAGATATA ATAATTCTTG TGTTGTTAAA CGTGTCCTGG TAGCTCAGCT GGATAGAGCA 2 880 

ATGGCCTTCT AAGCCATCGG TCGGGGGTTC GAATCCCTCC CAGGACGTTT ATAGGTATTT 2 94 0 

TTATACGCAT TACCAAACAA AAGAGTTCCG TGATTACGGG GCTCTTTTTG TTTTGAATTT 3000 

CAGTAATATA GTATGATGCG TCACCAAAAC GTCCCCCGCA TAAGCCCCGA AAATACAGTA 3 060 

ATTAAAACAA GCATGCTTAT TCGTTATAGA ATTTTTTGAC ACACAATTGA CACGCGTCTG 3120 

ACAC'rrGTTT ATACATTTTT AATTAAGTAA TTTTGTGCTC AAATTTCATC TATACTGCAC 3180 

CTGAACTACA CCAACACTAC ACCAAGATTT TTAACACTCA CCATTTGCAT GCGTAGAGAT 324 0 

TTTTATTATT ATATTATTCC TATAGATTTT GATACTATTC AAAATTTTAG GGACTTTTCA 3 300 

GGGGCCCGAA ATCCTATAAT TATAATTATA TACATCTAAA AAAAATAACC ACGTCCATCG 3 360 






EP0 786 519 A2 



ATACTATTGG CAAATTTATA AAGTAGTTCA GCGTTTTTCA ATGACATATT GTCTAATGAT 3480

CTTTCATTTT TTCTCATTCT GTGTATTGTG CTTTGTGGAA CTCCTGTTTG TTTCGATATA 3 54 0 

TGTAAACTGC TCAAATCACT GTCTAATAGT TTTTGAATTT GATTTCTCAT TTTGTACACC 3 6 00 

CCCTTGTAAA TCTTCAATCA TCATTTAAAT TAAATAATTA CTTTTCCACA TATTCCAATA 3 66 0 

TTTAGGTTGC AAAGCATACC TCAAATATCA TTAATTTTGA GATTTAAATG TCAATTATGT 372 0 

GTCTATCAAT CCAATATACA TACTCTAATA ACGTAATAGT ACACACTCTT CTTATTAATT 3 78 0 

GGCCATAGCT ATCATGATAT AATTAGTGAA GAAAATCACA TAAGAAAGGT TGTAAATCAT 3 84 0 

GAGACTTCAA AAAGCACCTC TAGTAACGTC AGGACTAGTC TTAGGATTAT TAGGCCTGGG 3 900 

TAATCTATTA AAAGACTTAT CTCTTACTTT AAACGCTGTT TGCGGAATCT TTGCTTTCTT 3 96 0 

GATTTGGATT CACCTTTTAT GTACTATGAT CAAATATTTT AATAATGTGA AAGAACAATT 4 02 0 

AAACAGTCCT CTAGTTTCAT CAGTGTTCAC AACATTTTTC ATGTCTGGCT TTTTAGGTAC 4 0 80 

TACTTATTTA AATACATTTT TTAGTAACAT AACTTTTATC AATAGCTTAA TAACGCCTAT 414 0 

TTGGATTTTA TGCCTTGTGG GAATTATGAC GCATATGATT ATTTTTTCAA TAAAATATTT 4 2 00 

25 hAMKOATTTT TCACTTGAAA ATGTTTATCC TTCGTGGACT GTACTTTTTA TTGGTATTGn 4 2 60 

TATCGCAGGA TTGACGGCAC CCGTTAGCGG ATATTTTTTC ATAGGTCAAT TAACAGTAAT 4 320 

ATATGGCTTT GTAGCTACTT GTATTGTCTT ACCTATAGTT TTCAAGCGAT TAAAAGCATT 4 3 80 

30 TCCATTGCAG ACGTCAATCA AACCGAACAC ATCGACAATT TGTGCACCAT TTTCTTTAGy 444 0 

CGCTGCAGCA TATGTTATAG CTTTTCCTAA GGCGAATGCT TTTATCGTAA TTATATTTTT 4 5 00 

ACTATTAGCT CAAATATTTT ATTTTTATAT CATTATACAA TTGCCTAAAT TACTAAAAGA 4 560 

ACCTTTTTCG CCCGTATTTT CAGCTTTCAC ATTCCCTTTA GTAATCTCAG CAACTGCTTT 4 6 20 

AAAGAACAGT TTGCCTGTAC TTATGTkTCC AGACATTtGG AAAGGkCTTT TGTTTATCGA 4 680 

AGTGTTATTA GCCACTGTAA TAGTACTTAG AGTCTTTATA GGATATCTTC ACTTCTTTTT 4 740 

AAAAAAGGAA AAACAAGATA AATTTCTnCG TAATGCGTCT CAGTAACACT ATTACCAAGA 4 800 

ATTAACACGT ATATTTAATA 4 82 0 
(2) INFORMATION FOR SEQ ID NO: 454: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4358 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear
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15 






ATTTGGATCT 
TACTGTATCT 
AATTTGTGAT 
ATCAACGACA 
TGATGATGGC 
TTTTGGCAAA 
AGTTTCATCA 
GGCATCTACC 
TTGCAAATCA 
ATTCGTAGAT 
TCCAGTGGTA 
ATTACCTTGa 
TAGTTGCTGC 
TTCAATTCTT 
AATGTTGTAC 
TTTAATCGAT 
ATTAGATTAC 
GTATATCAAT 
TCGTAAAACG 
TATTTAATCA 
AATTCATAGA 
TTGTTTTATA 
TAATCATATT 
GCGAATAATC 
GGTAATGCTT 
TGTTTACCTG 
TCGCTATCTG 
TCTGAGTCTG 



TTAATATCAC 
GAGTATTTAA 
TTTAATGATA 
GTtAATGTTG 
TGTACTGCTG 
ATGATATCTG 
TTACTCTTGA 
TTTTTGTTTT 
GGTTGTAACG 
TTTGGTGAAG 
GATTGTACAC 
GATGATACCA 
ATATCAACTT 
TATTAGTACT 
CTACTAAAAT 
TGGATACCAT 
TCCTAATTCA 
ATAGTATTAC 
TTATTTAACA 
TCATAAAAAA 
CAATTACATT 
TCTGGGAATT 
ATTTATTTTG 
CACCGAATAA 
TAGCTGTTTT 
CATCTGAATC 
AATCTGAGTC 
AGTCGCTATC 



CAATATTTTT 
TTGTGAAATA 
AATAATCTTG 
TATTTGATGT 
CTATACGCAT 
CATTATTTTC 
TAGCATCACT 
CCTCATTAGT 
CTTCTTGATT 
CTTGCTCATC 
TTTCTTTTTC 
TTTCTTTTTG 
TATCACTCGA 
TTCTGCAGCC 
TGATGCTGTT 
GCCTTTCCTT 
TCAAATTTTT 
ATTTTTAGAT 
TTTGTQTTTA 
TAGTATTATT 
TCTGTAACAA 
TCTAAACAGA 
TTTTTTACGA 
TGTGCCATTA 
ATGCTGATCT 
TGAATCGCTA 
GCTGTCTGAA 
TGAATCTGAA 
TGAGTCGCTA 



AATATCTTCC 
ATCACCTGAT 
GGCTGGTACG 
GATTAAATCA 
TCTTGTATTC 
ATTATTTGAA 
TTTAACATTT 
TGGTTGAACA 
ACTTATAGTT 
TGATTTGGCA 
TATTAATTTA 
ATTATCATTT 
TTGATTATCA 
TTTGCTTCTT 
CCCACTGTGT 
GTTATTGCCG 
AAATAATACA 
AAAGCACAAT 
AATAAAAGT-r 
TGCCCTTGAA 
ATTAAATTGT 
AAAAACCAGG 
CGACCGAATA 
TTTGAATTAT 
TTAACCGTAC 
TCTGAATCTG 
TCTGAATCGC 
TCGCTGTCTG 
TCTGAATCTG 



GGATTCAATC 
TTAACTTTGT 
ATTTTATTGT 
TTAACATTTT 
AAACGTTTAG 
TTACTATTGT 
AATGTAGTTG 
TTTACCACTG 
TGTTTAGTGT 
GTTGAAACTT 
TTCCCATTTG 
TTAGTATTGT 
CTTGCTGAAG 
GGTTCCCCAG 
ACTTTCTAAT 
TTTTATTTTC 
ATTGTTTTAA 
ACTTTAATTA 
TTTATGAGTt 
ATTAATATCT 
ATCTATTCCT 
CCACATGGAC 
ACAATAATGA 
TATTTTCACT 
TCATTGGTTT 
AGTCGTTGTC 
TATCCGAGTC 
AGTCTGAATC 
AATCGCTATC 



CATATACTTG 
CATCAACTGT 
TTTTATCTGC 
TAGCCTCTGT 
GTGCTGTACT 
TATCAACAAG 
ATTCAGTTTT 
ATTTATTCTC 
TTAAATCTTC 
CAACTTTATT 
AAGTCGTTTC 



TTGTCGCTcG 
ACCAAAAATT 
CGAAAATTTA 
TCTGTTTAGC 
ATACAAAAAT 
TTTTTCTTTA 
TTGTAATCTT 
TAGCTTTTCT 
TAAAGATTTT 
CTGGTTAAGT 
TCCTAATGCC 
ACCTGTTTCT 
AGCCGGAGTA 
TGAGTCCGAA 
TGAGTCGCTA 
GCTATCTGAG 
TGAATCTGAG 



60 
120 
180 
240 
300 

360 

420 

480 

540 

600 

660 
720 
780 
840 
900 
960 

1020 

1080

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560

1620 
1680 
1740 
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TCTGAATCTG 
TCTGAATCGC 
TCCGAATCGC
TCGCTATCTG 
TCTGAATCTG 

10 

TCGCTATCTG 
TCTGATGTTT 

GTTACATCAA
TGAGTTAAGC 
AAGCGGTATT 

20 TGCAAAGTAA 
TTACTGTCGT 
TCTAATGTCA 
ACAGCATCAT 
TCTACACTAT 
GTCGTACGAT 

30 

CCTTTTTCAT 
AGATTATATT 
TTTGACCAAC 

35 

GTTGAACTAT 
TGGCCTTTCA 
40 TGATCAGTAA 
TGATCTGtCA 
AAATTAGTAA 
ATATTACGCG 
TTTTTATTAC 
TCCATTTTAT 

50 

TCAAAGCTAC 
GTGTTTGTTG 



AGTCGCTGTC TGAATCTGAA 
TGTCAGAATC TGAGTCGCTA 
TATCTGAATC TGAATCGCTA 
AGTCTGAGTC GCTATCCGAG 
AATCGCTATC TGAGTCTGAG 
AATCTGAGTC GCTATCTGAG 
CTTCTTCGTA GTAGCCATTA 
CTTCGCCACC ATCGGCATCT 
CAGCAGGTTT TTCAAAGATA 
TACCATTTTC ATCTGTyTCA 
CTTTAACACC TTTAATTCCT 
ACCAAACATA ATCACCTAAA 
TGTTGTCAGC GTCTTTAATG 
CTGTACCTAC ATTTGCAGTT 
AAGTTCCATT GCTTAAACCA 
CTAATTCTTT ACCGTTACTA 
TGGCATCTTG TTTACCATCT 
TCTTTTGGTC GCCATTAGCA 
TATATTTAGT TTTGTCAGTG 
TATCTGGATA AGCAACTTGT 
TTAAATCGAC TGTAGCTGTT 
CATCTTTAAG TTTTGAAGTA 
CTtCGTAAAT TTTGAAGTTT 
CAAACGTTTG TTTAGTATAT 
ATAAATCTTC ATTGTTAATA 
CATAATCGAC AATGATTTCT 
AAGCTGTTTT ATCAGTTGTT 
CTCTAACATT TGTATATTGA 
TACTATCATA AATACCTTTT 



TCACTGTCTG AGTCTGAGTC 
TCTGAGTCTG AATCTGAATC 
TCTGAGTCTG AGTCGCTATC 
TCTGAATCGC TGTCTGAGTC 
TCGCTGTCTG AATCGCTGTC 
TCTGAATCGC TGTCAGAATC 
TCAAGTGTGA AATCATCATG 
TTATCATCTT CAGTTGTATT 
ACTTTGTATT TACCACTATC 
GTTGTACCAA TTACTTCGCC 
TTTTCAGTCG AATCTTGTTT 
CTATATTTTG GTGTTTTGTA 
ACACCTGTTG TAGTTAGTCC 
GTCGGTGTAT AACCGGCTGG 
GTGAACTGAT ATTTACCATT 
TCTTTAAGAA TGACATAAAC 
TTATTTGTAT CTTCCCATAC 
GTTGATGAGC CATTCACATT 
TCTAAAGTAT AATCAATTTT 
TGAATGATGT ATTGTTTATT 
TTATTATCAT TACTATAAAT 
TCAGGGGTGA AACTATCCAC 
TTTGCATTTG GATTAAATTT 
GTATTTTTAG GTTGATTTAC 
TAGTTTGTAC TTGAAATAAG 
TCGCTATATG TATCATTACC 
GCATTTTTAC GTTTCGCAAA 
TCTACATAGT TCGTAAAAGT 
GCAATAATAT TACCTTGGGC 



GCTGTCTGAG 18 6 0 

ACTGTCTGAG 192 0 

CGAATCTGAG 19 8 0 

TGAGTCGCTG 204 0 

TGAATCTGAG 210 0 

TGAGTCGCTA 216 0 

ATCCGTAATT 22 2 0 

TGTACCTGTT 22 8 0 

TAAATTATCA 234 0 

TTTTTCGTTT 24 00 

ACCATCTTTA 24 60 

GAATCCACTA 2 520 

ATCAGAATCT 258 0 

TGTTGAAAAC 2 64 0 

TTCATCTGTT 270 0 

ACCTTTAATC 276 0 

ATAGTCACCT 282 0 

TGAATAACTA 2 88 0 

TCCATTATCT 2 94 0 

GCTGCTTGTT 3 000 

AACATCGAAT 3 06 0 

AAATTGATTT 312 0 

ATATCCAGTT 318 0 

ATATGCAGTC 3 24 0 

CGGTTGTGCT 3 3 00 

TAAAGTTACT 33 60 

TGCAACTTGT 34 2 0 

ATATGTTGTT 34 8 0 

ATTATATAAA 3 54 0 
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GTAAATGTAT CGCCCTCTTT 


AACAGAATCA TCGATTGTGT 


AATTTGCTTT 


TAATTTTAAA 


3660 




ACATCACTTG AAGTTGCCCA 


AAATTCAGTT TTACCAGTAG 


TCTGATTAAC 


ATGTCCTTTA 


3720 


5 


TCAATCGCAA TGTCAATATT 


TGAAAAATGT ACTTTATCAT 


TAACATTTGT 


TCCTTGTTGT 


3780




GGAGCTGCAA CAGTATTCAC 


TGCCATGCGA TTTAAAGTTC 


TTGGTTTAAT 


AGTCGTTGTT 


3840 




TTAGGTGTAG TTGAAACATC 


TTTTGCTTGT GTTAAATTAC 


TTTTATCAGT 


TTCATTACTA 


3900


10 


TATGTAGTTG ATGATTTATC 


ATTTGTTGTT ACATTGCTAG 


TTTTTGTAGT 


AGATTGATTA 


3960 




GCTGTAGCGT TTTGTGGTGA 


TTGCATGTTA CTACTAGTTT 


CTTTAACTGT 


TGCACTATCA 


4020 


15 


CTCATTGTCA CTTTAGGCTG 


ATCTGCAGTT GCAGTTTGCG 


TATTGTCTTT 


TAGTTGACGA 


4080 


CTATCAACTT TTTTAGTTGT 


TTTATTCTCA CTTGGGGCTG 


TCGTTTCATT 


TTTTGATTGA 


4140 




TTTAATTCTC CATTCGTATG 


TTCTGCCGCT TTAGCTTCAT GACCACTTAA 


CCCAAAAATC 


4200 


20 


AATGTTGTCC CTACTAAAAT 


TGAAGCAGTA CCTACAGAAT 


ACTTTCTTAT 


CGAAAATTTG 


4260 




TTTAATCGAT TTGGTATCAT 


GCCTTTnCTA TTTGTnGCTG 


TCTTTTTATA ATTCATTTAA 


4320




TAATACTCCT TTAAAATATC 


AAAATTTGAT AAATATAA 






4358 



(2) INFORMATION FOR SEQ ID NO: 455:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1060 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 455: 



TTGACTTCTT AATTCAGCAT 


TTTCTGCACT 


TAATGCTTTG 


TTCTTTTTAA TAAGTTGCTT 


60 


TCTTGCATAA ACTTCGGTAT 


CTATTTTACT ATTACTATAC 


CTTTGATTTA 


AAACTAATAT 


120 


ACGAATTAAT GCTACAATGA 


TAATGATAAG 


TACAACATAA 


AAAGACATTT 


TTTCACCAAT 


180 


CCTTTTTGAC TTCTTTAACT 


TTGTATACAA 


TAATAATTAA TAAAGATTAA 


TTGTTATTCA 


240 


ATTTCCCACA TTTTTATTAG 


TTGATTTTAG 


TTCATCATTG 


TTATAATCAA 


ATTATAAACT 


300 


GACAGATATT GATGTTCAAT 


GAATATGACG 


TGAAAGATTC 


GTGAATTCAA 


GTTTATGTCG 


360 


AATTTATGTT ATAACGGTCA 


TTTAAATGAC 


AGAATTAGGT 


CACTCATAGT 


ATTTTGAAGA 


420 


TTGAATTCAT TAATTTTAAA 


ATGTATAATG 


ATATTTGTGA 


AAGCGCTTGC 


TTAGGAGGTG 


480 


TATTTGAGAG TGAATGAAAT 


GAATGCTAAA 


GAACAATTAG 


TGGACAATTT 


AATGAAAACA 


540 






GTTGCCATGC 


AGCTTTTCTT 


AAATGATGAA 


600 
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AAAGTTATTC CGCAATCATA TGCGTTACTA TACATAGATA AGCAAGATCA AGCAATAGCT 72 0 

AAAGAAGATT TATCACTTTC AAAAATTGCA AAAGTTTATG TGCAATATGA TGATACAACA 780 

ATAATGAGTA TTTTCGTTTA TGATGTAGTA AACGATGAAT GGATTTTTAG ATTGGATCCG 84 0 

AATATACGTA TACCTAAGAG TAACATATAC TTCCATAGTT TAAATTGGGA TGTGGATATA 900

TTAAACCGGA GtCGTCTAAT GTATGTCTAA TGCaCACCAT CAGaTCATCA TtATCCATTA 960 

TAAcGrGCAT AGTGCATAAn yACTwCAtTT TaTTaaATTG AGaGGgGCAC GATAGGTGCA 1020 

TCAGGACATA ATATAGGAAG CATCAACGCG TGAnCAGGTC 1060 



(2) INFORMATION FOR SEQ ID NO: 456: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1262 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 456:



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ATAATnACTA 


AATACnAAAG 


TTTAACTGTC 


TTACTAATAA 


TGACTATGTT 


ATAATTTTAA 


60 


AAGTGATATT 


TTGGGTAATC 


GCTATATTAT 


ATAGAGGAAA 


GTCCATGCTC 


ACACAGTCTG 


120 


AGATGATTGT 


AGTGTTCGTG 


CTTGATGAAA 


CAATAAATCA 


AGGCATTAAT 


TTGACGGCAA 


180 


TGAAATATCC 


TAAGTCTTTC 


GATATGGATA 


GAGTAATTTG 


AAAGTGCCAC 


AGTGACGTAG 


240 


CTTTTATAGA 


AATATAAAAG 


GTGGAACGCG 


GTAAACCCCT 


CGAGTGAGCA 


ATCCAAATTT 


300 


GGTAGGAGCA 


CTTGTTTAAC 


GGAATTCAAC 


GTATAAACGA 


GACACACTTC 


GCGAAATGAA 


360 


GTGGTGTAGA 


CAGATGGTTA 


TCACCTGAGT 


ACCAGTGTGA 


CTAGTGCACG 


TGATGAGTAC 


420 


GATGGAACAG 


AACATGGCTT 


ATAGAAATAT 


CACTACTAGT 


TTAGCTCTCC 


TAGATGATGG 


480 


AGAGC'i-ri-rr 


TCATGAAAAG 


AACACTTAAA 


ATTAACGCCy 


TGTCTTGaTA 


tAATGACaCT 


540 


GCcTTGTTTT 


AAAATAGTAA 


GCGGATGCgT 


TAATGTATCA 


GCGATTAAAT 


TTGTTGGAAA 


600 


TGTATAAAAA 


ACACAAGCTA 


AGAATAAAAT 


ACCTGTATAA 


AAGGAGAATC 


ATATATGTTT 


660 


CAATTACTTG 


CAGTTTGTCC 


GATGGGATTA 


GAAGCTGTTG 


TTGCTAGGGA 


AATTCAAGAA 


720 


TTAGGCTATG 


AAACAAATGT 


TGAAAATGGT 


CGTATATTTT 


TTGAaGGAGA 


CGCAAGTGCA 


780 


ATTGTAAAGG 


CAAATTTATG 


GTTGCGCACA 


GCAGACCGAA 


TCAAAaTTGT 


TGTTGGACGT 


840 


TTTAACGCAA 


CAACGTTTGA 


CGAATTATTC 


GAACAAACCA 


AAGCGCTCCC 


TTGGGAATCT 


900 


ATAATTGATA 


AAGAGGGTAA 


CTTCCCAGTT 


CAAGGTAGAA 


GCGTTAAATC 


AACACTACAT 


960 
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TATAACGAAA AAGGTTGGTT AAATGAATCA GGTGCCAAAT ACCCTGTTGA AGTTGCCATT 10 80 

TTAAAAGATA ATGTATTATT GACTATCGAC ACATCAGGTT CTGGTTTGAA CAGACGTGGT 114 0 

TATAGATTAG CACAAGGTGA AGCACCAATT AAAGAAACGT TGGCAGCAAG TTTAATCCGT 1200 

CTTGCCAAAC TGGGAAAGGT GATTACACCT TTnAATTGGT CCCATTTGCG GTTCnGGTTA 126 0 

CA 1262 
(2) INFORMATION FOR SEQ ID NO: 457:



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1142 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 457: 

CCTGGCTGCT TTATCAGCAT CTACTACTTT AAAACCGAAT ACGGATAAa^ GTTCTGATAC 60 

TGTTGATTTT CCTGAGGCGA TTCCACCTGT TAGACCAATA ACTTTCGGCA TAATTTCACT 12 0 

25 CTTTCTTTAT TTTTGACATA CTGGACAATA ATGACTATTT CTTGTCGCGA TGATTTTTGT 180 

TTCAATTTGA CTTCCACACA CTTTGCATAC CGGCTGCTTA TATACATTAA GATGCAATTG 24 0 

CATCTCACCA GTTTTTCCAT CAGCATGACG ATAATCTGAA ATACTTGTAC CGCCATATTT 3 00 

AATACCTTCT TCTAGTACTT CTCTAACATA ATAAAAAACC ATTTCTTGTT GTTGGTGTGT 36 0 

TAAGTCTTTT ACTTTTTTAT CTGGTAAAAC ACCTGCACGA AACAACGCTT CAO^.TGCGTA 420 

AATATTTCCA CAACCTGCGA TTACTTTATG ATCCAAAATC ACTTGTTTGA TTGGTTTATT 4 80 

CTTATTAGAC TGTTGATGAA TTCGATTTAA ATAATACGTC AATGCTTCAT TTGAAAAAGG 540 

TTCAGGCGCT ATTTCTAAAA ATGAAGGATA AGATGCTACA GACGCAACAT TTCTAATTTC 600 

TCCAAAACGA CGTATATCTG AATAAATTAA CTTTTTGTCA TTTGACAACT CAAAAATAAC 66 0 

ATGCCAATGC TTACGATAAT TAGGTATCAT AATATCTTCA AGTTCATCTA CAATGAAAAA 720 

ACCGCCCGCC ATACCTAAAT GACTAATTAA TGTACGTTGT TCTCGTTTAT TATCTAGCTG 78 0 

AAAAACGATA TATTTACTTC TTCGTTCTAC ATTTGTAATG GTATAGCCTT CCGATAAAGT 84 0 

TTTAAAAGTA TCTAATTCAA TTCCTTTTAT AATTGTTTCC TTGCCTTGAG CTTTACCTTC 90 0 

GATTACTTTA TCCGAAAATA TAACGTGTTC AATTTTTTGA TTTATAACGT AGGGTTCAAT 96 0 

50 TCCTCTTTTT ACATGTTCTA CTTCTGGTAA TTCGGGCATA CCATTAACCT CACTTTATTT 102 0 

TGCATCATAC CAGGTTGCAC CATAACTTGA GTCTACTTTT AATGGAACAT CTArTTGCAA 108 0 
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TT 

(2) INFORMATION FOR SEQ ID NO: 458: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1814 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



1142 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 458: 

CCTTTAGTAA ACAATCCTTC TTTAGTTTTA GTACGTTGTT CCAATCCAAA TAATTTGTAT 6 0 

TTCATTGCCT CGCCCGATTG AGTGCCGCTA AAGTTATCAT CTTTCATGTT AGGCGTGTTG 12 0 

GTAAACATGT GTATATCACT GTTTAAACGG TCTTTATAAG CTTCGGTACC TTGTACATCG 180 

TATTGCTTAT AAATATAACC ACCATCAACA GAGCCTTCTG TTTCTCTACC TTCGCTATCA 24 0 

GCATAAACAG TCGGTTCTAA AAACAACACG TTAGCTTCCT TTTGTTTTCT AACTTCTACA 300 

GGATCTAAAT TTAAATTACC TTTAATAAGT AACATAGCGT CATTTAAATC ACTCATATAG 36 0 

25 TTAGCAsymy CTGATTCAGC ATTATCATAC AAATCAATTA AAGTGATTAC TTTCTCATAA 42 0 

TCCCCTTTTC TTCTTTCGTT GTTGCTAAAT TCTGTAATAG GCATACGTTC GAAAGAGTGT 4 80 

GATTCAAAAC CGTTTTCACG TGGTGTGAGC TTCAATCCAT TTGTTCTACT GGTAAGATAT 54 0 

CTATAAACAC CGTGTGAAGT GAATAAATCA ACTGTAAACA CTTCATCTTC GTCAGTCTTG 6 00 

TCTATTGGTT TAGTTCTTAA ATATCTAACG CCTGCGATAC TATTACGTTC AATTGTATTG 66 0 

TCGTATATGA CAAAAGTACT CATTGCATCA CTCTTGTATA AACGCGTTTC ATCATCTTGG 720 

TTTCTAATCA TTAACTCATA AGCTTTGCCA TAAATTGACA AATCTAATCC TAAAGATCTA 780 

TTGTGTGACT CAACATCATT TAAATCATTG AACGCCTCAA TAACTTCTAA TACATCTTTG 84 0 

TCATCATCTT GATATTGAAT TGGATTACCC AAGAAATAGC CGTTGATAAA ATCGCTAATA 900 

TAAGATGCGT AATCATGCGC TACACGGTTA TCTGCCATGT ACTCTTCTTT GCGTCGTGTT 96 0 

AACTCAACTA AGTTCTTAGT TTTACCTTCG TAATAATCAC TTAACACTTT CAATCTAGGT 102 0 

CGTTGGTAAT CCATGTGATG TTCAATGTAT TTACTTACTT CATTAACGTT TTGTAATAAA 1080 

TCGGATTCCG TCCCGTCATA TGTGTAAACA ACATTGGCTT CATCATTAAA TAAGTAATTT 114 0 

ATGTTTCCCC GTAGATCTGT ATCTGTTTCA AATTCGTTTA CTTTTAACAT TTGTTCCCTC 120 0 

50 CTATAATCCT AGAGATTTTA TTGTGTCAAC TTTCGAACTG AGATTTGTGC GTTTtCTAAC 126 0 

CGGTCTGTAG AATCGTTCCA CTGAATAACG CAACGAATCG ATACAATGAT TGTATGTATC 13 20 
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CTCTTCAATA GTCTTGAAAC AACGTTCATC AACAATGATT TCAAATTGCA TTAAGAATTG 1440 

TAACCCTTGT ACAACCGAGC CCTTCCCTTT TTTGGTTGGT AAAATCCTTT TAAGCCCTAG 1500 

5 ATTCCTTAAT TCAGCTATAC TTTTTTGTTC TGCACTATCT GCTGTAATTT CTTCTTTAGC 1560 

ATAACCAAGT TGCTTTATGA CATTAGCTAT TTCATCATTC AGCATACCTT GTTTAACATA 1620 

CTCTTCAATG ATGTATAACT TCTTTTTCTT TACATCTATT TTAGAATGTA TAAAAGCACT 1680 

AGGATCATTA ACGTAGCCAA AGTCCAATCC AAAATAAGAA GGTAAATGTC TTAACTCATC 174 0 

TTTATTTATT AAACGTTTTT CATACTTAGG GAAAACCAAT TTGTCTAGTG TAGCAAATTC 1800 

ACCTAACGCA TAAA 1814 

(2) INFORMATION FOR SEQ ID NO: 459: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 686 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 459:

AATTnAGATT ATTACCCTCC TTTAAAAATA TTTGTTTCAC AATTTTTTAT TACCTATTTA 60 

CTGGTTTCAT GTCTTATGGG GCATTTTACT TGTTGGTATT TGAAAATGTG CAAAATTTAA 120 

30 TCTTATATGT TTCTTGGCTT TTCATGACTA TGCTATTTAT GTTTATGAAT ATGCATTCAA 180 

TTATAGATAA AAAAGTACAT ATATTCTTAA AGTCTAATAA ATAGTTACAA ATTTAGTTAG 240 

TTTTCAATTG TTAATTAGGG GTGGTAAACA GTGCTTTGTG AATCTAGACA AATTTATAAA 3 00 

AATCCTAAAT ATCGAGTTAT TAGATATAAT AATGAATATT TCATGGTCGA TTTAGTAAGT 3 60 

ACTTGGATTA CTTATTTTTT CCCTATGATT AATTGGTTTT TGCCCAAAAA ATACGCAAAA 4 20 

ATTAGCGAAA ATGAATTTGA AAGGTTAAAT ATAGTCGAGC CTGTTAAAAA TAATGTTTTT 4 80 

TGGCCGGTTG CAGGAAGTTC AGTTCTATTT GGAATTATAT TGAGAAAGTA CGGTAACTTC 54 0 

TTTAATGTTC AGTTTGAAAA ACAACTAGCA ATCACTGTAT TTTTTATCAT GTTAATAGGG 6 00 

ATGTTAATTT TTTATTTTTA TCTAAATAAA AAATTAACAT TAAAAATTTT TAATACCAAC 66 0 

GTGGGTAATA AGAATAGGAG TTGTAT 6 86 
(2) INFORMATION FOR SEQ ID NO: 460: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1300 base pairs 

(B) TYPE: nucleic acid 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 460: 

ATCTGCAATT ATGGGCACAC CCAAGCTnAT GCAAGTAACT AAAGGAGAAG TACTTTTAGA 60

CGGTGTAAAT ATTTTAGAaT TAGAAGTTGA TGAAAGAGCA AAAGCAGGAT TATTCTTGGC 120 

AATGCAATAT CCATCAGAAA TTACAGGTGT TACAAATGCT GATTTCATGC GTTCAGCAAT 180 

CAATGCGAAA CGTGAAGAAG GACAAGAAAT CAACTTAATG CAATTTATTA AGAAATTAGA 24 0 

TPAAAACKTG GATTTTCTAG ACATAGATAA AGACATGGCA CAACGTTATT TAAATGAAGG 3 00 

TTTCTCAGGT GGAGAGAAGA AACGTAACGA AATCTTACAA TTAATGATGT TAGAACCTAA 36 0 

GTTTGCaATC TTAGATGAAA TCGATTCAGG GTTAGACATC GATGCATTAA AAGTTGTATC 42 0 

TAAAGGTATT AACCAAATGC GTGGGGAAAA CTTTGGTGCA TTAATGATTA CACACTATCA 4 80 

20 ACGATTATTA AATTACATTA CTCCTGATAA AGTACATGTA ATGTATGCTG GTAAAGTCGT 540 

TAAATCTGGT GGTCCAGAAT TAGCAAAACG TCTTGAAGAA GAAGGATATG AATGGGTTAA 60 0 

AGAAGAGTTC GGTTCAGCTG AATAATCTTA TTAATACAGT ATCCATGAGA TGTTCATCTA 66 0 

25 TATATGATGA AAATGAACAT TTATACGAAA TAGTAAATTT CATCAAGTAG GAGGAAAAAG 72 0 

TTATGACAAC TGATATTTTG rACaTTyCTG AAGAACAACT TGTTGATTAT TCTAAAGCCC 78 0 

ACAATGAACC TTCTTGGATG ACAGAATTAC GTAAAAAAGC TTTGAAATTA ACAGAAACTT 84 0 

TAGAAATGCC AAAACCTGAT AAAACAAAAT TAAGAAAATG GGATTTTGAT TCTTTTAAAC 90 0 

AACACGATGT AAAAGGTGAT GTTTATCAAT CTTTATCACA ATTACCTGAG TCAGTAAGAG 96 0 

AAATTATTGA CGTAGATCAT TCTAAAAACT TAGTAATTCA ACATAATAAT ACGATTGCGT 102 0 

ACACACAAGT TGATGATAAT GCATCGAAAG ATGGCGTTAT CGTTGAAGGT TTAGCAGACG 10 80 

CTCTTATGAA CCATAGTGAT TTAGTACAAA AGTACTTTAT GAAAGATGCA GTAACAGTAG 114 0 

ATGAACATCG TATCACAGCG CTACACACGG CATTAGTTAA TGGTGGCGTA TTTGTTTATG 120 0 

TTCCTAAAAA TGTAGTTGTA GAACATCCAG TACAATACGT TGTGTTGCAC GACGACGAAA 12 60 

ATGCAAGCTT TTATAACCAT GTTATCATCG TTACTGAAGA 1300 
(2) INFORMATION FOR SEQ ID NO: 461:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3135 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
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GACAGCAAAT TCAAGATACA TTAAATAAAG ATATTGTCAT AAAGCATATT CTTGTTCGAG 60 

ATAAATCTAA AAAGAGACCG CTAAATATTA GCCAATATCA TTTAACTGAA GATGTTAATG 120 

AAATTTTAAA TGATGATTCA TTAGATATTA TCGTTGAAGT CATGGGAGGA ATTGAACCAA 180

CTGTAGATTG GTTAAGAACA GCACTTAAAA ATAAAAAACA TGTTATTACC GCAAATAAAG 24 0 

ATTTATTAGC AGTACATCTT AAACTTTTAG AAGATTTAGC AGAAGAAAAT GGTGTAGCTT 300 

TAAAGTTTGA AGCGAGTGTA GCAGGTGGTA TTCCGATCGT AAATGCCATA AATAATGGTT 360 

TGAATGCGAA TAATATTTCA AAATTTATGG GAATTTTAAA TGGTACCTCT AATTTTATTT 4 20 

TATCTAAAAT GACTAAAGAG CAAACGACAT TTGAGGAAGC ACTTGATGAA GCGAAAAGAC 4 80 

TTGGTTTTGC TGAAGCGGAT CCAACTGATG ATGTAGAAGG GGTAGATGCA GCGCGTAAAG 54 0 

TTGTCATTAC ATCATATTTA TCATTTAACC AAGTCATTAA ATTAAACGAC GTTAAACGAA 600 

GAGGAATTAG TGGCGTAACT TTAACTGATA TTAATGTAGC CGATCAACTG GGGTATAAAA 660 

TTAAATTGAT TGGTAAGGGA ATATATGAAA ATGGCAAAGT TAATGCATCG GTAGAACCAA 72 0 

CGTTAATTGA TAAAAAGCAT CAATTAGCAG CTGTAGAGGA TGAATATAAC GCGATTTATG 780 

25 TTATTGGTGA TGCCGTTGGT GACACGATGT TTTATGGAAA AGGAGCAGGC AGTTTAGCAA 84 0 

CAGGTAGTGC CGTTGTCAGT GATTTATTGA ATGTAGCATT ATTCTTTGAA TCAGATTTAC 900 

ACACATTGCC ACCACATTTT GAATTAAAGA CAGATAAAAC ACGGGAAATG ATGGATTCAG 960 

ATGCAGAAAT TAATATTAAA GAAAAATCCA ATTTCTTTGT AGTAGTGAAT CATGTCAAAG 1020 

GTTCAATTGA AAATTTTGAA AATGAGTTAA AGGCAATATT ACCATTTa^C CGATCATTAA 1080 

GAGTTGCAAA TTACGATAAT CAATCATATG CCGCTGTTAT AGTTGGATTG GAATCATCAC 114 0 

CGGAAGAATT AATCACTAAG CATGGATACG AATTGACAAA GTATACCCAG TAGAAGGAGT 1200 

TTAATTATAA TGAGAAGATG GCAAGGATTA GTAOAAQAQT TTAAAGCACA TTTACCAGTA 1260 

AATGAAAATA CACCAAAATT AACATTGAAC GAGGGAAATA CACCACTCAT TCATTGTGAA 1320 

AATATGTCTA AAATACTAGG CATAGATTTA TATGTGaAGT ATGAAOGTGC CAATCCGACA 1380 

GTTCATTTAA AGATCGCGGT ATGOTAATGG CTGTGACAAA AGCAAAAGAG CAAGGTAAGA 144 0 

AAATTGTAAT ATGCGCTTCG ACTGGAAATA CATCAGCGTC TGCAGCAGCA TATGCAGCGA 15 00 

GAGCAGGTTT AAAAGCTATC GTCGTAATAC CAGAAGGTAA AATTGCATTA GGTAAATTGT 156 0 

CGCAAGCAGT AATGTATGGT GCAGAAATCG TTTCTATTGA AGGAAACTTT GATGAAGCTT 162 0 

50 TAGAAATTGT AAAAGAAATT GCAAAAAGTG GCGAAATCGA GCTTGTAAAC TCTGTCAATC 16 80 

CATTTAGAAT CGAAGGACAA AAGACAGGCT CATTTGAAAT TGTACAACAA TTAGACGGTG 174 0 
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AAGGCTTTAA AGAATATCAT GAAGCTAAAG GATCACAATT GCCGAAAATG TTTGGCTTCC I860 

AAGCTGAAGG CGCATCACCA ATTGTTCAAA ATAAAGTCAT TAAAAATCCT GAAACGATTG 192 0 

CAACTGCTAT TCGAATTGGT AATCCTGCTA GTTGGGATAA GGCGACTAAT GCTCTTAAAG 1980

AATCAAATGG ATTAATAGAT AGTGTTACTG ATGATGAAAT TCTAGAAGCA TATCAGTTAA 2 040 

TGACAACTAA AGAAGGTGTC TTTAGTGAAC CAGCGAGTAA TGCTTCTATT GCAGGTTTAA 210 0 

TTAAATTGCA TAGACAAGGT AAATTACCTC AAGGTAAAAA AGTAGTTGCT ATTTTAACTG 2160 

GTAATGGATT AAAAGATCCT GATACTGCTA TTTCACTACT AGATAATCCG ATAAAGCCAT 2220 

TGCCAAATGA TAAAGATAGC ATTATCGATT ATATTAAAGG AGCTTTATAA CATGTCGAAT 22 8 0 

GTTTTGGAGT TAACAATTCC TGCATCAACA GCCAACCTTG GAGTTGGCTT TGATTCTATA 234 0 

GGTATGGCTT TAGATAAATT TTTGCATCTG TCTGTAAAGG AAACATCAGG GACAAAATGG 24 00 

20 GAATATATTT TCCATGATGA TGCATCTAAG CAATTGCCTA CTGACGAAAC AAACTTTATT 2460 

TATCATGTAG CACAACAAGT TGCTTCTAAA TATAGTGTTG ACTTGCCTAA TTTATGTATC 252 0 

GAAATGAGAA GTGATATTCC ATTGGCAAGA GGGTTAGGTT CGTCAGCTTC TGCTTTAGTA 25 8 0 

25 GGAGCTATAT ATATCGCAAA TTATTTTGGT GATATCCAAC TGTCTAAACA TGAGGTATTA 2 64 0 

CAATTAGCGA CTGAAATCGA AGGACATCCT GATAATGTTG CGCCGACCAT TTATGGTGGT 2700 

TTAATCGCTG GATATTATAA TGATGTCTCG AAAGAAACGT CaGTtGCACA TATCGACATA 2 76 0 

CCAGACGTGG ATGTGATTGT AACGATACCA ACTTATGAAC TAAAAACAGA AGCATCAAGA 2820

CGTGCTTTAC CACAAAAATT AACACATAGT GAAGCGGTTA AAAGTAGTGC AATTAGTAAT 2 8 80 

ACAATGATTT tGgCATTAGC ACAGCACAAT TATGAATTAG CAGGTAAACT CATGCAACAA 2 94 0 

GATGGCTTTC ATGAACCGTA TCGTCAGCAT TTAATTGCTG AATTTGATGA AGTGAAAACA 30 00 

ATTGCTAGTC AACATAATGC CTATGCAACT GTAATTAGTG GTGCTGGACC AACTATTTTA 3 06 0 

ATATTTAGTC GTAAAGAAAA TAGTGGGGAA TTGGTTCGCT CTTTAAATAG TCAGGTAGTA 3120 

TCATGCCATT CTGAA 3135 
(2) INFORMATION FOR SEQ ID NO: 462: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1209 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 462: 
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AGnTCAATAT TTAGATCAAG CCGTTTTAAG TAATTACGAA CAAGTTTATA TCATTCATGG 12 0 

TAAAGGTACA GGTX3CACTTC AAAAAGGTGT ACAACAACAT TTGAAAAAGC ATAAAAGTGT 180 

TAGTGACTTT AGAGGTGGTA TGCCAAGCGA AGGTGGATTT GGCGTTACCG TTGCAACACT 240

AAAATAAATT ATAATTTGAT AAATTAAATA GCTGCAGTTA AAATAATGTA AAGCAACAAG 3 00 

AATACATTTC AAACATGTTA TTTGAAATAA GCATAAAAAT TGAGCAAATA GAAATACATG 3 60 

AAGCATGTTA TCTGATATAA TTTGAACATC ATAATAATAA TTAAGGAGGA TTGGCATTTA 4 20 

TGGCAATCGT AAAAGTAACA GATGCAGATT TTGATTCAAA AGTAGAATCT GGTGTACAAC 4 80 

TAGTAGATTT TTGGGCAACA TGGTGTGGTC CATGTAAAAT GATCGCTCCG GTATTAGAAG 54 0 

AATTAGCAGC TGACTATGAA GGTAAAGCTG ACATTTTAAA ATTAGATGTT GATGAAAATC 6 00 

CATCAACTGC AGCTAAATAT GAAGTGATGA GTATTCCAAC ATTAATCGTC TTTAAAGACG 6 60 

GTCAACCAGT TGATAAAGTT GTTGGTTTCC AACCAAAAGA AAACTTAGCT GAAGTTTTAG 7 20 

ATAAACATTT ATAAGTTACA ACCAATGACG ACTGGGGCAT TTCTTTAATG AATTGCTCCA 780 

GTTTTTGTTT GTCTTTTTAA TATAAAAAGT TGAATGATAA GTCATCATAT TGTTTACGAC 84 0 

25 TTGAGAATGG TGGGATTAAT AAATCTATGA ACGTTAAATG ATAATCTAGC ATGCTGATAG 90 0 

ATTTGTAGCA GTTGGTTTGA TAAAACCATG TTCAATATTA CATGATG7GC ATGAAAAGTC 960 

ATACTCGAAG ATGTTGATTA TTAACTAGAA TTAGTGGTGA TAAATTTGAA GCACTTTTGT 1020 

AGCATCATTC ATTTTAAAAT TAGAAGGGGG GATATTTTTG GAAGACTATA AGCAACGAAT 108 0 

TAAAAATAAA TTAAATGTCG TACCTATGGA ACCAGGATGC TATTTAATGA AAGATCGTAA 114 0 

TGATCAAGTG ATATATGTTG GCAAAGCTAA AAAGCTAAGA AATCGATTGC GATCATATTT 1200 

TCACGGGTG 1209

(2) INFORMATION FOR SEQ ID NO: 463:

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 2410 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 463:

AGTTCAACAC GACGAATTTT ACCTGAGTTT GTTTTTGGTA AGTCGTCAAC GAATTCAATC 6 0 

50 TCTCTCGGAT ATTTATATGG TGCAACTTCA TTTTTAACAA ATTGTTGTAG TTCTTTAACT 12 0 

AACGTATCAT CACCCGCAGT ATGGTCCTGT AAAATAACGA ATGCTTTAAC AATATTTCCT 18 0 
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GCATCTTCAA CTTCAAAAGG CCCAATCGTA TAGCCTGAAC TAATAATAAT GTCATCTCGA 300 

CGTCCTTCAA ACCAGAAATA ACCATCATCA TCTACATGAG CTAAGTCACC AGTGATGTAG 360 

TATTtACCTG TTTGCGCTTT CGCCGTACGT tCTGGCTCTT tATAATACCC TTTGAAAAGT 420 

GCTGGCAAAT CAAGTGGTAC TGCAATATTC CCTTTCGTAT TAGCAGGTAC GCTATTCCCC 4 80 

TCATCATCTA CTACAGTGAC CGAACTACCC GGAATGCCTT TACCCATTGA TCCAATCCTC 54 0 

TGTGGTGTAT CTTTTAAAAA GCCTATAAGC AAGGTACTTT CAGTCTGGCC ATATCCATCT 600 

CTTACAGTTA AATTAAAGTA TTTCTTGAAT TGTTCAACTA CTTCTCGATT TAGTGGCTCA 660 

CCTGCAGAAA CGGCACTATG TAAATGCGTT AAGTCATAAT CATTTAAGTT CTGTAATTTA 720 

GCCATCATAC GATATTCTGT CGGTGTACAA CATAAAACAT TAATTTGATA TTTTTGAAGC 78 0 

AATTCTAAGT ATGTTTCAGG ACTGAACCTT CCATTAAATA CAAAAGCAGT TGCACCTGAA 84 0 

CCTAATACAG ATAAGAAAGG ACTCCATACC CATTTTTGCC AACCTGGTGc TGCTGTTGCC 900 

CAAACTAAGT CATCTTCATT aATACATaAC CAATGTTTTG GTGCCATTTG TaAATGTGcA 96 0 

AATCCCCaTC CATGACAATG TGTAACGGCT TTAGGATTGC CAGTTGTACC AGATGTATAT 102 0 

25 GACAGAATCG CCATATCATC ACGCGTCGTA TCTGCCATTT CTAGTTTGTT ACTTGCGTTT 1080 

TCTTTTTCAG CTTCAAGTGA AATCCATCCA TCTTTTTGAC CGGCAATAAC AAATTTAGTT 114 0 

AACGCATCAT ATTCTTTAAT TTTTTCAAAT TCAACTGTGA ATGGCTCTAG TGCAATAACT 1200 

GCATTAATTT CACCATGTGT GATACGGTAT TGTAAATCTT TAGTTCTTAG CATTTCAGAA 126 0 

CATGGAATGA TTGCAACACC TAATTTTAAA GCAGCAATAT ATAATTCATA CGTCGCAATA 1320 

GATCGTGGCA TCATAATGAG TACTTTATCG CCTTTAGATA AACCGTGCGA TGCTAAAACA 1380 

TTACCTACTT TATTAGACTG TTCAATGAGC TGTTGGTAAG TGACTGATAT ATCTTCGCCT 144 0 

TCAGTATTAT GATATAAAAT TGCCTTTTTA TCTGGTATGT GGCTATATTT TTCGATTTCC 1500 

GAAATAATGT TATATTTTTC AGGCGCGAAT AGAGCTGACT TTTGCATAAC TAACTTCCTT 1560 

TCATACATCC ACTTTTCCTG TGATGAACAT TGTAATTTTA TAAATGAATT ATATACATCA 1620 

TACGCCTATC TTTACAGAAT TTTCAATTAA ATAGGGTTAA ATACCAAAGT CCTCGACACT 1680 

45 ACACTTTGAC ATGACGTAGC ATTCAAGGAC TTTCAAATGA TTGAGGGTTG ATATCTCGGG 174 0 

CTAGACCATA TCAGCTAATT CAATACGAAT ATTGTATGAT AATTCACGAT TAATTATTTT 18 00 

TACATCTGCA CCTTTCGAAG TGCCACGATG CTTGTGTGTA TGCTTGTACT CAGCTGAATT 186 0 

TTGCCAATGA TAAAATGCTT GCCTATTTTC CCACAGCGTA ATAATGATAT AGTGTCTACC 1920

AGCTGTTCTA GGTCTTAAAA ACCTTAATGC TTTAAATCCA TCAACGTTTT TTAAATGCTT 198 0 
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ATTTPJiCACA CATAATGAAT CATTTGATAA ATCATTTATC GCTTCTAGCA GATCGTAATA 2100 

TGCAGTGTCA TTATTTTTTT GTATTGTGAG ACAATCATCC AGTTCTTCTA TTACATAACT 2160 

TCTATATTCA TCATAAATTT TCATAATAAA TGCCTTCATT TCATTTATAT TTTTGGTCAT 2220 

ATTACTkTAT ATCTATTACT AAtkCATTCC CGTATTTATT AATTACAATC ATAGTTTGGC 2280 

TyCTTTTTAA AAGATAAGAC TTTGTAAAAA GTATTAATAT TTCATGCAAA TGGGGGACAG 234 0 

GAGTCGCCCA CTATTTTTGT GTCTTCAATT TCATGATCAT TATTTAACAT TAGTCATGAA 2400 

AATAGCCGAC 2410 
(2) INFORMATION FOR SEQ ID NO: 464: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 590 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 464: 

TTTATTAATT GTAAAAAATT GAGTAAATTA TCTTTACATT CTAAATTAGT CTTAACTACA 60 

ACTAGTATCC TAATAATTAT AGGAGCTATT ACATTCTTTT TATTAGAACA GTTTAATACT 120 

ATGCAACATA TGGGACTAGT TGAAAAAATC GGAAATTCTT TTTTCCAATC AGTAACAACA 180

CGAACAGCGG GTTTTAACAG TATAGATATA GCAAGCATTA rCAAATCTAC CGCATtAaTG 24 0 

TTAATGCTAC TTATGTTTAT TGGTGGTGCC CCTCTCAGTG CAGCTGGAGG AATTAAAATA 3 00 

ACTACTTTTG CAGTTGCGTT TATTTTTGTA CTAAATTATA kACGTAAAGA AAATAATGTT 360 

TCAGTATTCA ATAAAGAAAT ATCTGACAAA CATATAAAAC TATCTATTGT TACCATTAAT 420 

ATCTCATTTC TATTTATCAG CATCATTACT TTTATATTAT CGATAATTAA TCCGAACATA 480 

TCATTAATCA AGTTATTATT CGAAGTGGTT TCTGCATTCG GAACAGTAGG GTTAAGTATG 54 0 

AACCTTACCA CAGAATATCA TGGTATTACT AAAATAATTA TTATATTCGT 590 
(2) INFORMATION FOR SEQ ID NO: 465: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 905 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 465:
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TATTCGATTT 


GAnTCTTTTA 


AAATTATGTT 


TAAAAACATG 


TCTAATGATT 


CCGCATCATT 


120 


TTTTGCACCA 


TCAATAAGCG 


TTTCAGcAAA 


CCCCTTAATT 


GAAGTAATAG 


GTGTTTTTAA 


180


TTCATGTGAA 


ACATTTGCTA 


CAAATTCACG 


TCTTAGATTT 


TCAAGTTGTT 


TCAGATTTGT 


240 


TATATCATGC 


ATCACaACTA 


AAATCCCtTG 


CAAACTTTTT 


TGAGACCTAG 


TTAAAATCGG 


300 


AACGCATGAA 


ATATCAAAGT 


ACTTGGCATG 


GACTTGGTTT 


ATTGCAACTT 


CCAATTGTTC 


360 


ATAAATAGGT 


TTTTCAACTT 


TAAAACTTTC 


TAAAATTAAT 


TGCTCAATTT 


CAGTATTAAC 


420 


ATAGCCGTGA 


TAGCCTACTT 


GTTCAATATT 


ATGCGAGATG 


TTGAACTGTT 


CATAATACGC 


480 


TTTATTTGCA 


ACAACGATTT 


TTCCATTTCG 


ATCTATCATT 


AAAATAGCAC 


TTGGAATATT 


540 


TTCAATCGTT 


GTTTTTAAAC 


GGTTGGATTG 


AATTTTTTGC 


TCATTATTAA 


GCTTTTGAAG 


600 


GCGTCGTGCT 


AAATCATTGG 


TAGACACAAA 


AAGCGCTTTA 


GTTTCTACAA 


CATTACTTTC 


660 


AGGTACACGT 


ATGTGATAAT 


AACCATTTGC 


CAACAATTGT 


GTTGCATAAG 


TAACTTCTTG 


720 


AATGGGACGG 


ATTAATGTAC 


GCTTAAAACT 


ACGGCTTGCA 


AAATACAGAC 


AAATGAGTAC 


780 


AACTAAACAT 


GTCAAAATAA 


GATATTTCCA 


CAACGTCCAA 


TGCATTTCTG 


TAATATCGTT 


840 


ATTGTAACCT 


TTAATCCATA 


CATGATAACC 


GTTAACCTTC 


TTATtAAAAA 


TAAAAACGTC 


900 



CCTTT 905 
(2) INFORMATION FOR SEQ ID NO: 466:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1016 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 466: 
TTTGGTTAGC CCATAAAAGA AAAAAACAAG TAGTCATTTT TAAACAACAT ATCAAGTCTA 60

CCCAAGAAAT ACGTTTTGAC AAAGCGAAAG TGCTTGAACA CAAAGATGAA ATAGCAAATT 120

TTATTTCTTT CGAACCACAA AGTTTTGAAT TTTATTATTT TACAGAATCG GAATTTTCAG 180

AAGAACAATT AAATGAAGTT TCGCCAATTA GAATTAAATT CAATGTTATA AGACACACAA 240

AAGATTTGaT AAAGCATATG CCGAATATAT TTTTGGcTAG ACTTATTTCA GAAGATAATG 300

ATAAAAAGAC ATATATGTTC TATAAACGCA AAGTATTAAC CGATAACTTT TTAGATAAAT 360

ATATGCAGAA ATTTTCACCG GCAACATACA CAATAATATT TGTAAATGTC TTAATATGGT 420

TATGTATGAT TTTATATTTA AATAATTTTT CGGATGTAAA ATTATTAGAT GTTGGCGGGT 480
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ATTTTAGTTT TGAACATATA CTTATGAATA TGCTTTCATT ATTTATTTTT GGTAAAATAG 600

TCGAAGCAAT TATTGGTTCA TGGCGGATGT TAACTGTATA CTTTATTGCA GGGTTGTTTG 660

GAAACTTTGT ATCACTATCA TTTAATACGA CTACAATTTC AGTTGGGGCT AGTGGTGCTA 720

TATTTGGTCT GATTGGATCA ATTTTTGCOA TGATGTATGT TTCAAAAACA TTTAACAAAA 780

AAATGTTAGG ACAGTTATTA ATIXiCATTAG TGATATTAGT TGGTGTTTCT CTGTTTATGT 840

CAAATATAAA TATTGTGGCG CATATTGGAG GATTCATTGG TGGTTTATTA ATAACTTTAA 900

TTGGCTATTA CTATAAAGTG AATCGtAATA TmTTGGaT TTaCCAATTG GTATGCtTGt 960

tATATTTAwT GCACyTCmAA TTAgAtTTTT ACmATTAAAG AAGATAATAw TTATAA 1016
(2) INFORMATION FOR SEQ ID NO: 467: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 406 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 467:

AACTTTAAAT TTAGACATCT TTAAAACCTC TCTTAAACCA TGCCTATATC TCAAGATGAT 60

ATTTCAAATG AACAATACTA TTGCTTGAGA CCATTAATGA ATGATCATAA ATATTTCTTT 120

CTATAAAATT AGCTTTCCAA TAACTGTGTT GTTGCATAAT ATCATTCACA AGTACACCAT 180

TTTCGGAAGT ATGATTATCT TTATCTATAC TTAAAACAAT TTGTTTAGTT TTAGCATGGC 240

TAAATTGTTG AAGACCCTTA CACGATAAAC GTATAGCGTC TGAATTCTCA TTTAACAATG 300

CGGCTGGgCA AACAAATGAC ACATTGTACT TCATGTTTGA ACtTCGTTAC AATCaTCGTG 360

kCatTTTGat AAATAACAAT CCCTCGTAAT kGATTAAGTA TATTAT 406

(2) INFORMATION FOR SEQ ID NO: 468: 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1378 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 






(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 468: 
AGATAATACA CTTGAAGTTG GAATGGTTTG TGACGGTTAT TTAATGCGAA TTGAAAACTT 
AACACCATCA AATTTCTTCA ACTCAGCAAG TGAAGATACG ATTACTAAAA TTAAATTAAA 
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AGGTACAGCG TTAAAACTAA GAGAAGCCAT CAATTATGAT GAAATGGTTA TTGTAGATAG 24 0 

TATGACGTAG TTCCTAATTA TGCkAAAAGG GATTGATGAA AAACTGAAGG GCTTTTCATC 3 00 

AATCCCTTTT ATTTTAGGGG AATTGAATAG ATAGTTTTAA ACTATACGAA TTATTAATAT 360 

TTGAGATTTA ATTGAAATAA GTTTTAAAAA TTGGAGGAGA TAGATTAAGC GAAGTCATTT 420 

AAAGGTGAAG TTAAGTGTAT TCACAAAAAn TAGCCACACT CATATGACAT CGGATGAGTG 480 

TGGCTTAAGG ATCTATGGGG GGAGGAAnCC ATAGATGTTT ACTTTGATAG GCCAGATTAA 54 0 

ATATCAAAGT ATGCGATTAT TTATAGCTTG ATGCAAAAGT GGTATGCCTA TTAAAAGTTA 600 

CTGCACATAG CTTTTAATAT TCCGTTCAAA GGAAAGGGGC ATACAATTGA ACAATCTGTA 66 0 

ATAGTACTTT TAACCAGCTA TGCTAAAAGT CTAGTAGGGA GAACAGTTGT CCAATCACAT 72 0 

AAGAACCTCT AACTTCGTTA GTACGATTAA GAAAAGCTTT TTAGTTAGTA TGTAATACAA 7 80 

TTTATTGACG CGCGTGAATC TCTTTTATAA GAGTGTGTAG GGAATGGCGT TGTATAAATT 84 0 

GTATTAGAAG AACTTCTAAC GCATCTCTGT GGTTAAAAGA GATGAAGGGA ACGACAGTTT 90 0 

aATTAAAACT GCATAAGAAC TTCTAGCTTT TCTCTCTCGT TCAAAGAGAA GCAGcTGTTC 96 0 

GCAGTTTAAT CAAAACCACA TAAAGCTTTT AACTTTACTC TTTGATTTAA AGAGTGATAA 1020

ATGTTTACAG TTTAATTAAA ACTGCATAAG AACTTCTAGC TTTTCTCTTT CGTTCAAgAG 10 80 

AAGCAGCTGT TCGCAgTTTA ATCAAAACCA CATAAAGCTT TTAACTTTAC TCTTTGATTT 114 0 

AAAGAGTGAC AAATGTTTAC AGTTTAATTA AAACTGCATA AGAACTTCTA GCTTTTCTCT 1200

TTCGTTCAAA GAGAAGTTCT AATACCACCA TATCGTGCGA TCGGGAACGG TATATATATT 12 6 0 

AATAGGAGGG TAATATATAT TTAACGCACG ATATGGGACT ATTAGCCTTC GACTTTGTTA 1320 


TGTTGATGTG TGGCCTAAAA TATTGGAGAT ACCAATATTT TAGGTTGCAT CAACATCA 1378

(2) INFORMATION FOR SEQ ID NO: 469:

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 4171 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 









(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 469: 

TCCCAACCAA TAATCGTGGC AAAAATACGG ATATTGGTAT GGCTTAACAA ATTGCAAATA 6 0 

TCGTTTAATC ATACATCCCC CCTAATCTAT TGCCCTATCC TATTCaTAAG CATAAAAATG 120 

AATAGAGGTT GGATACATAA TTTGTAGATG TAAATTCTTC TTACAATTTA CATTTTTAAA 180 
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TTATATTCTA TTCAATTTAA TCTATGGATA CTGTGTCCCC ACACGACAGC AAAAGTTATC 
ATACTTCTTT ACATCACTAA GTCAATATAA ATGATTTAAT CAGTATTTAC ACTTTATTTG 
CTTAATACTG TCTAATTrrr TrGTAACGTT CTTTCCAAAC TTTGATAAAA TCTGGCGCGA 
ATGGGCCCTT CTTCTGrrCT ATCCATTGTT GAAGAATGTC CACGTTGCGT CTTAAAATAA 
TATCAATATC ATGCGGATAA TTCATTTGAT TCATATGTTG CTCATATTCA TCTTCATCTA 
ATAAATGATA CTTTCCGTIT CGATATACTT TAATATCTAA ATCATAGTCT ATATATTTTA 
ATGCCTCTTC ATCACAAACA AATGGTCATC ACAAATTGCA ATAGTAATAA ATTCCATCTT 
CTCTAAACAT GCAGATAACA TTaAACCAAT ATTCTGAGTG AAAGTaAACA ATTGCCGGTT 
CACGTGTTAT CCAAGTTCTr CCGTCACTTT CAGTCACTAA CGTATGATCA TTTCCACCAA 
TGACAACATG ATCAGTACCC TTTAATArro TTOTTTCAGA CCAAACGCGA TGAATCTTAC 
CATCATGTTT ATAACTCTGA ATTTTAATGT TTrCCCClTC TTTAGGTATG GATTCTCTGA 
CCATACTCCA CACCACCTTC TGTTAATTTA ACCATTATAA ATTATAGCAT ATTTCAaW. 
TAGTATTATA TAAATACATA TrTTTACGAA ATAAGATTTT ACTACTTAAT AATTAAACTC 
GGTAATATTG CTAAGTACTA CAACAGAGAT TTACATGTCC CATTTAAAGT ATATAAAATC 
ATCACTTTTA TATATCAACA CTTTAACTTT TTGACATTGT TATTCTATGA GAriTAAAGA 
TATCATTTAT ACTTTTTAAA ATTAATGTCA CTATGTTTTC CGATAATATT ACCAATCATC 
GAATCTTACC CATTTATAAA TTGATAAATs TTTGACATAG GTACAGGGAA TGTATATTGA 
TCTCGATCAC TTAAATCAAA CCAAATCATG TCATCTGGTA ATGTTTCAAT GTTAATTGCT 
CCTGAAACGG CGTATACTTT AATCTTCCAT GTTAAATGAG TAAATTGATG CTTCAACTCA 
AAAATAGGTG TTrCTACTGG TTGAATGTCA TGACCGATTT TTTCAGTCAT TTTACGTCTA 
GCATGCTCAC TTTCAAACAT AGGAAATTGC CACATACCAT GCAATAATTT TTCGCTACGC 
rriTGCAACA GATATTGACC rTGATTATTT CTAATTAAAA AGACGGATTG CTCAATTACT 
TTrTTACTTA CATTTTTAGA TTTAACAGGT AACTTTTCAA ATGTACCTTT ATCAAATGCC 
TCACAGTrrr crrGAACTGG ACAAAATAAG CATAATGGAT TTTTTGGTGT ACAAATTAAC 
GCCCCTAATT CCATCATAGC rTGATTAAAC GTTCCAGCTT CTGTAGTAAC ATACGGTAAT 
AATTCTTGTT CGTACGATTT- CCTCGTCGAT rGTAATTT^^. TATCTCGATA GTCATCATTC 
AATCTAGACC ATACTCGAAA AACATTTCCG TCTACAGTTG CTAGTGGTAC ATTATATGCA 
ATGCTCATTA CTGCAGCTTG TGTGTATGGG CCAACACCTT TTAACGCTTT AAATTGATCA 
GGATCXrrGG GAACTAAGCC rrCATATTTA TCATGAACTT CTTTAATCGC CGTATGAAAA 
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GCTTGaCTCA AAACTTCCAC AGTTGGAAAT CGTTCAACAA AACGATGATA ATAGTCAATA 2100 

ACTGTTTTAA CTTGTGTCTG TTGTAACATG ACCTCACTTA ACCAAATATA GTACGGATTG 216 0 


GTCGTTTGTC GCCATGGCAT TTCTCTTTGA TTTTCATCAA ACCAGTGTAT CAAATTTTCT 222 0 

TTAAAACTAG ACTGCTGATA CATTTATAAA ACCCTTTCCT CACCAAAATT AATTGTCTTT 2 280 

ACTCATAATG TTTTTATTGT ACATTAAAAT CATGGTTAGT ATGTAAGTTA ATTTAGTTAT 2 34 0 


TTGCGAAATT GGATTATAAT AGTATATATA ATATTATGAA ATGAGTGAAC TGATATGGAC 24 00 

ACTGCAACAC ATATCGCAAT TGGGGTGGGC CTTACAGCAC TTGCAACTCA AGATCCAGCA 24 6 0 

ATGGCTTCTA CGTTTGGTGC AACAGCTACA ACCCTTATCG TTGGTTCATT AATTCCTGAT 2 520 


GGGGATACTG TTCTTAAATT AAAGGACAAT GCAACATATA TTTCGCATCA TAGAGGTATC 2580 

ACGCATTCCA TCCCTTTCAC AATACTATGG CCAATTTTAA TTACATTTTT AATATTCACG 264 0 

TTCTTTAGTG GAACCAACCC ATTTCATGTA TGGATGTGGG CTCAGCTCGC AGTATTTTTA 2700

CATGTCTTTG TAGATATATT CAATTCTTAT GGTACACAAG CGCTTAGACC TATCACAAAC 2760 

AAATGGATTC AATTAAGTGT GATTAACACA TTTGACCCTA TTATTTTCAC AGTTCTTTGT 2820

ATTGGTATTG TATTATGGGT TATAGGCTTG CATCCATTTG CAGTCTTCTT TCCTATAATC 2880

GCTTTACTAA TCATTTATTA CATGATTCGT TTTAAAATGA GAGCCGTAAT TAAGCAACAA 2 94 0 

GCTTTAAAAG CAATTCAACA AGAGCATCAC CCTGTTAAAG TATTTGTTGC GCCAACAATA 3000 

AAATTTATGG AATGGCGTGT CGCGATACAA ACTGATGCAC ATGACTATGT TGGAAAAGCA 30 60 

TATGGTAGGA ATGTGGTGTT TAGTGATAAA GTGGAACGTC AAACATTATC AACAGACTCC 312 0 

ATTTTATGGA AAGTCAAAGG TAATAAAGAT ATACGTACTT TTTTAAACTT TTCATCAATC 318 0 


TATCGTTGGC AAACAACAAC GTTAGCAGAT GGTTCTACTG AAATTCGTTT GATTGATTTG 324 0 

CGTTATTTAA AAAATGATCA TTATTCATTT GTGGCAATTG CACATGTAAC AAACGATAAT 3 3 00 

GTCATAGACC ACTCTTATAT TGGCTGGGTA TTTACAGAAG ATAAGTTACA ACGTAAACTG 336 0 


TATGCTAAAT AATTTCAAGT TATTATTCAC TAAAGTTAAT CTATAAAAAA TGAACAACCG 342 0 

GGCAGAATGA AAATCAAAAC GATTTTTACT CTGTCCGGTT TTTTAATGTA AAACTATGAA 348 0 

TGCTTTTACA APJ>JTCTAAAA TTTATATTGT TGCTAACAAA CTACCTTTAA TGACTCGAAA 3 54 0 


TATCAAAATC AGTATAGGAA AACAATATCT AGATGATATT CTAATTGTTT CTGATTCTCA 360 0 

CAGATTAATT TACACAACAG GTCAGCTAAA CATCATGAAG AAGTATCCGC CTCGTCTGTA 3 66 0 

CTATCATTTG AAACATCCTG TTGATTATCa GTTTGTGTCA CTTTAGATGT TTTATTaTAA 3720

ATTGCGTGTG TCGTATACCT TGCTAATACG AAATTTACTG CAGCTATTAA ACATAAGAAT 37 80 

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TGTGCTATAC CATTAACAAT GTAATACATT GGATTTAGCA TTAGGATGTG ATTGATAAAT 3 900 

ACATGATTTG GATTTGGTAT GAAAATAATT GGTAACAATA AGAAACACAA TACACAAACC 3 960 

CCATAAAATA TGATATTTAT TTTTTCAGwT AACAGTCGAA TAAGACCAAA AGTAACGGAT 4 020 

ATTAATCCTA CAAAAATAGT TGCCATCACA ATAAAATAGA ATAGCGCTAT ATATGATGTT 4 08 0 

TCGAAGTTTA CTGGTTTAAC CAATGCACTA ATCATCGTCA AAATGACTAG CATAATAAAA 414 0 

CTTAAAATAG ACATAATAAC TACTGGCGTC G 4171 

(2) INFORMATION FOR SEQ ID NO: 470: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 9821 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 470:

TGGTTGAAGT AGCAGTTAAT TCAAAATCTG CAACAGTTTC AGCAGAATaG GGGCTTTCAA 60 

AATAAtCAAA GGAGAATAAT TTATGACTAA AACTTTAAAG GTTTATAAAG GAGACGACGT 120

CGTAGCTTCT GAACAAGGTG AAGGCAAAGT GTCAGTAACT TTATCTAATT TAGAAGCGGA 180 

TACAACTTAT CCAAAAGGTA CTTACCAAGT GGCATGGGAA GAAAATGGTA AAGAATCTAG 24 0 

TAAAGTTGAT GTACCTCAAT TCAAAACCAA TCCAATTCTA GTCTCAGGCG TATCATTTAC 300 

ACCAGAAACT AAATCAATTA TGGTAAATAC CGATGACAAT GTTGAGCC^ ACATTGCACC 360 

AAGCACAGCA ACGAATAAAA TATTGAAATA TACAAGTGAA CATCCAGAAT TTGTTACTGT 420 

AGATGAAAAT ACAGGAGCAA TTCACGGTGT AGCTGAAGGT ACTTCAGTAA TCACTGCTAC 480 

GTCTACTGAT GGAAGCGATA AGTCAGGACA AATTTCAGTG ACAGTAACAA ACGGATAGGG 540 

ATTTAAGGCG CAGTATATCT GCGTCTTTTT TATTTGAATA AAAGGAGCTA ATACAATGAT 600 

TAAATTTGAA ATTAAAGATC GTAAAACAGG AAAAACAGAG AGCTATACAA AAGAAGATGT 6 60 

AACAATGGGC GAACAGAAAA ATGCTATGAG TATTTAGAAT TAGTAAATCA AGAGAATAAA 7 20 

AAAGAAGCAC CTAACGCAAC AAAAATGAGA CAAAAAGAGC GACAGTTATT AGTAGATTTA 7 80 

TTTAAAGATG AAGGATTGAC TGAAGAAGAT GTTCTGAACA AGATGAGTAC TAAAACTTAT 84 0 

ACAAAAGCCT TACAAGATAT ATTTCGAGAA ATCAATGGTG AAGATGAAGA AGATTCAGAA 90 0 

ACTGAACCAG AAGAGATGGG AAAGACAGAA GAACAATCTC AATAAAAGAC ATITTATCGA 960 

ACATTAAGAA AATACAACGT TTCTGTATGG AGCAGTATGG GTGGACATTA ACTGAAGTCA 102 0 
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AAGAAAAACA AAGTGAACAA AAAGTCATTA CAGGTACGGA TTTAAGAAAA CTTTTTGGAA 114 0 

GCTAGAAAGG AGGTTAATAT GAATGAAAAA GTAGAAGGCA TGACCTTGGA GCTGAAATTA 12 00 

GACCATTTAG GTGTCCAAGA AGGCATGAAA GGTTTAAAGC GACAATTAGG TGTTGTTAAT 1260 

AGTGAAATGA AAGCTAATCT GTCAGCATTT GATAAGTCTG AAAAATCAAT GGAAAAATAT 132 0 

CAGGCGAGAA TTAAGGGGTT AAATGATAGG CTTAAAGTTC AAAAAAAGAT GTATTCTCAA 13 80 

GTAGAAGATG AGCTTAAACA AGTTAACGCT AATTACCAAA AAGCTAAATC CAGTGTAAAA 14 4 0 

GATGTTGAGA AAGCATATTT AAAGTTAGTA GAAGCCAATA AAAAAGAAAA ATTAGCTCTT 15 0 0 

GATAAATCTA AAGAAGCCTT AAAATCATCG AATACAGAAC TTAAAAAAGC TGAAAATCAA 156 0 

TATAAACGTA CAAATCAACG TAAACAAGAT GCGTATCAAA AACTTAAACA GTTGAGAGAT 162 0 

GCAGAACAAA AGCTTaAGAA TAGTAACCAA GCTACTACTG CACAACTAAA AAGAGCAAGT 16 8 0 

GACGCrtACA GAAGCAGTCC GCTAAGCATA AAGCACTTGT TGAACAATAT AAACAAGAAG 174 0 

GCAATCAAGT TCAAAAACTA AAAGTGCAAA ATGACAATCT TTCAAAATCA AATGATAAAA 1800 

TTGAAAGTTC TTACGCTAAA ACTAATACTA AATTAAAGCA AACAGAAAAA GAATTTAATG 1860

ATTTAAACAA TACTATTAAG AATCATAGCG CTAATGTCGC AAAAGCTGAA ACAGCTGTTA 1920

ATAAAGAAAA AGCTGCTTTA AATAATTTGG AGCGTTCAAT AGATAAAGCT TCATCCGAAA 1980 

TGAAGACTTT TAACAAAGAA CAAATGATAG CTCAAAGTCA TTTCGGTAAA CTTGCAAGTC 2 04 0 

AAGCGGATGT CATGTCAAAG AAATTTAGTT CTATTGGAGA CAAAATGACT TCCCTGGGAC 2100 

GTACAATGAC GATGGGCGTA TCTACACCGA TTACTTTAGG TTTAGGTGCA GCATTAAAAA 2160 

CGAGTGCAGA CTTTGAAGGG CAAATGTCTC GAGTTGGAGC GATTGCACAA GCAAGCAGTA 2220 

AAGACTTAAA AAGCATGTCT AATCAAGCGG TTGACTTAGG AGCTAAAACA AGTAAAAGTG 22 80 

CTAACGaAGT TGCTAAAGGT ATGGAAGAAT TGGCAGCTTT AGGCTTTAAT GCCAAACAAA 2340 

CAATGGAGGC TATGCCAGGT GTTATCAGCG CAgcAGsiAGC AAGTGGTGCA GAAATGGCTA 24 0 0 

CAACTGCAAC TGTAATGGCT TCAGCGATTA ACTCTTTCGG TTTAAAAGCA TCTGATGCAA 24 60 

ATCATGTTGC TGATTTACTT GCGAGATCAG CAAATGATAG TGCTGCAGAT ATTCAATATA 2520 

TGGGAGATGC ATTAAAATAT GCAGGTACTC CAGCAAAAGC ATTAGGAGTT TCAATAGAGG 2 58 0 

ACACTTCTGC AGCAATTGAA GTTTTATCTA ACTCAGGTTT AGAGGGGTCT CAAGCAGGTA 2 64 0 

CTGCATTAAG AGCTTCGTTT ATTAGGCTAG CTAATCCAAG TAAAAGTACA GCTAAGGAAA 2 700 

TGAAAAAATT AGGTATTCAT TTGTCTGATG CTAAAGGTGA GTTTGTTGGA ATGGGCGAAT 2760

TGATTAGACA GTTCCAAGAT AACATGAAAG GCATGACGAG AGAACAAAAA TTAGCAACAG 2820
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CAGATAAAAT TAATAGCTAT AGCAAATCAT TGAAGAACTC TAATGGTGAA AGTAAAAAAG 2940

CAGCTGATTT GATGAAAGAT AACCTCAAAG GTGCTCTGGA ACAATTAGGT GGCGCTTTTG 3000

AATCGTTAGC AATTGAAGTT GGTAAAGATT TAACGCCTAT GATTAGAGCA GGTGCGGAAG 3060

GATTAACAAA ATTAGTTGAT GGATTTACAC ATCTTCCTGG TTGGGTTAGA AAGGCTTCGG 3120

TAGGCTTAGC AATTTTTGGT GCATCTATTG GTCCTGCTGT 7CTTGCTGGT GGCTTATTAA 3180

TACGTGCAGT TGGGAGCGCG GCTAAAGGCT ATGCATCATT AAATAGACGC ATTGCTGAAA 3240

ATACAATTCT TTCTAATACC AATTCAAAAG CAATGAAATC TTTAGGTCTT CAAACATTAT 3300

TTCTTGGTTC TACAACAGGA AAAACGTCAA AAGGCTTTAA AGGATTAGCC GGAGCTATGT 3360

TGTTTAATTT AAAACCTATA AATGTTTTGA AAAATTCTGC AAAGCTAGCA ATTTTACCGT 3420

TCAAACTTTT GAAAAACGGT TTAGGATTAG CCGCAAAATC CTTATTTGCA GTAAGTGGAG 3480

GCGCAAGATT TGCTGGTGTA GCCTTAAAGT TTTTAACAGG ACCTATAGGT GCTACAATAA 3540

CTGCTATTAC AATTGCATAT AAAGTTTTTA AAACCGCATA 7GATCGTGTG aAATGGTTCA 3600

GAAACGGTAT TAACGGTTTA GGAGAAACTA TAAAGTTTTT TGGTGGCAAA ATTATTGGCG 3660

GTGCTGTTAG GAAGCTAGGA GAGTTTAAAA ATTATCTTGG AAGTATAGGC AAAAGCTTGA 3720

AAGAAAAGTT TTCAAAGGAT ATGAAAGATG GTTATAAATC TTTGAGTGAC GATGACCTTC 3780

TGAAAGTAGG AGTCAACAAG TTTAAAGGAT TTATGCAAAC CATGGGCACA GCTTCTAAAA 3840

AAGCATCTGA TACTGTAAAA GTGTTGGGGA AAGGTGTTTC AAAAGAAACA GAAAAAGCTT 3900

TAGAAAAATA CGTACACTAT TCTGAAGAGA ACAACAGAAT CATGGAAAAA GTACGTTTAA 3960

ACTCGGGTCA AATAACAGAA GACAAAGCAA AAAAACTTTT GAAAATTGAA GCGGATTTAT 4020

CTAATAACCT TATAGCTGAA ATAGAAAAAA GAAATAAAAA GGAACTCGAA AAAACTCAAG 4080

AACTTATTGA TAAGTATAGT GCATTCGATG AACAAGAAAA GCAAAACATT TTAACTAGAA 4140

CTAAAGAAAA AAATGACTTG CGAATTAAAA AAGAGCAAGA ACTCAATCAG AAAATCAAAG 4200

AATTGAAAGA AAAAGCTTTA AGTGATGGTC AGATTTCAGA AAATGAAAGA AAAGAAATTG 4260

AAAAGCTTGA AAATCAAAGA CGTGACATCA CTGTTAAAGA ATTGAGTAAG ACTGAAAAAG 4320

AGCAAGAGCG TATTTTAGTA AGAATGCAAA GAAACAGAAA TGCTTATTCA ATAGACGAAG 4380

CGAGCAAAGC AATTAAAGAA GCAGAAAAAG CAAGAAAAGC AAGAAAAAAA GAAGTGGATA 4440

AGCAGTATGA AGATGATGTC ATTGCTATAA AAAATAACGT CAACCTTTCT AAGTCTGAAA 4500

AAGATAAATT GTTAGCTATT GCTGATCAAA GACATAAGGA TGAAGTAAGA AAGGCAAAAT 4560

CTAAAAAAGA TGCTGTAGTA GACGTTGTTA AAAAGCAAAA TAAAGATATT GATAAAGAAA 4620
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GTTGGTGGTC TAACTTTAGA GAAGACCAAA AGAAGAAAAG TGATAAATAC GCTAAAGAAC 4740

AAGAAGAAAC AGCTCGTAGA AACAGAGAAA ATATAAAGAA ATGGTTTGGA AATGCTTGGG 4800

ACGGCGTAAA AACTAAAACT GGTGAAGCCT TTAGTAAAAT GGGCAGAAAT GCTAATCATT 4860

TTGGCGGCGA AATGAAAAAA ATGTGGAGTG GAATCAAAGG AATTCCAAGC AAATTAAGTT 4920

CAAGTTGGAG CTCAGCCAAA AGTTCTGTAG GATATCACAC TAAGGCTATA GCTAATAGTA 4980

CTGGTAAATG GTTTGGAAAA GCTTGGCAAT CTGTTAAATC GACTACAGGA AGTATTTACA 5040

ATCAAACTAA GCAAAAGTAT TCAGATGCCT CAGATAAAGC TTGGGCGCAT TCAAAATCTA 5100

TTTGGAGAGG CACATCAAAA TGGTTTAGCA ACGCATATAA AAGTGCAAAG GGTTGGCTAA 5160

TAGATATGGC TAATAAATCG CGCTCGAAAT GGGATAATAT TTCTAGTACA GCATGGTCGA 5220

ATGCAAAATC CGTTTGGAAA GGAACATCGA AATGGTTTAG TAACTCATAC AAATCTTTAA 5280

AAGG'ITGGAC TGGGGATATG TATTCAAGAG CCCACGATCG TTTTGATGCA ATTTCAAGTT 5340

CGGCATGGTC TAACGCTAAA TCAGTATTTA ATGGTTTTAG AAAATGGCTA TCAAAAACAT 5400

ATGATTGGAT TAGAGATATT GGTAAAGACA TGGGAAGAGC TGCGGCTGAT TTAGGTAAAA 5460

ATGTTGCTAA TAAAGCTATT GGCGGTTTGA ATAGCATGAT TGGCGGTATT AATAAAATAT 5520

CTAAAGCCAT TACTGATAAA AATCTCATCA AGCCAATACC TACATTGTCT ACTGGTACTT 5580

TAGCAGGAAA GGGTGTAGCT ACCGATAATT CAGGAGCATT AACGCAACCG ACATTTGCTG 5640

TATTAAATGA TAGAGGTTCT GGAAACGCCC CAGGTGGTGG AGTTCAAGAA ATAATTCACA 5700

GGGCTGACGG AACATTCCAT GCACCCCAAG GACGAGATGT GGTTGTTCCA CTAGGAGTTG 5760

GAGATAGTGT AATAAATGCC AATGACACTC TGAAGTTACA GCGGATGGGT GTTTTGCCAA 5820

AATTCCATGG TGGTACGAAA AAGAAAAAAT GGATGQAACA AGTTACTGAA AATCTTGGTA 5880

AAAAAGCAGG GGACTTCGGT TCTAAAGCTA AAAACACAGC TCATAATATC AAAAAAGGTG 5940

CAGAAGAAAT GGTTGAAGCG GCAGGCGATA AAATCAAAGA TGGTGCATCT TGGTTAGGCG 6000

ATAAAATCGG CGATGTGTGG GATTATGTAC AACATCCAGG GAAACTAGTA AATAAAGTAA 6060

TGTCAGGTTT AAATATTAAT TTTGGAGGCG GACTAACGCT ACAGTAAAAA TTGCTAAAGG 6120

CGCGTACTCA TTGCTCAAAA AGAAATTAGT AGACAAAGTA AAATCGTGGT TTGAAGATTT 6180

TGGTGGCGGA GGCGATGGAA GCTATCTATT TGACCATCCA ATTTGGCAAA GGTTTGGGAG 6240

TTACACAGGT GGACTTAACT TTAATGGCGG TCGTCACTAT GGTATCGACT TTGGTATGCC 6300

TACAGGAACG AACATTTATG CTGTTAAAGG CGGTATAGCT GATAAAGTAT GGACTGATTA 6360

CGGTGGCGGT AATTCTATAC AAATTAAGAC CGGTGCTAAC GAATGGAACT GGTATATGCA 6420
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ATCAGGTGCT ACAGGTAATT TCGTTAGAGG AGCACACTTA CATTTCCAAT TGATGCAAGG 6 54 0 

GTCGCATCCA GGGAATGATA CAGCTAAAGA TCCAGAAAAA TGGTTGAAGT CACTTAAAGG 6600 

TAGTGGCGTT CGAAGTGGTT CAGGTGTTAA TAAGGCTGCA TCTGCTTGGG CAGGCGATAT 6660

ACGTCGTGCA GCAAAACGAA TGGGTGTTAA TGTTACTTCG GCTGACGTAG GAAATATCAT 6720 

TAGCTTGATT CAACACGAAT CAGGAGGAAA TGCAGGTATA ACTCAATCTA GTTCGCTTAG 6 78 0 


AGACATCAAC GTTTTACAGG GCAATCCAGC AAAAGGATTG CTTCAATATA TCCCACAAAC 684 0 

ATTTAGACAT TATGCTGTTA GAGGTCACAA CAATATATAT AGTGGTTACG ATCAGTTATT 6 90 0 

AGCGTTCTTT AACAACAGAT ATTGGCGCTC ACAGTTTAAC CCAAGAGGTG GTTGGTCTCC 6960 


AAGTGGTCCA AGAAGATATG CGAATGGTGG TTTGATTACA AAGCATCAAC TTGCTGAAGT 7020 

GGGTGAAGGA GATAAACAGG AGATGGTTAT CCCTTTAACT AGACGTAAAC GAGCAATTCA 70 8 0 

ATTAACTGAA CAGGTTATGC GCATCATCGG TATGGATGGC AAGCCAAATA ACATCACTGT 7140

AAATAATGAT ACTTCTACAG TTGAAAAATT GTTGAAACAA ATTGTTATGT TAAGTGATAA 7200 

AGGAAATAAA TTAACAGATG CGTTGATTCA AACTGTTTCT TCTCAGGATA ATAACTTAGG 7260 

TTCTAATGAT GCAATTAGAG GTTTAGAAAA AATATTGTCA AAACAAAGTG GGCATAGAGC 7320

AAATGCAAAT AATTATATGG GAGGTTTGAC TAATTAATGC AATCTTTTGT AAAAATCATA 73 80 

GATGGTTACA AGGAAGAAGT AATAACAGAT TTTAATCAGC TTATATTTTT AGATGCAAGG 744 0 


GCTGAAAGTC CAAACACCAA TGATAACAGT GTAACTATTA ACGGAGTAGA TGGTATTTTA 750 0 

CCGGGCGCAA TTAGTTTTGC GCCTTTTTCA TTAGTATTAA GGTTTGGCTA TGATGGTATA 756 0 

GATGTTATAG ATTTAAATTT ATTTGAGCAT TGGTTTAGAT CTGTGTTTAA TCGCAGACAT 7620 


CCTTATTATG TTATTACTTC TCAAATGCCT GGTGTTAAAT ATGCAGTGAA TACAGCTAAT 7680 

GTTACATCTA ATTTAAAAGA TGGTTCTTCA ACTGAAATTG AAGTAAGTTT AAATGTTTAT 7740 

AAAGGGTATT CTGAATCAGT TAATTGGACC GATAGCGAOT TCTTATTCGA CTCTAATTGG 7800

ATGTTTGAAA ATGGAATTCC TCTTGATTTC ACACCTAAAT ATACTCATAC ATCAAATCAA 7860 

l^ACTATTT GGAACGGTTC TACTGATACG ATAAATCCAC GATTCAAGCA CGATTTGAAA 7 920 

ATATTAATTA ATTTAAATGC GAGTGGAGGA TTTGAACTGG TTAATTATAC AACAGGTGAT 79 8 0 

ATTTTTAAGT ACAACAAAAG TATAGATAAA AACACTGATT TTGTTTTAGA TGGTGTGTAT 804 0 

GCATATCGAG ATATAAACAG AGTGGGAATT GATACAAATA GAGGCATTAT AACATTAGCG 8100 


CCAGGTAAAA ATGAATTTAA GATTaAAGGA GACGTCAGTG ATATTAAAAC TACATTTAAG 816 0 

' ^ ^,'-:'^r;ATTTAA TGGATTATCA TGATCATTTA TCAGTAATGG 8220 
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ATTATGAACT GAATGAAGCT AGGTACATCA CCTTTACAGT TTATAGAACT ACTCATAATA 8 34 0 

GTTTTGTTTT TGATTTATTG ATTTGTGAAA ACTTCATAAT TTATCATGGT GAAAAATACA 84 00 

CAATTAAGCA GACAGCGCCA AAGGTTGAAG GTGATAAAGT TTTTATTGAA GTTACGGCAT 84 6 0 

ATCACATAAT GTATGAATTT CAAAATCACT CAGTGGAATC AAATAAGCTT GATGACGACA 8520 

GTAGCGAAAC TGGTAAAACG CCAGAATACT CTTTAGATGA GTACTTAAGA TATGGATTTG 8 580 

CAAATCAAAA AACGTCAGTC AAGATGACCT ATAAAATAAT TGGAGATTTT AAAAGAAAAA 864 0 

TACCAATTGA TGAATTAGGT AATAAAAATG GCTTAGAATA TTGTAAAGAA GCAGTAGATT 870 0 

TGTTTGGTTG TATTATTTAT CCAAATGATA CGGAGATATG TTTTTATTCT CCTGAAACAT 876 0 

TCTATCAAAG AAGCGAAAAA GTAATAAGGT ATCAATATAA TACTGATACT GTGTCTGCTA 882 0 

CTGTCAGTAC GTTGGAATTA AGAACAGCTA TAAAAGTTTT TGGGAAAAAG TACACAGCCG 88 80 

AGGAAAAGAA AAATTATAAT CCTATTAGAA CAACTGACAT TAAATATTCA AATGGTTTTA 8940

TAAAAGAAGG TACTTATCGT ACCGCAACAA TTGGGTCTAA AGCTACTATT AACTTTGATT 900 0 

GCAAGTATGG TAATGAAACA GTTAGATTTA CAATAAAaAA GGGCTCTCaA GGTGGAATAT 9060 

ATAAGTTGAT TTTAGACGGC AAGCaAATTA AGCaAATTTC TTGTTTTGCT AAGTCGGTTC 912 0 

AGTCTGAmAC AATAGATTTA ATAaAAAATA TTGATAAAGG CAAGCACGTT TTAGAAATGA 9180 

TATTTTTrGG AGArGrCCCC AAAAATAGAA TTGATATATC TTCAAATAAA AAAGCTAAGC 924 0 

CTTGTATGTA TGTTGGAACT GAAAAATCAA CAGTCTTAAA TTTAATTGCT GACAACTCAG 9 3 00 

GTCGCAATCA ATACAAAGCA ATTGTTGaCT ACGTCGCAGA TAGTGCAAAG CAGTTTGGGA 9360 

TTCGATATGC TAATACGCAA ACAAATGAAG ATATCGAAAC ACAGGATAAG CTGTTAGAAT 94 2 0 

TTGCAAAAAA GCAAATAAAT GATACTCCTA AGACTGAATT AGATGTTAAT TATATAGGTT 94 8 0 

ATGAAAAAAT AGAGCCAAGA GATAGCGTAT TCTTTGTTCA TGAATTAATG GGATATAACA 954 0 

CTGAATTAAA GGTTGTTAAA CTTGATAGGT CACATCCATT TGTAAACGCA ATAGATGAAG 9600 

TGTCTTTCAG CAATGAAATA AAGGATATGG TACAAATTCA ACAAGCGCTT AACAGACGAG 9660 

TTATTGCACA AGATAATAGA TATAACTATC AAGCAAATCG TATAAATCAT TTATACACTA 972 0 

GTACTTTGAA TTCTCCTTTC GAGACAATGG ATATAGGGAG TGTATTAATA TAATGGCAAC 978 0 

AGAAGAAGTT AAAATCAAAG CGCTACTTGA AAACGATAAA C 9821 
(2) INFORMATION FOR SEQ ID NO: 471: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1017 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 471: 

TTATTGTTTT CCAAATGGTC TTCTCGGTTA TTTAACCATC GGTTTCAATC CTACGTGCTT 6 0 

CACTTTCTAT CtATTAATTC AaTTTCAGCT TGACCACCCG CTGTATAAAG GGTCAAAGTT 120 

GCTAATCGAT AGCGTCTCAT TATAGGACCA ACATCGATAT CAATATTTTG AATACGAAAA 180 

TATGGTATTA CCTTTTCATC CAAAAATAAA ATGCCGTTTC GTACACGCAA ATGGTGTTTT 24 0 

TCAAATGCAT ATCTGCAGTG CTTATATCGA TAGACTGGCG CTATAACAAG CGTGAAAACA 3 00 

GCAACAAGTA ATATTATAAT CACACTACTA ACAATGGATA AATGGTTATC TAAAATCTCC 3 60 

CAAAATAGCC AGTTCAAAAT ATTAAATGCG ATTAAAAGTA CAAGCGCTAT GGGTATCCAA 42 0 

AACAGCACAG CACTTAACCT CATCACTTTT TTAGCGTGTG GTGACATAAA ATTATAATCC 4 80 

CTCATCATTT TCACCTCTTA AATACCATGA TTTCAATTTA TTTGCATCTT CACTTCTTGC 54 0 

GTATTTTAAG TTAATCGATT GGGCTCCCAC ACCTTTAGCA ACAATAAAGC TAAAATTATT 6 00 

TAAATTGTTG CGTTTAAGTA ATGTATGTTG CCAAGTGTCA AATCCTATAA TGTGATGCGC 6 60 

TTTAAAATAA TAAATATTTC GTTTCAATAG CTCGAAATTC TGGATAGTAA TTTGTTCTTC 720 

TGTCATTTTA AAACCCGCAT GTTTGACATA AAGATATCCT TTGATCACAA ATAAACCAAT 7 80 

AATGACTATT GTTATAATCG TAAATAACAA TAATAATTGA TTCCAAAAAT AACAGCCTAT 84 0 

ACCTGCCATA GCTATGACAA TAATACTAGG TATTAAAAAG TGTCTGTGGA AACCTGACAA 900 

AGGCATACCT TCATTAACTT GTTGATAAGA TAAATCTGGT ACTAAATTCT GGATAATTTG 96 0 

ATATGCTTTG TCTCGTTTAA TAAACGGCaA TATCGGCACA CTACCTGAAT CATTGTC 1017 
(2) INFORMATION FOR SEQ ID NO: 472:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6806 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 472:

TAGGATTGAT TAATCCTTTC TTCAAAATGA TGAATGTGTA ATGTTAAATA TATAATTTCA 6 0 

GACTCACTGA TATTAACATC AAATTGTTTT TGTATCATAG TTAAAATTTT ATATGCTGTG 12 0 

TTATAGCAAA TCGGATAGTG ATTTTTAATC ATAGACACAA AATCATCTTG TGCATGTATA 180 

TATTCTTTTC TTCTTAATCG GCGAATTAAA AATTGTACGT GCCTTATAAA ACGTTGGTAT 24 0 
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TTAATAACAT TATTnGATCA AGGTCATCTC ATGCATAGAT AAATCTTCTG TATTaGATGC 360 

AATATGTAAT GCAATAAATC CTATCTCATC TTCAGGaAAA TGTaCATCTA ATGCTGCATT 420 

TAACTGATCA ATCACCTGTT TAGCAATATG ATATGCATCA CTATATAACT GCATAGTTTC 4 80 

CATAACAAAT GGATTGCTAA TAACTTGATT TTGTTTTAAG CGTTTATAAG CAAATATAAT 54 0 

ATGATCCGTT AATGAAACTA CAAGTTGTTT TGAATCAACA TTCATCGCAG TATTAGAAAT 600 

AAAATTCAAC GAATCAATAA TTACTTGTAA TACATTATCA TCAGCGATTT CAACTAAACT 660 

TTTATAATGT GCTTTTTGTT GCTCACTTTC TAATTTATAA ATTTTCTCTA TTGTAATAGT 72 0 

TTGGTCGTTT AACGCCATTC CCrrCTTTTTT GTTAAAACCA ATACCTTTAC CGATTAAAAT 780 

AACTTCTTGA TCATTATTAG TACATACTAC GACATTGTTG TTCAATGTTT TAGTAACAAT 84 0 

ATATTCTCCC ATTATCATCA CCTATTTTTC TATTATTAAG ATTATATATC GGAAATGTCT 900 

AACTTGAAAG TATAATAATT TAAATACTTG TGTTCTATTA CACGCCTATC CTATATGATA 960 

TATCTTAATT TAATTTCGAT GTCTCTCAAA GTGGAATAAC TATAATAAAA AAATCTGACT 102 0 

CCCATATTTA CAAATAATTC TATTTATTTA TATATTATCT GAATTAATAC TCAATACAAA 108 0 

CTPiPiPiACGTA CTATTAAATT GTGCAAAGCT AAAACAAATT TATATTCATC TATCCAACAA 1140

TATGTCTTAT CAATGGTATA GTCTTTGCAC ACCAATGGAG GAAATAAATC TCAACCTTAC 120 0 

TATATTAATA TATAATCAAA TCTTAGATTA ACTAGTGTAA TGATACAGAT GATAATTGAG 126 0 

TACAAATTTA AAACCCTGAG ATTTTCGCTT TAATTTGAAA ACCTCAGGGT TTATTTGATT 1320 

TTTATATAAT GAATCGTTAC ATTAAAAATA TTTATTTATC AGAGTTCTTA TATTTGTTAG 1380 

CGCCCCAAGC ACTAATTCCA AATAAGTTAA TTTCTAAGTT TTCAGGTTTA AAGACAGGGT 144 0 

TCTTGCCTTC TTTTTTCTGC TTTTGATAAT CTTTCATCAA TGCAAAAGCT ACATTGGACA 1500 

GTCCTATAAT GGAAATAATG TTTACAATTG CCATTAAGCC CATAAATAAG TCTGCCGTAT 156 0 

TCCATACTGT TTCTGTTTTT ACAACTGCAC CGACAAAGAC AAGTACTACA ACAAGACATC 162 0 

TAAAGATAAA TAATATTACA CGGTTTGTTG ATAAAAATTC AATATTAGAT TGACCGTAAT 16 80 

AGTAATTACC TACAACAGAT GAAAATGCAA ACAGTGTAAC tGCTaTTGTT AAGAAAATAC 174 0 

CTCCAGCAGA ACCTAAATGC TCATTAAGTG CTGATTGAGT AACTGCAACA CCTTGAGGTG 1800 

CGTTATCACC AAATTTCAGT CCTGAATATA GTAAAATCAT GATTGCAGTT GCTGTACAAA 1860

CCAACATTGT ATCAAAGAAC ACACCTAATG ATTGGATTAA ACCTTGCTTA ACAGGGTGTG 1920 

GTACGGCAGC AGTTGCCGCT GCATTCGGCG CAGAACCCAT ACCAGCTTCG TTAGAGAATA 198 0 

AACCACGTTT GATACCTTGA AGAACCGCAG CACCTACAGC GCCACCAGTT ACTTGTTCGA 2 04 0 
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GCAATATTAC TAAAACCATA CCAATOTAAA TGATAGCCAT AATCGGTACA ATTAACGAAG 
ATAACGTAOC AATACTACOT ACACOVCCAA ATATAATAAT AGCTGTrACG ATTGCTAAAA 
TAATACCTGT GATTACrGGA CTAATATTAT ATTGCGTATT TAACGACTCC GCAATrGTAT 
TAGATTGCAC TGTGrTAAAT ACAAATGCAA ATGTAATTGT AATTAAAATC GCAAATACGA 
TACCTAGCCA TTTr^TTT AAACCTTTAG TAATATAGTA AGCTGGACCA CCACX^GAATC 
CACCATCm ATCATGTACT TTATAAACCT GAGCCAAAGT CGCTTCTATA AATGCACTCG 
CTGCACCTAT AAATGCAATA ACCCACATCC AAAATACTGC ACCTGGACCG CCTAAAACAA 
TCGCAGTCGC AACACCAGCA ATATTACCAG TACCAACTTCT CGAACCAGCA CTAATCGCAA 
.TGCrrGGAA TGGCGAAATA CCCTrCTTAC CATCTTCTAA AGrTTCTGGA CGTrCTACTA 
;^OCTCTAAA CArTTCAGGT AACATTCGTA ATTGAACGAA TTTAGAACTA ATCGTAAAGA 
AGAATCCAGC TGTCAATAAT AGACCAATTA AATATTGAGA CCATATTAAA TCGGTACCAA 

TL/^i-pr fj&aRTCTTTC AAAATAAACC 

CATGGACAAA TTCTTTAAAC CAii.vj^*^!'* xx«~.v— 

CCTCGCATCC TCTACATGAA TCAI^TACCT TCTATAAAAT TAGACCGAAT TGAACCTTCA 
GTAAATATAG AGATACATCA TCATTTCTTA TACAATACAA GAGATTTATA TTAGrrrGGT 
CAAAGTATAT CGCTAATrTA ACGATAAGTA CrTGGTCAGC ATTTAATATA AATCCCTTGA 
.riTAGTCAA AATTTAACAT TACTGTArTT TATCATTTAA TrTCGTGArr GCATATAGTT 
rTTAGCTAAT ATACA1X.TCT ATTACTTCAC CAAAATCATC TGTATCTACA A^TGAGC 
CATTIGTATA TTGrTCAGAT TtATGAATAT CATTAATTAA ACO.TGrrCT TCATTT^TT 
TTGAATATAA ...TATATTGA CTATGrTCAC CTGTCACTAC ATGTGCAGCT ACAATACGAT 
GTGGATTrrr C.TTAATTCT TTTAATAAAG rTATTCCaCG TTGTGCTCTT TTAGCAACTT 
OTAAGATTTT AAAACTAATA CGTTTrAACG AGCCGCGTTG TGTGGCCATC AATATAGTAT 
CAirrrCAGA AACACCTTCT GTCATAACAA CGAAATCTTC AGCTTTAAGA TTTATTGArr 
.AACACCAGC TGCCCTTAAT CCGGTATCTG ATAGTTCACT TX^TATTATAC GrTAATGACA 

TACcrrrATT agtaatgacg gtaattaatt GATcrrmc aaagcgcata acactaatca 

AATCATCATT TTCTriAACT TTAGTAGCAA TTAAAGGTrr ATTAAAACGC GTrGTTTrAA 
ATAGAGGCAC TGTACTTTTC TTAATCATGC CArTTTGAGT CGCAAAAACA TAAAATGCAT 
CTGTATTAAA GTCCrTTTCA TTAAAGACAT TAATAACCAC TTCATC^CT TCGATAGGAA 
CTATTTGTGA TACATGTTGT CCCAATTCrT TCC^CGAAT ATCTGCTAAT TTATGAACCG 
OTATAAATAG ATAACGACCT TTArTTGTAA ATACTAGTAC GGTATCTTGC GTArTTACrr 
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TAAAGCTACG AATAGAAGTA CGTTTAATAT ATCCATGACG TGTCATACTT AAAATAACTy 3 960 

CTTCACTAGG CACCATAACT TCTTTGTCAA TTTTAATTTC TTCAATTTCT GCTTCAATTA 4 020 

AAGACAGTCG TTCAGATTTG AATTTCTTTT TAATTTCATT CAATTCTTCT TTTATGACAT 4 08 0 

TCAATAATGC ATCATGGTTA TCAAGAATAT GACGTAATTG TTTGATTAAT GCTTCAAGTT 414 0 

CTTTATGTTC ACCTTCAAGC GCAACTATGT CAGTATTTGT TAAACGATAT AACTGTAACA 420 0 

TTACAATTGC TTCAGCCTGT TCTTCTGTGA ACTCGTATAC TTCGATAAGG TTTTCTTTAG 4 260 

CGTCACGCTT GTTTTTAGAG CwACGAATCA ATTCGATTAC TTTATCTAAA ATTGACAACG 4 320 

CTTTAATCAA ACCTTCAACG ATATGCATAC GTTTTTCTGC ATTATCTAAT TCAAACTTCG 4 380 

TTCTATTTGC AACAACCTCA ATTTGGTGAT TCAAATAACT ATCTATAATT TGACGAATAC 444 0 

CCATCAATTT TGGACGACCA TCACTAATAG CGACCATGTT GAAATTATAT GAAATCTGTA 4 500 

AATCAGAGTT TTTATAAAGA TAATTTTTGA TTGATTCACT GTTCACATCT TTTTTCAATT 4 56 0 

CAATTGCTAT TCGTAAACCA GTTCTATCAG TTTCATCACG TACTTCAACG ATACCATCGA 462 0 

CTTTTTTGTC AGCACGTAAT TCATCGATAC GTTTTACTAA GCTACTTTTG TTCACTTCAT 468 0 

ATGGAATTTC AGTAATAATT AACTGTTTAC GTCCATTGCG TAAAGTTTCT TCTTCAACTT 4740

TAGAACGAAC TATAATTCTA CCTTTACCTG ATTCATAAGC TTTTTTAATA CCATCAATAC 4 8 00 

CTTGAATAAT ACCACCAGTT GGAAAATCAG GACCTTTAAT ATATTTCATT AATTGATTGA 4 860 

CTGTAATATC CGGATTATCA ATATATTTAA GTGTTGCTTG AATCACTTCA GCTAAATTAT 4920

GTGGTGGTAT ATCTGTCGCG TAACCTGCAG ATATACCTGT AGAACCATTC ACTAGTAAGT 4 980 

TAGGAAATCT TGATGGCAAT ACCATTGGTT CGAGTGTCGT ATCATCATAG TTTGGAATGA 5 04 0 

AAGAAACTGT CTCTTTATTA ATATCACGTA ATAACTCTTC AGCTAGTAAG CTTAACTTAG 510 0 

CTTCAGTGTA ACGCATTGCC GCTGGCGGAT CATTATCGAT ACTACCATTA TTACCATGCA 5160 

TTTCTATTAA GACATGTCGT AACTTCCAGT CTTGACTTAA ACX3GACCATT GCTTCGTACA 5220 

CTGAGGAGTC TCCATGTGGA TGATATTGAC CAATAACATC ACCGACTGTT TTCGCACTTT 52 8 0 

TACGGAAATT TTTATCGTGT GTATTACCAC TTGAATACAT TGCATATAAA ATACGACGTT 5 34 0 

GTACTGGTTT TAAACCATCA CGAACATCTG GCAATGCACG CTCTTGAATA ATATATTTAC 54 0 0 

TATATCTTCC AAAGCGATCA CCTAAAACAT CTTCAAGTGA TAAATCTTGA ATTATTTCAC 54 6 0 

TCACTAGATT TCCTCCTCAT CAAATTGATC ATTTTCAAGC ACTTGTACTT CAGAATTATC 552 0 

TAAAATACTT TGGTCCTCTT GCATACCAAA CTCAACATGC TTTTCAATCC ATTCACGTCT 5580

AGGTTGTACT TTGTCACCCA TTAATGTTGT TACACGTTTA GATGAACGCA CTTCATCTTC 564 0 
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AGGGTTCATT TCACCCAAAC cmGTAACG TTOTAACGTG AaGCCTTTAC CAAGTTCTTT 
TTGCAATTTA TrAAGCTCTT CGTCTGTCCA AGCGTATTCA ACTCGCmG TrrrGCCrTT 

Accrmrcc AArrrAXAAA gtggaggtaa agcaataaat acacgacctg cttgaacaag 

CGGTTTCATA TATTTGAAGA AGAATGrTAA CAATAGCACT TGAATATGCG CACCATCAGT 
ATCAGCATCA GTCATAATAA TTACACGATT ATAATTACTA TCTTCAATTT TAAAGTCAGT 
ACCAACGCCT GCCCCGATTG TGTGGATAAT TGTATTAATT TCrrCATTTr TAAAAATATC 
TTCTAGACGT GCTTTCTCTG TATTAATTAC CTTACCACGT AATGGTAATA TCGCTTCGAA 
TTTGCGGTCr CGTcCAAGTT TTGCTGAACC TCCCGCAGAA TCACCTTCGA CTAAATACAA 

TTCArrmr tcagtctttt tactttgtgc aggtgttaat ttaccaqata gcaaagtgtc 
rrxACGcrrG nrrrcTrAc ctgaacgagc atcttcacga gctttacgtg cagcttccct 

TGCrrcTTGT GCTTTAATCG CTTTnTCAC AAGTGATTTA GACAATTGTC CTnTTCTTC 



TAAATAGAAT GGCAATTTGT CTGCAACAAC 'luAAT 
ACCCAATTTA GAriTCGTrr GTCCTTCAAA TTGCAATAAT TCrTCTGGAA TACGAACAGA 
CACAACAGCT GTTAAACCrr CACGAATATC ATTACCATCT AAGTrTTTAT CTrrrGTTIT 
AAGTrCATTA ATACGACGTG CATAATCATT AAATACACGT GTCATTGCTG mTAAAACC 
AACTTCATGT GTACCACCAT CTTTAGTACG TACATTATTT ACAAAACTTA AAATACTTTC 
TGAATATTGA TCATTATATT GGAAAGCTAC GTCTACCTCT ATACCATTTG CTTCACCTGA

AAATGTAGCC ACGTCATGCA AAACTrcm TCCrTCATTG ACATAACTAA CAAACTCTTT 
GATTCCTTCT TATAATGGTA TGTCTT 
(2) INFORMATION FOR SEQ ID NO: 473: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1716 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



rrriACTTGA TCrrGGTCTT TTCCGCAGAA AGAGCATTTC CAAArmCT TCATCTTCCA 
TTGAATTTAA ACATTCrrTT TACACCCCTA rTCGTTAAAG ACTATACTAG ATrGGATGTT 
ACAATGCAAC ATATTAACAT ACAAACrTTT TGCTTAAAGA ATAGTAGCAG ATACATAAGC 



6300 
6360 



^r^^r^rr^rrn^T^n. rTTPAGAAGT 6420 



6480 
6540 
6600 
6660 
6720 
6780 
6806 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 473: 

. ^-nT.'^i^r^z, AA-^AT^^Ti^CA CCACTTCCTG CTACAAGTiin 6 0 

GGGGCAnAAA TTCCAATACA CiCAi lAC^ AA.A.AI-.la l,^. 
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CCtTCAACGA ACTTTGCGTT ATCTCTTAAT AAATCGATAA CTTTTTGGAT ACGAACATCA 360 

TTTTTAATGA TATCAGTATT ACCTAAAGTA TTTTTGATAT CTTCAACTGA GATATTAAAT 420 

TGTTTACTCA TTTTTTCTAA TTCTTTATCG ATATCTTCAT CAGTAGCTTC GATTTTTTCA 480 

GCTTCAGCGA TCGCAGTTAA AGTTAAGTTA GTTTTAACAC GTTGTTCTGC ATCGTCTTTC 540 

ATTTGCTCTC TTAATTGAGT TTCATCTTGA CCTGAGATTT GGAAGTACGT TTGTAAATCT 600 

AAACCTTGTT GTTGAATTCT TTGTGCAAAT TCAGACACCA TACGATCTAA TTCAGTATTA 660 

ACCATTGCTT CAGGAATATC GATTGTTGTA TTATCAGTAG CTTTTGTAAT CGCTTCTTCT 72 0 

TTTTCAACAT TTTCAGCATC TGTAGCTTTT TGTTCAGCTA AACGTTTACG TAAGTTTTCT 780 

TTGTACTCGT CTACTGTATT TGCTTCTGCA TCTAATTCAT TAGCAATTTC ATCTGTTAAT 84 0 

TCTGGGACTT CTTTAAATTT AATTTCGTTA ACTTTTGTTT TGAAAGTTGC TTCTTTACCG 90 0 

GCTAATTCTT CAGCATGGTA TTCTTCTGGG AATGTTACGA CAACATCTTT TTCTTCGTCA 96 0 

ACTTTCATAC CTTCTAATTG CTCTTCGAAA CCAGGTATGA ATGAACCTGA ACCGATTTCT 1020 

AAATCGTAAC CTTCAGCTTG TCCACCTTCG AATTCTTCtC CGTCAACTGA ACCACTAAAG 10 80 

TCGATGTTAA CTGTGTCGCC ATTTTCAACA ACACCATCTT CTTTAACGAC CATTTCAGCT 1140

AAATGTCCTA AGCTGTGGTC AATCGCTTCT TGTAACTCAT CATCAGATAA TTCAGTTTCT 1200 

TGTTTTTCAA TTTCAAGACC TTTATAGTCT CCTAATTTAA CTTCTGGCTC AACTGTAACT 1260 

GTTGCTTCAA AAATGAAATC TTTACCTTTT TCAATTTGAG TAACACTTAC TTCtGGTTGT 1320

GCAACTGGTT TAATATCAGT TTCGTCAATT GCTTCACCAT AAGCATCTGG TAATAAAATG 13 80 

TCGATAGCAT CTTGATATAA TGCTTCTACA CCAAAGCGTT GTTCAAAAAT TGGACGTGGC 144 0 

ACTTTACCTT TACGGAATCC AGGTACGTTA ATTTGTTTAA CCACTTTTTT GAATGCTTGA 1500 

TCTAACGCTT TGTTTACTTT TTCTGCAGGA ACAGTAACAG TTAATAAACC TTCGTTACCT 1560 

TCCTTTTTTT CCCAAGTTGC TGTCATGTAT ATATACCTCC ATGATTAACT AATTTATTTT 162 0 

TTCAACTTCC CTATTATATC ATACGTCTAT TCCCTATACA AACATTGAAA TCACAACGTT 1680 

TATATATTTG TAAATCAACT TTTTTCGTCA AAACTA 1716 
(2) INFORMATION FOR SEQ ID NO: 474:
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 795 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 
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TGACCAAGTA CGTTTCGAAG TTGCCATTAA AGCATTAAAC CCATCATTGA AAGCATTCGC 60 

ACCTGTACGT GAGTGGGCAT GGAGTCGTGA AGAAGAAATC GATTATGCAA TTAAACATAA 12 0 

TATCCCTGTA TCAATCAACC ATGATTCACC TTATTCTATC GATCAAAATC TATGGGGCAG 180 

AGCGAATGAA TGTGGTATTT TAGAAGATCC TTATGCTGCG CCACCAGAGG ATGCGTTCGA 24 0 

TCTAACAAAT GCTTTAGAAG AAACACCAGA TACTGCTGAT GrAATCATTT TAACGTTTGA 3 00 

TAAAGGCATC CCAGTTCAAA TTGATGGCAA AACATATGAA TTAGACGATT TAATTTTAAC 360 

GTTGAATGCA TTAGCTGGTA AGCATGGTAT CGGAAGAATT GACCATGTAG AAAATAGACT 420 

TGTAGGTATC AAATCAAGAG AAATTTATGA GGCACCTGCT GCAGAAGTTA TTTTAAAAGC 480 

GCATAAAGCA TTAGAAACGA TTACGTTAAC GAAAGATGTC GCACACTTTA AACCAATCAT 54 0 

TGAGAAGCAA TTTGCTGAAC AACTATACAA TGGACTTTGG TTCTCACCTT TAACTGATAG 600 

CTTGAAATTA TTTATTGATA GTACTCAGCA ATACGTAAGT GGTGATGTCA GAATTAAATT 660 

ATTCAAAGGT AATGCCATCG TGAATGGTAG AAAATCACCT TACACATTAT ATGATGAAAA 72 0 

ATTAGCAACT TATACAAAAG AAGATGCATT TnATCAAGAC GCTGCTGTTG GCTTTATCGA 78 0 

TATCTATGGT TTACC 795 
(2) INFORMATION FOR SEQ ID NO: 475: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 887 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 475:

CGATTGAAAG AAGACGGTTC AGTTGAAAAG TTTCCAAAGC CAGTAATTAG CCAACAACCA 60 

GAAGGATATA CGAGTCATTT TAGAGATCCT AAAGTTTTTA AATATGATGr GaAATATTAT 120 

GCAATCaTTG GTGCmCaAAA TAATGATCaG CaAGGTCGAT TATTACTTTA TAATACTGAA 18 0 

GATATAATTA ATTGGCATTA TTTAGGTGAA ATAAATACAG AGTTGGATGA TTTTGGATAT 24 0 

ATGTGGGAAT GCCCAGATTA CTTTAATGTA GATAATCAAG ATGTCATACT TATTTGTCCA 3 00 

CAAGGTATTG AACCAAAAGG CGATCAGTTC AAAAATATTT ATCAAAGTGG TTATATACTT 3 60 

GGAAAGTTTG ATATTGAAAA GTTAACATAT GAACATGAAA ATTTTGTCGA GCTTGATAAT 420 

GGTTTTGATT TCTATGCACC TCAAACATTT TTAGATGAAA AAGGCCGACG AGTACTAATT 4 80 

GGATGGATGG GGTTACCGGA AATCGAATAT CCTACTGATA ATGAAGGATG GGCCCATTGC 54 0 
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GCGTTGGAAA AATTACGTCA CAATAAAGAG ACAGCATTtA GGctACGCAA ATAAATTTAC 660 

TCGAAAATTA CATCCGTATG AAGGTAAACA GTATGAATTA ATCATAGATA TTTTGGATAA 72 0 

TGATGCTACC GAAGTGTACT TTGAATTACG TACATCTAAG ACTTCTTCAA CATTAATTGC 78 0 

TTATAACAAG CGTGAAAATA AAATAACATT AGATCGCAGC GACAGTGGTT TATTGCCGAC 84 0 

AAATGTTGAA gGTACGACGC GTAGTACGAT ATTAGACACG CCATTAA 8 87 



(2) INFORMATION FOR SEQ ID NO: 476:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1183 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 476:

TTGGAAnCAA AAAACCATTG GTAAACCGTG TTnAACCGGA TTTCCGATGG ACCTTTTAAA 60 

ACnACCAAAT AGAAAGCTTT GATAAAAGGT AATTATGGTA CTGATAACAA ACAAGTTCAA 120 

AAACATCATG ATTTAGTACG TATGCTTTTG ATGGATCAAG ATGGTTTTTT AACTGAAAAT 180

AATAAAGTTG ATCATTTCAT TGATGGAAAT GATTTATATG ATCAAGTTTT AAAAGATATT 24 0 

AAAAATGCAA AAGAtATATC CATTTAGAGT ACwATACTTT CGCTTwAGAT GGTTwAGGTA 3 00 

AAAGAATTTT ACATGCTTTA GAAGAAAAAT TGAAACAAGG TCTAGAAGTA AAAATATTAT 360

ATGATGATGT TGGATCTAAA AATGTTAAGA TGGCAAATTT TGATCATTTT AAATCGTTAG 42 0 

GTGGAGAAGT TGAAGCATTT TTTGCTTCAA AATTACCGTT ATTGAATTTC AGAATGAATA 4 80 

ATAGAAATCA TAGAAAAATC ATCGTAATCG ATGGTCAACT AGGTTATGTC GGAGGATTTA 54 0 

ACATTGGTGA TGAATATCTm GGATTAGGAA AATTAGGATA TTGGAGAGAT ACGCATTTAC 60 0 

GTATACAAGG GGATGCGGTT GATGCACTGC AGTTGCGATT TATTTTAGAC TGGAATTCGC 66 0 

AAGCGCACCG TCCACAATTT GAATATGATG TTAAGTATTT CCCTAAAAAG AACGGACCAT 72 0 

TGGGCAATTC ACCAATTCAA ATAGCTGCAA GTGGCCCGGC TAGTGACTGG CATCAAATTG 7 80 

AATACGGTTA TACAAAAATG ATTATGAGTG CAAAGAAATC TGTATATTTA CAATCACCAT 84 0 

ATTTCATTCC GGATAATTCA TATATaAATG CCATTAAAAT TGCTGCTAAA TCAGGTGTAG 900 

ATGTACATTT AATGATTCCA TGTAAGCCAG ATCATCCATT AGTATATTGG GCGACATTTT 96 0 

CAAATGCCTC TGACTTATTA TCAAGTGGTG TTAAAATTTA TACGTATGAA AATGGATTTA 102 0 

TACATTCTAA AATGTGCTTA ATTGATGATG AAATCGTATC AGTGGGCACA GCAAATATGG 10 8 0 
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CTAAAGATTT AAGGGTGGCT TATGAACATG ATATTACAAA ATC 1183 
(2) INFORMATION FOR SEQ ID NO: 477:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2332 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 477: 

GGTTATATGT TTGGTATTAC TCATTATATT GATTGGAGGT TGTGTCATTA TGACAAAAAC 6 0 

AAATGGTCGA AACGCTCAAA TTAAAGAAAA TTTCAACAAA ACATTAAGTG TATATCTAAC 120 

CAAGAATCTC GATGATTTTT ACGATAAGGA AGGTTTTCGA GATCAAGAAT TTGATAAAAG 180 

AGATAAAGGG ACTTGGATTA TTTATTCTGA AATGGTTATC GAACCAAAAG GGAAnAATAT 24 0 

GGAATCGACJA GGAATGGTGC TCTATATCAa TCGCAATACT AGAACmACGA AGGGTAATTT 300 

TATTGTCACC GAAATAACTG AAGATAGTAA AGGATATTCA CGTAGTAAAG AAAAAAAATA 36 0 

TCCTGTCAAG ATGGAAAATA ATCGAATTAT TCCAACAAAG CCTATACCGG ATGACAAGTT 4 20 

AAAAAAAGAG ATTGAAAACT TTAAGTTCTT TGTACAATAT GGAAATTTTA AAGATTTTAA 4 80 

AGATTATAAA AATGGTGATA TTTCATATAA TCCTAATGTG CCAAGTTATT CTGCAAAGTA 54 0 

TCAATTGAAT AATGATGACT ATAATGTTCA ACAGTTAAGA AAACGATATC ATATTCCAAC 600

CAAACAAGCG CCCGAATTAA AATTGAAAGG ATCCGQCAAT TTAAAAGGCT CATCCGTAGG 660 

ATCTAAGGAT CTAGAATTTA CGTTTGTAGA AAATCAAGAA GAGAATATCT ATTTTTCAGA 72 0 

TTCGGTCGAA TTTACACCTA GCGAGGATGA TAAATCATGA GTCAAACGGA ATATCAAATA 780 

AAATCTGGCA ATATAAAAGG TAACTCTGAA GAAACAAGTA CAGTATCTAA TATAAGTTAT 84 0 

GAAATAGAAA ACGCAAATAA CAGTGGTTTA AAACAAAATA AAATTGATAA ACAAATTAAA 900 

AAGTTACAAG AAAAAAATAA ATTCCCTAAA AATCTTTCAT ATCTTAAAAG TTATACGGAC 96 0 

CCCAAAACAG GCACGACTAC AAGCGCCTTT TTAAATAAAG ACACTGGCAA AGTTACTTTA 102 0 

GGTATGACAG GTACTAATGT ACACAAAGAC GCAATATTAA AACAAACATT TGGTGTTCCT 108 0 

TCTTATCAAG GATATATAGA CGTGAGTGAA ACgCTaAAAG ATATTGGGGC CGATGTCAAT 114 0 

ATTGGCCTTC ATTCCGTCAC AGATAAAGAT CCACATTATA AAAATACCCA AGACTTTATC 12 0 0 

AAAAATATCA AAAAAGACTA TGATATTGAT ATTATTACCG GACATTCGCT GGGCGGTAGA 126 0 

GATGCGATGA TTTTAGGTAT GAGTAATGAT ATTAAACATA TCGTTGTGTA TAATCCAGCT 132 0 
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ATTGAAAAGT ACGATGGTCA CATTGTAAGA TTTGTGTCTG ATGAAGACGA ATTAGATGCA 14 4 0 

GGTGTCCGCA ATCATTTATA TGAAACTGCT GGAGAAAAAA TAGTACTTAA AAATGGAGAA 150 0 

GGCCATGCAA TGAGTGGTAT TTTAATGAGC AGAACACAGG CTATAATCTT AGCTGAATTA 156 0 

AACAAAGTTA AAGGCTACCA AGACGAAAAT AATAAAGCAT TAAAATCCGT TCGTAAACAA 162 0 

ACGAGGCATA GATTACATAA AGTAGAGACG TTAAGAGCGA ATTGGATTCA AACAACGGGT 16 80 

GGATCACTCT CTTCCTCCCw ACAACAATTA TTAGAAGCTT TAACAGCACT AACCATTGCC 174 0 

GAAGGCTTAA ATCAATTAGT GAATGAAGAA AGCCAACATT TGAAAAAATG TATCACGCGA 1800 

TGGCACATAA ATTTGGAGAC AACTGGAAAA AAGCGCAAGA AGTTGGAAAT GAAATTGGTG 18 6 0 

AAAAATTAAC CTCTGAAGAG GTTATAGATG rATTAAGAAA AGGTGGCGCG TATGAAAGTa 192 0 

AACTTGAAAC AGATCCCAAA AGAAAAATTG ATGATAAGAT AAAGAAATTA AATGATGTTT 198 0 

ATAAAAATTG TAATGGCTAT ATCGCAAAAA TTAAACAGAG TATCGAAGCA ATTGTTTCTA 2 04 0 

ATGACCAAAT GTTAGCGAGC CAGATTGATG GGATGATGTA ATGTTTACTA CGTATAAnAA 2100 

TATTAATGAA CTTGAAAATG CCTATGATGA AGAAAGAAAA CAATTGAATG ATGCATTCAA 2160 

TCAAATTGAT GAATTAAGAC ATCAAACACG CAAGAmATGT GAACAAATGT ATGATCATTT 2220 

CTTATATCTC AAACATAAAA TGAATTmyms TGAAGACGCT ATGATCAGGA TGACACGTAT 22 80 

TATAGAATCT TTCGATAGAG AAACGAATCA ACGTATCCGA CATCACGAAA TG 23 32 
(2) INFORMATION FOR SEQ ID NO: 478:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 865 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 478: 

TTTACTACCC AGTATCTCTT TTTATAAATT ATATAGCCAC CACATATGGT GGAAAGTCTT 6 0 

TTTAATTAGA ATTTTGTTTT TTCAGTTAAG AAAGCTTCTA ACTCTGAGAT TGGCATACGA 12 0 

ACTTGTTCCA TTGAATCTCT GTCACGTACT GTAACTTGAT TATCTTCTAA TGAATCAAAG 180 

TCGAATGTtA CACAATAAGG TGTaCCGATT TCATCTTGAC GACGGTATCT TTTACCGATA 24 0 

GATTGTGATT CATCGAAATC GATTGAGAAT TTAGAACTTA ATTGCTCAAA AATCTTAATC 3 00 

GtTCGCCAGA TAATTTCTTA CTTAAAGGTA AAATCGCTGC TTTATATGGT GCTAATGCAG 3 60 

GATGGAAGTG TAAAACTGTA CGTGCATCTT TACTACCTTC AACGCCTTCT TCATCATATG 4 20 
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GAATATATTT TTCGTTCGTT TCTGGATCAT GGTATCTGAA ATCTTCACCA GAGTGTTCAG 54 0 

CATGTTTACG TAAGTCGAAG TCTGTACGAC TTGCGATACC CCATAACTCA CCCCAACCAA 600 

ATGGGAATTT ATATTCAATA TCAGTTGTTG CATTTGAGTA ATGAGATAAT TCATCTTCAT 660 

CATGATCACG TAAACGCATA TTTTCACTGC TCATATTTAA GCTTGITAAC CAGTCACTTG 72 0 

CAAAAGTTTT CCAATAATTT TGCCATTCGA TTTCTTCTCC AGGTTTACAG AAGAATTCAA 7 80 

GTTCCATTTG TTcAAATTCT CTTGTTCTGa AAATGaAGTT ACCTGGAGTG aTTTCaTTAC 84 0 

GGaATGaTTT ACCAATTTGG ACCGG 865 



(2) INFORMATION FOR SEQ ID NO: 479: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1444 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 479: 

TAATGAGTAA ATAAGTACCA GATAAAATCA TGAATATCAT CCAACATGCG GTTAACTCTA 60 

CTAAATAATT AATAATAGTA TTTTCAGTAA ACAAAGAACT ATGTATACTT CGCATCACAT 120 

TAGAATACGT ATGTTTCGCA TTTTGATCTG CAACAAATTG ATTGTTATGA TCTAGAAAGA 180 

CGTAACGTTG ATTTCCTGCC ATATCACTCA GTGTAATTCG TTTGTTATAT GGTTCATCAA 240

GTATGCTAAC TTTACTTACA AAGAATCCTT CATATTGTTG TTCAACTTGA TGTACTGCAT 300

CATTTAATGT TTGATGCGTT TTTACATCAC TGTCACCAAA AAACTCATTC TTATAAATAT 360

TATTTTCAAC TTCTGGAAAG AACAAGTAAC CAATGCCCGA AATGGTTAAA GTGATTAACA 420

GTGGAGCAAT AAATATTGCT GCATAGAAAT GTAATCTTTG TAATGGATTA AATGTATTTT 4 80 

TCATATTTCC CTCCCAATTG GCTATTATAC GGTGTCAATT CTGTGATGTG TGTGAACAAA 54 0 

CTGTGACAAT ATTTATTTTC TAGAAAAATT TAACGATGAT TTGTGATTTT TAGAAAAATG 60 0 

AACTTTTAAG TTGGAATGTT TGAAGAAAAT TGATTATTCG TATGTTTTAT CAAGCAGCTA 6 60 

TGATAAAATT TAAACATAAT ACAATGCGAG CCATTTAACG ATCTATGTTT AAATGGACAT 72 0 

CGATATTGTA TGAATTCGTT GTAACAAGCA AGCATTTCTA TGTGAACGAA CCAAAGGGGA 78 0 

AAGTAACATG ATTAATAAAG AACAATTAGA TCTTTTATAT AAATTAAAAA AAGAAGTTGA 84 0 

AAAGTCGCGA AATGAAGCAC TTTTACATAC AATTAACCAA GTAATTAAGA AAGTATATTT 900 

GCAGCAATAT ACATGTTCGT TCGTTGGACA TTTTTCTGCA GGTAAATCGA CACTGATAAA 96 0 
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TATTGTGTCA GTTTCAGACA ATCACGATAT TATTGCTAAT TTGCCGAATC AAACGTATGC 10 8 0 

CAAATTATCT AATTATGATG AAGTAAGGGA AATGAATCGC CAAAATGTCG ACGTTGAATC 114 0 

TGTAGAAATT AATTTTCAAT CAGCTAAATT TGAAAATGGG TTTACGTTGC AAGATACACC 1200

AGGTGTTGAT TCAAATGTTG CATCACATCA GTCAATAACA GAACAATATA TGTATACAAG 12 6 0 

TAATATGATA TTTTATACGG TTGACTATAA CCACGTTCAA TCTGAACTTA ACTTTAAGTT 1320 

TATGAAGCAT ATAAATGATG TTGGaATACC TGTTGTGTTT ATCATTAATC AAATTQACAG 13 8 0 

CATCCAAGAC GATGGAATTG TCATTCTCTA CGTnTTAAAT CTCGAGTTGG AAAAATCAAT 144 0 

TGGC 1444 

(2) INFORMATION FOR SEQ ID NO: 480: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6309 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 480:

GCAGCAGCTT ATCGTGAGTT ATCATTATTA TTACGTAGAC CTCCAGGTCG TGAAkCATAC 60

CCAGGTGACG TATTCTACTT ACATAGTAGA tTATTAGAAA GAGCAGCAAA ATTAAACGAT 120

GACTTAGGTG GCGGTTCAAT TACTGCATTA CCAATTATCG AAACACAAGC TGGTGATATT 180

TCAGCTTATG TACCAACAAA CGTTATTTCA ATTACAGATG GACAAATCTT CTTACAATCT 240

GATTTATTCT TCTCAGGTGT AAGACCAGCG ATTAATGCCG GACAATCTGT ATCTCGTGTT 300

GGTGGATCTG CACAAATTAA AGCAATGAAG AAAGTTGCTG GTACGTTACG TCTTGACTTA 360

GCGTCATACA GAGAACTTGA ATCATTTGCA CAATTCGGTT CAGACCTTGA TGAATTTACT 420

GCAAGTAAAT TAGAACGTGG TAAACGTACT GTTGAAGTCT TAAAACAAGA TCAAAACAAA 480

CCATTACCAG TCGAACACCA AGTGTTGATT ATTTATGCAT TAACAAAAGG ATATTTAGAT 540

GATATTCCAG TTGTAGATAT CACACGTTTT GAAGACGAGT TAAACCACTG GGCAGAATCA 600

AATGCTACTG AACTGTTAAA TGAAATCAGA GAAACTGGTG GCTTACCAGA TGCTGAGAAG 660

TTTGACACAG CAATTAACGA ATTCAAAAAA AGCTTTAGCA AATCTGAATA ATAAACAAGT 720

TTAGTATAAG GTGGTGAGAT AGTGGCTTCT CTTAAAGAAA TAGATACTCG AATAAAATCA 780

ACCAAAAAAA TGAAGCAGAT TACGAAAGCG ATGAACATGG TATCAAGTTC AAAACTTCGT 840

AGAGCTGAAA AAAATACAAA ACAATTCACA CCATATATGG ATAAAATGCA AGATGCAATT 900
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ACTAGAAGTG GATATTTAGT TATCACGAGT GATAAAGGTT TAGCAGGTGC ATATAGTGCA 102 0 

AACGTGCTTA AAAAATTGAT TACTGATATT GAAGCGAAAC ATCAAGATAG TAGCGAATAC 10 8 0 

AGTATTGTAG TTTTAGGGCA ACAAGGTGTT GATTTCCTAA AAAATAGAGG TTATGACATT 114 0 

GAGTATTCTC AAGTAGACGT ACCTGATCAA CCTTCTTTCA AATCTGTTCA AGCACTAGCT 1200 

AACCATGCTA TAGACTIATA CAGTGAAGAA GAAATTGATG AATTAAATAT ATACTATAGT 1260 

CATTATGTCA GCGTTCTTGA AAACAAGCCT ACATCTAGAC AAGTATTACC ATTATCTCAA 132 0 

GAGGATTCTA GTAAGGGGCA TGGTCATTTG TCTTCTTATG AATTTGAGCC AGATAAAGAA 1380

TCTATCTTAA GTGTAATCTT GCCTCAATAT GTTOACyiGTT TGATTTACGG AACAATATTA 144 0 

GACGCAAAAG CAAGTGAGCA TGCAACACGT ATGACTGCGA TGAAAAATGC CACTGATAAT 1500 

GCAACTGAAC TTATTGATGA CTTATCATTA GAATATAACA GAGCGAGACA AGCAGAAATT 1560 

ACGCAACAAA TTACTGAAAT TGTTGGTGGT TCCGCAGCGC TTGAATAATA TTTAAAGGAG 1620 

GAAAATAGCA TGGGAATTGG CCGTGTAACT CAAGTTA7GG GTCCTGTAAT TOATGITCGA 16 80 

TTTGAACATA ACGAAgTTCC TAAAATTAAT AACGCCTTGG TTATTGATGT GCCTAAAGAA 1740 

GAAGGTACAA TACAACTAAC ATTAGAAGTT GCGCTGCAAT TAGGTGACGA CGTTGTTCGT 1800 

ACAATTGCGA TGGATTCAAC TGATGGTGTC CAAAGAGGCA TGGATGTAAA AGATACAGGC 186 0 

AAAGAAATTA GTGTACCTGT TGGTGACGAA ACATTAGGTC GTGTATTTAA TGTACTAGGT 192 0 

GAAACAATTG ACCTTAAAGA AGAAATTAGT GATTCTGTTC GCCGCGATCC TATCCATCGT 1980

CAAGCACCAG CATTCGATGA ACTTTCAACA GAAGTTCAAA TTTTAGAAAC AGGTATTAAA 2 04 0 

GTAGTAGATT TACTAGCACC TTATATTAAA GGTGGTAAAA TCGGATTGTT CGGTGGTGCC 2100 

GGTGTAGGTA AAACAGTATT AATCCAAGAA TTAATTAACA ACATCGCTCA AGAGCACGGT 2160 

GGTATTTCTG TATTCGCCGG TGTAGGTGAA CGTACTCGTG AAGGTAACGA TTTATACTTC 2220 

GAAATGAGTG ACAGTGGTGT AATTAAGAAA ACAGCCATGG TATTCGGGCA AATGAATGAG 228 0 

CCACCTGGTG CACGTATGCG TGTTGCATTA TCTGGTTTAA CAATGGCTGA ATATTTCCXST 234 0 

GACGAACAAG GTCAAGACGT ATTATTATTC ATCGATAACA TTTTCAGATT TACACAAGCT 24 0 0 

GGTTCTGAGG TATCTGCATT ATTAGGTCGT ATGCCTTCTG CAGTAGGTTA CCAACCAACA 24 6 0 

CTTGCTACTG AAATGGGACA ATTACAAGAA CGTATTACGT CTACAACAAA AGGATCAGTT 2 52 0 

ACTTCTATTC AAGCGGTATT CGTACCTGCC GATGACTATA CTGACCCAGC GCCTGCGACA 2 58 0 

GCGTTTGCCC ATTTAGATGC AACTACAAAC TTAGAACGTA AATTAACTGA AATGGGTATT 264 0 

TATCCAGCCG TGGATCCATT AGCGTCTACA TCAAGAGCAT TGGAACCATC AATTGTAGGT 2 70 0 
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CAAGATATCA TTGCTATCTT AGGTATGGAC GAATTATCTG ATGAAGATAA ACAAACAGTT 2 820 

GAACGCGCAC GTAGAATTCA ATTCTTCTTA TCTCAAAACT TCCACGTAGC GGAACAATTT 28 80 

ACTGGTCAAA AAGGTTCTTA TGTACCTGTT AAGACAACAG TTGCAAACTT TAAAGATATC 294 0 

TTAGATGGTA AATATGACCA TATTCCAGAA GATGCATTCC GTTTAGTTGG TAGCATGGAT 3000 

GATGTTATTG CAAAAGCTAA AGATATGGGT GTTGAAGTAT AACAATTAGG AGGAATGGAT 3060 

AATGAATACA TTAAACCTAG ATATTGTCAC TCCTAATGGT TCTGTTTACA ATCGTGATAA 3120 

TGTTGAACTC GTTGTTATGC AAACAACAGC TGGTGAGATA GGTGTCATGA GTGGACATAT 3180 

TCCAACTGTA GCTGCTTTAA AAACAGGCTT TGTAAAAGTG AAATTTCACG ATGGAACTGA 324 0 

ATATATTGCT GTAAGCGATG GCTTTGTTGA AGTTAGAAAA GATAAAGTTT CAATCATTGT 33 00 

TCAGACTGCA GAAACTGCAA GAGAAATTGA TGTTGAAAGA GCTAAATTAG CCAAAGCAAG 3360 

AGCAGAGTCT CACTTGGAAA ATGATGACGA CAATACTGAT ATTCATAGAG CCGAAAGAGC 3420 

TTTAGAGAGA GCAAATAACC GTTTGCGTGT GGCTGAATTA AAATAGTAAA TAAAGGGTCG 34 80 

AAGATGTGAT TTCATATCTT CGACCCTTTT TTGAATTATA TTGATTTAAA GATACAAAAC 3 54 0 

ATGAGAGGGG GGAAGGAATT GATAAAGAAC CATTAAAGAT TTATGATGTA GTGGTTCTTT 3600

ATCATTAAAC ACAGCTAATG TGTATTTAAA AATAGGaayA CATgAGTAAA ACTCATGTAT 3660 

AAGAAATACT AATTTCTAAA GAAAAAGTAT TTCTTTATGT TGGGGCCCCG TCAACTACTG 3720 

CCAAATACAA CACTATAGAG TCTAGACATT GATTTATGTC CGACTCCCAA GAATAGTTTT 3780

ACTTTTTTAC AATCACTAAT AGATTGCTAA AATCAAAATT TCCTTCACCA CTATCTACAG 3 840 

TCGACATTTC ATTTTTTGAA ATTATCTACA TTTTTTCATA CCAAGATATT TTATAGTTAT 3 900 

GATATTTATG TAAAAAGAAT TATATAGTAA GTTAGCTTAA ACTTTACTAA AAACGGGTAT 3960

TAAACTTTGT ATCATTATTT AAATTTTTCA TGTACAATGT AATACAGTAA TCTTATGAGG 4020 

TGATAAAATG GATTATATCG GACAATATGC AGTTATCCAT TTAGTGTTAC ATGTTGTATG 4080

TATTTGTATT GCCTATTGGG CTTTACAATC AATTAGATTA GATCAATTTT TTAAAAAAGG 4140 

ATACGCCACT CAATTACAAG TGTGTATGAT ATTTGTTGCT ATTTTATTAG GCACTGCAGT 4 2 00 

AAGCAATTTT ATTGTAGATT TGTTACAATA CTCGACGCAG GTAAAATATT TAATAAAATA 4260 

AGTCTAACTC TATGATTTGT AATCAAAACT AGATATAATT AAATAATGAC TTAAAATAAT 4 32 0 

TTTAAAATAG GGAAATGTAA AGTAATAGGA GTTCTAAGTG GAGGATTTAC GATGGATAAA 43 8 0 

ATAGTAATCA AAGGTGGAAA TAAATTAACG GGTGAAGTTA AAGTAGAAGG TGCTAAAAAT 4440

GCAGTATTAC CAATATTGAC AGCATCTTTA TTAGCTTCTG ATAAACCGAG CTU^TTAGTT 450 0 
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GACGTTACAT ACAAAAAGGA CGAAAATGCT GTTGTCGTTG ATGCAACAAA GACTCTAAAT 4 62 0 

GAAGAGGCAC CATATGAATA TGTTAGTAAA ATGCGTGCAA GTATTTTAGT TATGGGmCCT 46 80 

CTTTTAGCAA GACTAGGACA TGCTATTGTT GCATTGCCTG GTGGTTGTGC AATTGGAAGT 4740

AGACCGATTG AGCAACACAT TAAAGGTTTT GAAGCTTTAG GCGCAGAAAT TCATCTTGAA 4 8 00 

AATGGTAATA TTTATGCTAA TGCTAAAGAT GGATTAAAAG GTACATCAAT TCATTTAGAT 4 86 0 

TTTCCAAGTG TAGGAGCAAC ACAAAATATT ATTATGGCAG CATCATTAGC TAAGGGTAAG 4 920 

ACTTTAATTG AAAATGCAGC TAAAGAACCT GAAATTGTCG ATTTAGCAAA CTACATTAAT 4 980 

GAAATGGGTG GTAGAATTAC TGGTGCTGGT ACAGACACAA TTACAATCAA TGGTGTAGAA 504 0 

TCATTACATG GTGTAGAACA TGCTATCATT CCAGATAGAA TTGAAGCAGG CACATTACTA 5100 

ATCGCTGGTG CTATAACGCG TGGTGATATT TTTGTACGTG GTGCAATCAA AGAACATATG 5160 

GCGAGTTTAG TCTATAAACT AGAAGAAATG GGCGTTGAAT TGGACTATCA AGAAGATGGT 5220

ATTCGTGTAC GTGCTGAAGG GGAATTACAA CCTGTAGACA TCAAAACTCT ACCACATCCT 5280

GGATTCCCGA CTGATATGCA ATCACAAATG ATGGCATTGT TATTAACGGC AAATGGTCAT 5340

AAAGTCGTAA CCGAAACTGT TTTTGAAAAC CGTTTTATGC ATGTTGCAGA GTTCAAACGT 5400

ATGAATGCTA ATATCAATGT AGAAGGTCGT aGTGCTAAAC TTGAAGGTAA AAGTCAATTG 5460 

CAAGGTGCAC AAGTTAAAGC GACTGATTTA AGaGCAGCAG CCGCCTTAAT TTTAGCTGGA 5520 

TTAGTTGCTG ATGGTAAnAC AAGCGTTACT GAATTAACGC ACCTAGATAG AGGCTATGTT 5580 

GACTTACACG GTAAATTGAA GCAATTAGGT GCAGACATTG AACGTATTAA CGATTAATTC 5 64 0 

AGTAAATTAA TATAATGGAG GATTTCAACC ATGGAAACAA TTTTTGATTA TAACCAAATT 5700 

AAACAAATTA TACCTCACAG ACAGCCATTT TTATTAATTG ATAAAGTAGT TGAATATGAA 5760 

GAAGGTCAAC GTTGTGTGGC TATTAAACAA GTATCAGGAA ACGAACCATT CTTTCAAGGG 5 820 

CATTTTCCTG AGTATGcGGT AATGCCAGGC GTATTAATTA CTGAAGCGTT AcTCAAACAG 5880 

GTGCGGTAGC TATTTTAAAT AGTGAAGAAA ATAAAGGTAA AATCGCTTTA TTTGCTGGTA 594 0 

TTGATAAATG TCGTTTTAAA CGTCAAGTAG TACCTGGTGA TACTTTAACG TTGGAAGTAG 60 00 

AAATCACTAA AATTAAAGGA CCAATAGGTA AAGGTAATGC TAAAGCTACT GTCGATGGTC 6060

AACTTGCTTG TAGTTGTGAA CTTACATTTG CAATTCAAGA TGTAAAATAA AACAAAAAAA 612 0 

ACATTCAAAG ATTTAATGTG TTGGCATAAT CTTTGAATGT TTTTTATTTT ACTCTTCTAA 6180 

TTTTTCATCC TTTAACTTTG GTTTAGACTG CaTCATTCGA TTAAATGATT TTTTTAATTC 6240

TTCACCAGAT AATCCATCAT CAATAAGTTG GTTCTAATAA ACTTTCAGCA TACTGTTGGA 63 0 0 
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(2) INFORMATION FOR SEQ ID NO: 4 81: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1717 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 481: 

AGTTGCTACA CCAGACATGA TGGGTGAAGT TGGTAAATTA GGTCGTGTAT TAGGACCAAA 6 0 

AGGTTTAATG CCAAACCCTA AAACTGGAAC TGTAACAATG GATGTTAAAA AAGCTGTTGA 120 

AGAAATCAAA GCTGGTAAAG TAGAATATCG TGCTGAAAAA GCTGGTATCG TACATGCATC 180 

AATTGGTAAA GTTTCATTTA CTGATGAACA ATTAATtGAA AACyTCaATA CyTTACAAGA 24 0 

TGTATTAGCT AAAGCTAAAC CATCATCTGC TAAAGGTACA TACTTCAAAT CTGTTGCTGT 300

AACTACAACA ATGGGTCCTG GAGTTAAAAT TGATACTGCA AGTTTCAAAT AATAAATGAT 3 60 

ATAAACAATT ACAGGCTGAA AGAAATATCT TTCAGTCTGT AAAAATATAT TGACAATAAG 4 20 

TAATTTCCAA GTTATATTAC TTATTGTGAT TATTTTACCT AAGACAGTAG GAGTTATTTA 480

TAACTTAAAA TTTATCCTGC CGAGGCTAAA ATTGACTTGA ACGTGATGAT CTATGATCTT 54 0 

TCAAGCACTT TTTGCCGTGG GTAGAAAGTG CTTTTTTTAT TAATTTTAAA AAAAGCACCA 600 

AAAATTTAAA TGGAGGTGTC TGAATGTCTG CTATCATTGA AGCTAAAAAA CAACTAGTTG 66 0 

ATGAAATTGC TGAGGTACTA TCAAATTCAG TTTCAACAGT AATCGTTGAC TACCGTGGAT 72 0 

TAACAGTAGC TGAAGTTACT GACTTACGTT CACAATTACG TGAAGCTGGT GTTGAGTATA 78 0 

AAGTATACAA AAACACTATG GTACGTCGTG CAGCTGAAAA AGCTGGTATC GAAGGCTTAG 84 0 

ATGAATTCTT AACAGGTCCT ACTGCTATTG CAACTTCAAG TGAAGATGCT GTAGCTGCAG 900 

CGAAAGTAAT TTCTGGATTT GCTAAAGATC ATGAAGCATT AGAAATTAAA TCAGGCGTTA 96 0 

TGGAAGGCAA TGTTATTACA GCAGAAGAAG TTAAAACTGT TGGTTCATTA CCTTCACACG 1020 

ATGGTCTTGT ATCTATGCTT TTATCAGTAT TACAAGCTCC TGTACGCAAC TTCGCTTATG 1080

CGGTTAAAGC TATTGGAGAA CAAAAAGAAG AAAACGCTGA ATAATTTTTA GCGTAAAAAA 1140

ATTAAAAATA ATGGAGGAAT TATAAAATGG CTAATCATGA ACAAATCATT GAAGCGATTA 1200 

AAGAAATGTC AGTATTAGAA TTAAACGACT TAGTAAAAGC AATTGAAGAA GAATTTGGTG 1260 

TAcTGcAGCT GCTCCAGTAG CAGTAGCAGG TGCAGCTGGT GGCGCTGACG CTGCAGCAGA 1320

AAAAACTGAA TTTGACGTTG AGTTAACTTC AGCTGGTTCA TCTAAAATCA AAGTTGTTAA 13 80 
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TCCTAAAGTA ATCAAAGAAG CTTTACCTAA AGAAGAAGCT GAAAAACTTA AAGAACAATT 1500 

AGAAGAAGTT GGAGCTACTG TAGAATTAAA ATAATTCAAG TATCTTAAAC TTAATAATCA 1560 

AAGTTTTATA GCAAGTATTG CTATAATATA ATGATTCTTT GAGAAGTTAA AACCCCGTTA 1620 

TTTTGATAAC GGGgTtTTAT TCaTTTAAAG ACTGAGTGAA ATGTTATAAT TATAATGACG 16 80 

AGTTACAAAG TGAAGATGAG GTGG-GAATAA TGAGTCA 1717 
(2) INFORMATION FOR SEQ ID NO: 482: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1279 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 482:

GTAAATCTGT TACICGAAAT GTAACGATTA AAGAAAAGCG CTCATCTCAA ACATATATTT 60 

TGTTAGGCTA TCCAACAAAA GCACAGAAGA ATAGTCATAG CAAATATAGT GGAGTCTTTA 12 0 

TATATAAAGA CTTGAAATCA ATCGAAGATA CAAATAATGC TATTACGATT ATCACCATAA 180

TTACGGCTGT TATTTTCTTA ACAATTACAA CAGTCTTTGC GTTTTTCTTA TCGTCAAGAA 24 0 

TTACAAAACC TTTAAGACGT TTAAGAGACC AAGCTACACG TGTATCTGAA GGGGATTACT 30 0 

CTTATAAACC TTCTGTCACA ACGAAAGATG AAATTGGTCA ATTATCGCAG GCATTTAATC 36 0 

AGATGAGTAC AGAAATCGAA GAGCATGTCG ACGCATTATC CAGATCTAAA A.ATATTAGAG 420 

ACAGCTTAAT TAACTCTATG GTAGAAGGTG TCCTAGGTAT TAATGAGAGT CGACaAATTA 480 

TCTTATCTAA TAAGATGGCG AATGATATTA TGGACAATAT TGATGAAGAT GCTAAAGCTT 54 0 

TCTTATTAAG ACAAATAGAA GATACTTTTA AATCAAAACA AACTGAAATG CGCGATTTAG 600 

AAaTGAATGC ACGATTCTTT GTTGTGACCA CAAGCTATAT CGACAAGATT GAACAGGGAG 660 

GTAaAAGTGG TGTTGTTGTG ACAGTTCGTG ATATGACTAA TGAGCACAAT CTAGATCAAA 72 0 

TGAAGAAAGA TTTCATTGCT AATGTATCAC ATGAATTACG TACACCGATA TCATTACTTC 7 80 

AAGGTTATAC TGAATCAATT GTAGATGGTA TTGTTACAGA ACCGGATGAA KTPiPiPLAGhAT 84 0 

CGCTTGCCAT TX^TCCTTGAT GAATCGAAAC GTTTAAATCG TTTAGTTAAT GAATTGTTAA 9 00 

AIGTCGCACG CATGGATGCT GAAGGGTTAT CCGTAAATAA AGAAGTTCAG CCTATTGCAG 960 

CGTTACTAGA TAAGATGAAA ATTAAGTATC GCCAACAAGC TGATGATTTA GGTCTAAATA 1020

TGACTTTTAA TTAyTGTAAG AAGCGTGTTT GGAGTTATGA TAwGGATCGC ATGGACCAAG 10 8 0 
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(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1144 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear
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45 



TTACTTGTGA TGAAAATGAA AGCGAAGATA TTTTATACAT TAAAGATACA GGTACAGGCA 1200

TTGCACCAGA ACATTTACmA CAAGTATTTg ATCGTTTTTA TAAAGTTGAT GCAGCGnAnA 1260

ACCCCGnGGT AACCAnGTA 1279

(2) INFORMATION FOR SEQ ID NO: 483:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 483:

GAGCTGTTGT TACTTTGATG CCTGCAGCTT TATTACGGCT GACTTGGTAA TGATAAGTTT 60 

CAGCATATTG CTCAATATAT GCTATATCAT ATTGAATGGT ACGAGGTGAT ACACCAAGTT 120 

GATTAGCAAT GGTATTGATT GGAATAAACG TTTGCTCATG AATTAAAAGA TACAAAATTT 180 

CGATTTGTCT ATAACTTAAC AACGTAATAT CCTCCTATTT GTAATTGTAA GCGATTTCTT 24 0 

AAAAACGTAG ATATGCAATC TCTTTCATAT TTTAATCCGA AAAATTGCAT ATCAAAATGT 300

TTATGGCGCA AGATTTTATA GGAACTTTTA AAATAAATTA rATATTCATG TTGACAATTT 3 60 

AAAAATGTCG CAGTATATTT AGTTAGACAT CTAACGAAAT GGTGGTGCAA TAAATGGAAT 420 

TCACTTATTC GTATTTATTT AGAATGATTA GTCATGAGAT GAAACAAAAG GCTGATCAAA 48 0 

AGTTAGAGCA ATTTGATATT ACAAATGAGC AAGGTCATAC GTTAGGTTAT CTTTATGCAC 54 0 

ATCAACAAGA TGGACTGACA CAAAATGATa TTGcTAAAGC ATTACAACGA ACAGGTCCAA 6 00 

CTGTCAGTAA TTTATTAAGG AACCTTGAAC GTAAAAAGCT GATCTATCGC TATGTCGATG 66 0 

CACAAGATAC GAGAAGAAAG AATATAGGGC TGACTACCTC TGGGATTAAA CTCGTAGAAG 720 

CATTCACTTC GATATTTGAT GAAATGGAAC AAACACTCGT ATCGCAGTTA TCTGAAGAAG 780 

AAAATGAACA AATGAAAGCA AACTTAACTA AAATGTTATC TAGTTTACAA TAAATGATAA 84 0 

GTGTGACTGG TAGAAATCAG TCACTTTGTC TTTAATATTA TAGTTAGATA TCTAATTGTT 90 0 

AGTAAGCTAA TTATTGGAAA AGACAAGGAG TATTGAACAA TGAAAGACGA ACAATTATAT 96 0 

TATTTTGAGA AATCGCCAGT ATTTAAAGCG ATGATGCATT TCTCATTGCC AATGATGATA 102 0 

GGGACTTTAT TAAGCGTTAT TTATGGCATA TTAAATATTT ACTTTATAGG ATTtTyAGAm 108 0 

GAyAGCCACA TGATTTCTGC tAatCTCTCT AACACTGCCA GTATTTGCTA TCTTAATGGG 1140

GTTA 1144 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1158 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 484:

ACAACCAATT TTACTAAACA TGGTrrTCAT gCATTTTCAA TATTTAGATG TCGACCGCAT 60 

AATCGAAGAA TCGCCGACAA TAGTACTTAT CGATGAGTTA GCACATACGA ATATTTCTAG 12 0 

AGATCGTCAT GAGAAACGAT ATATGGATAT TGAAGAAATT TTAAATCATG GTATCGATGT 180 

TCATACCACT TTGaACATTC aaCATATTGA AAGTTTAAGT AGTCAAATTG AACTGATGAC 240 

CGGTGTACAT GTTAAAGAAC GTGTACCCGA CTATTTCATA ATGAGCGCCG ATGTATTAGA 3 00 

AGTCGTAGAT ATCTCACCTG AACAATTAAT TAAACGCTTA AAAGCTGGCA AGGTATATaA 3 60 

AAAGGATAGG CTAGATGTAG CATTTAGTAA TTTCTTTACG TATGCCCACC TAAGCGAAtG 4 20 

CGTACATTGA CGTTAAGAAC AGTTGCCGAC TTGATGAGTG ATAAAGAAAA AGTCCGACAC 4 80 

AACCATAAAA CGTCACTCAA ACCTCATATT GCTGTGGCAA TTAGTGGGAG CATTTATAAT 540

GAAGCAGTAA TTAAAGAGGC ATTCCATATT GCTCAAAAAG AACATGCGAa GTTCACTGCT 600 

ATTTATATAG ATGTATTCGA AAAAAACAGG CAATATAAAG ATAGTCAAAA GCAAGTGCAT 660 

CAACATCTCA TGCTTGCAAA ATCATTAGGA GCAAAAGTAA AAGTAGTTTA TAGCCAAACC 720 

GTTGCATTAG GATTAGACGA ATGGTGTAAA AATCAAGATG TAACCAAATT AATTATCGGA 780 

CAACATATTA GAAATAAGTG GCGAGACTTT TTCAATACAC CTTTAATTGA CCATTTAATG 84 0 

TCCTTTGAAC ATAGCTATAA AATCGAAATC GTTCCAATCA AACAAATACC TGTTGAATTG 900 

AAAATGAACA AATCACCCTA TCGTCCTAAA GGCAAACGTT TCGCCATAGA TATGTTAAAA 96 0 

ATGATTTTGA TTCAAATAAT TTGTGTAATG ATGGGACTGT GGATTTATCA ACTTGATAAG 102 0 

CATGAGTCTA GTACGATTAT TTTAATGATT TTTCTCATCG GCATCATTTT ATTATCCATT 1080 

TGGACGCGGT CCTTCATCAT TGGCTTTTaG CAGCAATTAt TAACGTATTT GTgTkTAATT 114 0 

ATkTTTTtAC GGAACCTA 1158

(2) INFORMATION FOR SEQ ID NO: 485:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2224 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 485: 

ATCATATGGT CGATTTAACA GATCCAACGT ACTGCtAAAT AATTACATGA CGTTTAAACA 60 

TAGCATTGAT TATAACTATT TCTAAGTCTT CGCATTATTT GCGATGATGT GGGAATAGTT 120

ATTTTTATTT AAAAATATAA AAAAATAGAT GCAGCAAAAT TTTAAAGCAT TTTATTTTGA 180 

ACATATTAAA AGGGAGCGTA TCATAATGGA ATGTAATGTT TATATCGTAT GCATTACGGA 24 0 

TAAATAATAT ATAAATCATT CTTGAGGAGT GAAAGAATAA TGAGAGACTA CACAAAGCAA 3 00 

TACATTAATG GCGAATGGGT AGAAAGTAAT AGTAATGAAA CGATAGAAGT TATAAATCCA 3 60 

GCAACCGAAG AAGTAATCGG GAAAGTTGCT AAAGGTAATA AAGCTGATGT TGATAAAGCC 4 20 

GTCGAGGCGG CAGACGATGT TTATTTAGAG TTCCGTCATA CATCTGTGAA AGAAAGACAA 4 80 

GCGTTATTAG ATAAAATTGT AAAAGAmTAT GAAAACAGAA AAGACGATAT TGTACAAGCT 54 0 

ATTACGGATG AATTAGGTGC TCCTTTATCA TTATCTGAGC GTGTCCATTA TCAAATGGGA 6 00 

CTAAACCATT TTGTTGCAGC GAGAGACGCA TTAGATAACT ACGAATTTGA AGAACGCCGC 660 

GGAGATGATT TAGTTGTTAA AGAAGCAATC GGTGTATCTG GATTAATTAC ACCGTGGAAC 720 

TTCCCTACAA ACCAAACATC ATTAAAATTA GCAGCAGCAT TTGCGGCTGG TAGTCCAGTT 780

GTACTTAAAC CATCTGAAGA AACACCATTT GCAGCTGTTA TTTTAGCTGA GATTTTTGAT 84 0 

AAAGTCGGTG TTCCTAAAGG TGTATTTAAC CTTGTTAATG GTGATGGTGC TGGTGTTGGG 90 0 

AATCCTTTAT CTGAACATCC TAAAGTACGC ATGATGTCAT TTACAGGATC AGGCCCTACT 960

GGTTCTAAAA TTATGGAAAA AGCCGCTAAA GATTTTAAAA AGGTATCATT AGAGCTTGGT 102 0 

GGCAAATCAC CATATATCGT CCTAGATGAC GTAGATATTA AAGAAGCGGc TAAAGCAACa 1080 

aCAGGCAAAG TTGTTAATAA TACTGGTCAA GTATGTACAG CTGGTACACG TGTTTTAGTG 114 0 

CCTAACAAAA TTAAAGATGC ATTCTTAGCT GAATTAAAAG AACAATTTAG CCAAGTGCGT 12 00 

GTCGGTAATC CAAGAGAAGA TGGTACACAA GTAGGCCCTA TCATTAGTAA AAAACAATTT 1260 

GATCAAGTAC AAAATTATAT TAATAAAGGT ATTGAAGAAG GTGCTGAATT ATTTTATGGT 132 0 

GGTCCTGGTA AACCAGAAGG ACTTGAAAAA GGATACTTTG CACGTCCGAC AATTTTTATT 13 8 0 

AATGTAGATA ATCAAATGAC GATAGCACAA GAwGAAATTT TTGGGCCAGT AATGTCAGTT 144 0 

ATCACTTATA ACGATTTAGA TGAAGCGATT CAAATTGCAA ATGATACAAA ATATGGTTTG 1500 

GCAGGATATG TTATTGGTAA GGACAAAGAA ACATTGCATA AAGTAGCTCG TTCTATTGAA 1560 

GCAGGTACAG TAGAAATAAA CGAAGCAGGT AGAAAGCCAG ATTTACCATT TGGTGGCTAT 162 0 

AAACAATCTG GTTTAGGTCG TGAATGGGGC GATTATGGTA TTGAAGAGTT CTTAGAAGTG 16 8 0 
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w 



AGTGCACATG ACTAATTAAG TTTTGTGTAC TGTTTTAATT TTGCAATTTT TATAAATAGA 1800 

TTTTGTAATT AAAATAAAAA TTTGCTATAG TTATTCATGT ATTTAAAAGG TTGGGGATTA 1860

GCATAATGGG ATTGTGCTAG CACAGTTATT TATGCATTGT CATGCCTATC TATTACTTAC 1920 

TAACTAAAAA ATAATGAAAT GGGTGTAAAC TATATGCCTG AAAGAGAACG TACATCTCCT 1980 

CAGTATGAAT CATTCCACGA ATTGTACAAG AACTATACTA CCAAGGAACT CACTCAnAAA 204 0 

GCTAAAACTC TTAAGTTGAC GAACTATAGT AAATTAAATn AAAAAGAACT TGTTCTAGCT 2100 

ATTATGGAAG CACAAATGGA nAAAGATGGT AACTATTATA TGGAAGGTAT CTTAGATGAT 2160 

ATACAACCAG ATGGTTATGG TTTTTTAAGA ACAGTGAACT ATTCTAAAGG GGAAAAAGAT 2220 

ATTT 2224 

(2) INFORMATION FOR SEQ ID NO: 486:

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1690 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 486:

ACATTACnTT GAATCGAAAG TTTCATAAAC GACTTGAATG CCAGTTTCTT TTTCAAATTT 60 

CTTAATTAAC TCTGGATCAA TATATTCGCC CCAATTGTAT ACGTAAATTT TTTGATTTGT 120

ATGCACTTGT TCTTTAGATT TAAACCAATG ACTTAAAGTA AGACAAAGCA TACCCACAAC 18 0 

TAATGCACCT ATAATGAGTT GTAAAAATCG TTTCATTATT TTACACCTCG CTTGATTAGT 24 0 

TTTTTCTTAT TTATCACGTA TTGAATCAAA TAATATCCTA GTATTCCTAA TACAATAACA 300

GCAAACAATA ATGTTGAAAT CGCATTAATT TCCATACTAA TTCCTTTTCT CGCCATAGCA 360 

TAAACTTCAA CTGATAACAC ACTAAAGCCA TTACCAGTAA CGAAGAAACT TACTGTGAAA 420 

TCGTCTAGTG AATAAGTTAA AGCCATAAAG AATCCTCCTA TAATAGAAGG TAAAATATTA 4 80 

GGAATAATAA TGTTGCTTAA TAATTGTGGT TCAGTCGCTC CTAAATCTCT TGCAGCATTT 54 0 

AACATATTAT TATTCATyTC ATACAGTTGT GGTAAGACGA TAATCACAAC TATAGGTATG 6 00 

CAAAATGCAA TATGAGATAT TAGAACTGTC CaAAAkCCTA AACCAAGACC AGTAAAATGG 66 0 

CCAATCGTTG TAAACATAAT TAAGAATGAT GCACCTATGA CAACGTCGGA 1X3ATACCATC 72 0 

AAGACATTAT TCAATGTTAG TAAAGTTACT TTAAACTTTT TATTTCTTAA ATAATAAATA 7 80 

GCAATGGCAC CAAATGTACC AATAACTGTA GAAATTGAGG CTGCTAAAAG TGCTACAGCT 84 0 
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AATGTAAAAT GTTCAAAGTG AATCATATTA CCAGCCGAAT TGAATGAATA GAACATTAAA 960 

AAGAATATTG GGATGTATAA AATCGCTAAA AGTATCCCGA TATACAGCTT TCCATACCAT 1020 

TTCATATGAT TCACCCTCTC CCATTAGATG ATTTTGTAAT GATTAAAATG AATGCCATAA 1080

ATACAATTAA GAATATAGCT ATAGTTGATC CCATACCATA ATTTTGAATT GTTAAAAATT 1140 

GTTCCTCTAT TGCCGTACCT ATATTTATGA CTTTATTACC TGCAATTAAT CTTGTAATCA 1200 


TAAATAATGA AAGTGATGGA ATAAAGGTTA CTTGAATCCC AGTCATAACA CCTTCTTTTG 1260 

TTAACGGCAT GATTACTTTT CTAAAAGTAT AGAAAGGACT GGCACCTAAA TCACTTGAGG 13 20 

CCTGCAATAA ATTATTAGGA ATTGCTTTCA TGCTATTAAA TATAGGTAAA ATCATAAATG 13 80 


GTATATAAAT GTAACTTGCC ACTACTAAAA ACGCACCAGT TGTAAATAAC AAATTGAATG 14 4 0 

ATGGTAAATT AAATAAGTGG AAAATTGATT AATCACGCCA TCATGACTTA ATAAACCTAT 1500 

AAAAGCATAT GTCTTTAACA ATAAATTTAT CCATGTTGGA ATAATCATTA TCATTAATAA 1560 

GATATTTTGA AATTTCGAAC GAGTAATATA ATAGGCAGnT GGATAACTGA TAGTCAAGGT 1620 

AATAATTGTT ATTGAAGCGG CATATAAAAT TGAATATGCA AACATTTTCA AATATTTTGT 16 80 

AGTAAAAATT 1690

(2) INFORMATION FOR SEQ ID NO: 487: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2112 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 487:

ACGAAAAGAA ATATTATGAT GAACAAAAAG AAAGAATAAC GATTTATATG AAGTACAATG 6 0 

TGAAAGGTTA TAAAAATATA AGCTTCGCTA ATTTTAAAGA AAACCCAATG GATGGTTATT 12 0 

CTATTAGTGG TTATATAAAT AAtGaTAAAA AGTTATCATT TACAGCTGGT ATAAGATCTG 180 

TTGATGATTT TCAATTTGAT ACCGATATTT CTTATACAGA TGAATTGGGT AGAAAATTTA 24 0 

ATAAAAATCC TAAGTCAGTT TCTGAAATAA AAAAAGAGCA AAATACGTCC AATAAATAAT 3 00 

TGTTCATATT GTGATGAAAC AAAAATATAA GTCATTAGAT GAGTTTAACT ATGTTATAAA 3 60 

TATTTGTAGT ATCTATAAAA ATCTCGACAC TATTAAAATG ATAAAGTGCC GAGGTTTTCT 4 20 

TACTTATTTA GTTAATTCAA AGTTTATGCC AGATTCATAA GAATTTGTGA CACTTTTAAT 4 80 

AGTGTAcCAT TGATTATTAC AATTTATCAA ATGGTCCTTT AGAAGGTATA AATAACAAAA 54 0 
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TATTATTATG TTCAAAACTT TACGCTCCAA AAAGTAAAAA GGAAGTTAAG CAATGTTTAG 66 0 

TTGCTTAaCT TCGGaTATTG AACGCATCAG TCCAATTTGA CATAGAGCCT TTTTTAGTTC 72 0 

TTGATGTTTC TCTTTAAAAC CTTGCATATT TTACAAAAAG AAAAATTAGC AGTATAATTA 780

AGACAACGAA AATAAGTATT TACTTATACA CCAATCCCCT CACTATTTGC GGTAGTGAGG 840

GGATTTTTAT TGGTGCGGCT ATATGTCACC TATTTTGTAT TGCGTCTACT TAGCCAATAA 900 

GAAAAAAACG CAATGGCACA GCCACTGATG ACTGGTGCTA TGATGTGAAC GaAAATAAGC 960 

ATCACCTTAT ACACCTCCTC TCTGCGTCTA AATTGACXSsC TGAGaGrTAG GcGACTCTAC 102 0 

TATTATATCA TCGGCAAATA TACAAGCACA GTCACTTGCT TCTGATAAGT TATATGATTC 108 0 


TAGCTGATAG ATTGAATCGT CTACACTTAA TTGGACAAAT TCTATGAGAA TAGATATTGT 114 0 

TAATTTAAGA AAGTAGGCGA TTTTATTATG ACAAGAGAAA GAAGATCATT TAGTTCAGAG 12 00 

TTTAAGTTAC AAATGGTTAG ATTATATAAA AATGGTAAGC CTAGGAATGA AATTATACGC 12 6 0 


GAGTATGATT TCACACCTTC GACGTTTGTA A.ATOOCGGTT ATAAAATGTA GGAAAATGGA 13 2 0 

TAAAGCAACA TCAAAACACG GGTACATTCA ATCACCAAGA TAACTTATCG GATGAAGAAA 13 8 0 

AAGAGCTGAT TAAATTACGC AAAGAAGTTC AACATTTAAA AATGGAGAAC GATATTTTAA 14 4 0 

AGCAAGTAGC GCTGATTATG GGGCAAAAAT AGAAGTCATT CAAAAGAATG CACATCAATA 150 0 

TTCAGTATCA GCAATGTGTA AAGTCCTGAT AATACTAAGA AGTACCTATT ATGATTCTAT 156 0 

AAAAAGPJKPiA GATAATAAAA TCACTAAAGA TGATTCAAAC ATAGAACATG CCGTCATAAA 1620

TATTTTTAAT TCTA-ATAGA_A AJVGTCTTTGG TACAAGACGA ATTAAAAATC ATTTAAATGA 168 0 

CAAGGGTCTC ACTGTATCTG GACAAAAGAT AGGTCGATCA TGAAAAAATC TAGTTTCTGT 174 0 

TTATACGAAA TCTAAATACA AAAATCATCT AAAAGAAACT AATGAAAAAC GAATTAAAAA 1800

TCTTTATTAT TAGCTGCTGG TGTATTATTT GTTAGTCCAA TTTCATTATC TTATAATTCA 1860 

GATGTAGCTC ATGCTGAAGA TAAGTTAGAC CATTCTCAAG CAAAGGTAAT ATATTTGAGT 192 0 


AACCAAAATT TATTTGATGA ACTTGAGAAA AAAGGTTATA AACTGGAAGA TATATTTACA 1980 

AAAGAAGAAA TAAAAAAATA TAAAGCTGAA GACCAATTGA GAGCGGGTAA AACTCAATAT 2 04 0 

GTAGAAACAG GTAAAGATAC TGCAACATTA TATCTTTCTT CTGCATATAC AAAAAC7VATA 2100 

GCTGCTTTAG GT 2112 
(2) INFORMATION FOR SEQ ID NO: 488:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 454 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 488: 

GTAGGCACAC ATCTGCCATA TAAACATTCT TTTATACTAG TGTTCTCATA TAGTGTAGAG 6 0 

TTATAGTCTC CTTCTTGAAT CTCGAATAAT TCAATCAACC TATCAACCTT AGTCTCTTCC 12 0 

GTTACTTCTT TTTCAATATC AACTATGAAG GGGATATCAA TTGGAATAAA ACTTGACGTC 18 0 

GAACACTTAT TTGTATTTGG ATGAAAACGA ACGAATCCAT CACTAAATCC TGTTGAAAAA 24 0 

AATATTTTTC CTTGTGATAG ATCCGGATTT TCTCGCGCCC ATTTAATTAA TTCATCTAAT 3 00 

CTCATTTCTT TTTTAACTTT GATTTTCATT GTTATATCTC CTCTTGAACA GTAAATTTAT 360 

CGTTAAtTGA TACGTATCCA GTCACATTAC ATAAGATGCT ATCAACATCA AAAGTCACAC 420 

AACAGTTGCG TTCAACATCA TTTGAATAGA ATCT 4 54 



(2) INFORMATION FOR SEQ ID NO: 489:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1372 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 




(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 489: 



TTGTCAGAAT TAGAATGCTT TTGAGTTACT TCATAATACT CATCAGTTTT TTGTGTATCC 60

TTTTGACTTT TATTTATTTC TTTCCACTTA CCAGTATGAC TTTCTTTTTT TACAGTTATT 120

TTCGGTTTGT TTAAAAAATG ATAACCGATG TTCTTTTTGG TATCTGTGGA CTCTAAAAAG 180

ACTGAATTGT TTTCCTGATT ATCAGAATTG GTTGTTTGTT TATCGTCTGT ATATAACGTA 240

TACCCATTCG CTTTGCGATT TTCAATCGTT GTAACTGGAT TCTTTGATGA ATCAGTACTT 300

TTAATGCCAG TTCCTAAGAA GACAATTTTA TCGTTTAATA TGAAATATGA TTTTTTGGCA 360

GTTAAAGTTT TGTCCTGATT TTCAAAATCC ATTCCGATAC TAGCATGTTG GTCATCAACT 420

TTTGTTCCGC CAACAAAAGT TTTACTCGAC TTTTTATCAT CCGTATCTTT TAATATTTCA 480

TTGTCTAAAG TTGTTGTACC TGATAAACGT TTCATATCGG CTGTCACCCA GAAGTTATCA 540

TGATAGTGTT TGACATCGCT GTTATATAAA TAAGACATTC CAGCACCAGT GTGCCAACCT 600

TTTAAATTCT CTCCGTTGAT ACTTTCATAG CGTGCTACGT TTTTCGACGT CATACTTAAA 660

CCAAATGCAA AGTCTAAGTC TTTGTTATGA TAGGTGACAC GATCCATGTC ATTATATATT 720

TTAAGTTGTT GTGTTAATCC GTTTTTAGAA ATACTGTTAT CTGTCATTAA AGACTTCATT 780
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10 






ACTGAAGATT TGACAATCTT TTTATACTTA GCTTTTGTTG AATCATCCAT GGCATCACTT 900 

AATCTCAACA ATGATTTCAT TACTGTTGCA GATGCTGAGT GACTGGTTTC ATTTTCACGA 96 0 

CTGATAGCTC TACCTCGTGA TAAATCCATC ATTTCACCTT TATAAATGAG TGGCATAAAT 1020 

CCGTCGTCAA TCCATGACTT TAAGGTTGTA TCATTTTGGG TTTTATCATT AAAAGGTGTT 1080 

TCTTTTATCA TCGGCATCAT TTGAGAAATC CCCTCTAAGA GTACAACGCC ATAAGCACCA 114 0 

GTGTATGGAA CGTCTTGATG ATCAATGTAA GAGCCATCTT TATAAAATCC ATTACGTTCT 1200 

TTACCAGTGG CAGAATCTTG AACGTAAGTG AAGACTTTAT TAAATGAATC TATAGACTTT 126 0 

TTCATCATAT CTTTATCTTC TTCGATAATA CATTCTAAAA GTTTCACCTT AGAAATGTCT 132 0 

ACTAnATTTC CGCCTTTAGC AAGTTCAGnT TTTCCTACAC AAGATAATAT TT 1372 
(2) INFORMATION FOR SEQ ID NO: 490: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 564 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 490: 

ACAACAATAT AGTAAAGCGT CGTTAATTAA ACAATGGGAC CAATTTGTTC GTCTTATATA 6 0 

AATGTACTTT aCCTTTATTT TTACAAAAAT AGCATTTTCC TATGTCATTT AACTAAACAT 120

GTAAGTTCGT ATGAACGAGG TTTGTTAAAT AGATGATTCT AGGAAAATGC TTTTTTCTTT 180 

TGACTTAGTT TAAAATATTT TGCCACTTTG TACTGATAGT AGTTGCATTG TACTGTTGTG 24 0 

CAGATTCTAT GCTATTAATT GAAAACTGTT GCAATTTTTG AGTATTATTT AGTAATTGGT 300

CGACCTTTTC AACCATTTGA TTGATGTCAC CTTGAGGTAC TAAATAGCCA TTAAATCCAT 3 60 

CTTGaATCAG TTCTGaTGGA CCATAATCTA CATCATAACT GATCACTGGT GTACCTACTG 420 

AAAgCGACTC TAAAATTGCT AAGCCAAAAC CTTCCATTTT ACTTGTCGAT AACATCAGTT 4 80 

CTGCTTTAGC AATCTCTTCA TTAATATGCG TCTTAAAACC ATGAAATTTA ACATGTTCCA 54 0 

GATATnATGA TAATCTTCTA CAAG 5 64 
(2) INFORMATION FOR SEQ ID NO: 491: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1277 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4 91: 

TATCCACCCC ACGAnAGCCC CGGAAACTTA TTGTGTTACA AGATATATAA GCAGAAACGA 60 

ACAACAGTTA ACAAAATAAA TGAAATTAAA CGTTTTAAAA ATGAAACAAA TGAAATCATC 120

TATTAGGTTA TGAAACTGTT TATAGCTTGA ATAGAAGCAT TTATTTTTTA GGAGGACAAT 180 

TATTATGCGT CAAACATTTA TGGCAAATGA ATCAAACATT GAGCGCAAAT GGTATGTTAT 24 0 

CGATGCTGAA GGCCAAACAT TAGGTCGTTT ATCATCAGAA GTAGCATCTA TCTTACGCGG 3 00 

TAAAAATAAA GTAACTTACA CACCACACGT TGATACTGGT GATTATGTAA TCGTTATTAA 360 

TGCATCAAAA ATCGAATTTA CTGGTAACAA AGAAACTGAC AAAGTTTACT ACCGTCACTC 420 


AAATCACCCA GGTGGTATCA AATCAATCAC TGCTGGTGAA TTAAGAAGAA CTAACCCAGA 480 

ACGTTTAATT GAAAACTCAA TTAAAGGTAT GTTACCAAGC ACTCGTTTAG GCGAAAAACA 54 0 

AGGTPJKPAAA TTATTTGTAT ATGGTGGCGC TGAACATCCA CACGCTGCAC AACAACCAGA 600 


AAACTACGAA TTACGTGGTT AATTAGAAGG AGGAAATGAC TTTGGCACAA GTTGAATATA 6 60 

GAGGCACAGG CCGTCGTAAA AACTCwGtAG CACGTGTACG TTTaGTACCa GGTGAAGGTA 72 0 

ACATCACAGT TAATAACCGT GACGTACGCG AATACTTACC ATTCGAATCA TTAATTTTAG 780 

ACTTAAACCA ACCATTTGAT GTAACTGAAa CTAAAGGTAa CTATGATGTT TTAGTTAACG 84 0 

TTCATGGTGG TGnTTCACTG GACAAGCTCA AGCTATCCGT CACGGAATCG CTCGTGCATT 900 

ATTAGAAGCA GATCCTGAAT ACAGAGGTTC TTTAAAACGC GCTGGATTAC TTACTCGTGA 960

CCCACGTATG AAAGAACATA AAAAACCAGG TCTTAAAGCA GCTCGTCGTT CACCTCAATT 102 0 

CTCAAAACGT TAATTGTCGG ACGATATATA CAAAACACCT CGATATTATG TCGAGGTGTT 10 80 

TTTTTGGCGT TTTTGCGGCG AATATGGAAT GTGTAGAATA TAAATGAATT TTTACCTTCC 1140

CACCATAAAA GATGAAGAAC CATGAATGTG GAGAACAATA AATAGTTGGA TATTCTGTTA 12 00 

TTTTTTTGGA AGTGGAAGTG GATTTGGAAT ACTTTACTCn AAACGATTAA AAGGTTTAAA 1260 

AAAACAACAA AnAGAAA 1277 

(2) INFORMATION FOR SEQ ID NO: 492: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 673 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 492: 
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TTATTGACAT TGTTTTTATC CAAAATTCAT TGTTAAGACA mTCTTTAT GAAATAATAA 120 

TnATATTGAA GTATATTTTT ATTATTATTA AAAATAAATA AGGGGATACT TATGAGCACA 18 0 

AATCAAACAT TTTTAATATT TGTTATAGCA ATTATTCTAC TTACATCTGT AATAGGAATT 240

GTTGGACGAT ACATGAGTCG TCAACGTCTA TTAAAATCTA TGGAAACATT ATGGCAAACG 3 00 

ATTTCTCCAT TAGAJVGCTTT TATCAGACCG AACTCACATT TCGACTATGA GTATAAGCTC 360 


TACAAGGAAA AATTTGAATC ACATTCATTA GTTGATGATA AAACTTGGTC CGACTTAAAT 420 

ATGAATGCAA TCTTTCATAA GATGAATTAT AATTTAACAG CTATTGGTGA AATGAAGCTA 4 80 

TATGCCTGTT TACGTGGAAT GCTTTCAATT ACGAACAAAT CATTACTTAG TTTATTTAAT 54 0 

GATAATGCTG AATTTAGAAA AAACGTAACA TATCATTTAG CTTTGATTGG tAAAACTGTT 6 00 

aTCCAACATT TCCAGACCAA ATCACACCGG kAAACGTCCA AATATATTGn TCTATGCCCG 6 60 

GTTTACCAGT ATC 6 73 


(2) INFORMATION FOR SEQ ID NO: 493:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1240 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear






(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 493:

AAAAAAAGTT AATGCTGACG GTGTATTAAC TTTTGATATT CTAGAAAATA AATATACTTA 6 0 

TGAAGTTATT AACGCTATAG GGAAAAGATG GATTGTTAGT CATGTCGAAG GTGAAAACGA 120 

CAAGAAAGAA TATGTAATAA CTGTCATTGA TAGGAAATCA GAAGGCGACA GACAACTGGT 180

TGAATGTACT GCTAGAGAGA TTCCCATAGA CAAGTTAATG ATTGATAGAA TTTATGTTAA 240 

TGTAACAGGA TCTTTTACAG TAGAAAGATA TTTTAACATT GTGTTTCAAG GTACTGGAAT 300 

GCTTTTTGAA GTCGAGGGCA AAGTTAAATC TTCAAAGTTT GAAAATGGTG GTGAAGGCGA 360 

TACAAGGTTA GAAATGTTTA AAAAGGGATT AGAACATTTC GGTTTAGAAT ATAAAATAAC 420 

GTATGACAAA AAGAAAGACA GATATAAGTT TGTATTGACG CCTTTTGCAA ATCAAAAAGC 4 80 

GTCTTATTTT ATTTCTGACG AAtCAACGCC AACGCTATAA AACTCGAGGA AGATGCAAGT 54 0 

GATTTCGCCA CCTTCATTAG AGGATATGGT AATTATTCAG GAGAaGAAAC ATTCGAACAC 600 

GCTGGGCTCG TAATGGAAGC TAGAAGTGCA TTAGCTGAAA TATACGGCGA CATCCACGCA 660 

GAACCATTTA AAGATGGTAA AGTGACTGAC CAAGAAACTA TGGATAAAGA ATTACAATCG 720 
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TATCCAGAAG CAGACCCACA ACCCGGAGAC ATAGTACAAA TAAAATCTAC CAAACTAGGT 84 0 

TTGAATGATT TAGTCCGTAT AGTACAAGTT AAAACGATTA GGGGTATAAA CAATGTAATT 900 

GTTAAGCAAG ATGTAACGCT TGGTGAGTTT AATCGAGAAC AACGATATAT GAAAAAAGTT 960

AATACTGCAG CTAACTATGT TTCTGGATTA AATGATGTTA ACCTTTCTAA TCCTAGTAAA 102 0 

GCGGCAGAAA ACTTGAAGTC TAAAGTAGCG TCAATAGCTA AATCAACACT CGATTTGATG 108 0 


AGTAGAACTG ATTTGATTGA AGATAAACAA CAGAAGGTAA GCTCTAAAAC TGTGACTACA 114 0 

TCTGACGGCA CTATCGTTCA TGATTTTATa GATaAATCmA ACATTAAaGA TGTAAAArnCG 12 0 0 

aTTGGAACGa TTGGCXSATtC TGTAGCTAGA GGATCACATG 124 0 


(2) INFORMATION FOR SEQ ID NO: 494:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1311 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 494:





ACGGTGGATT TAGCCTAGAA GATTTAACGC ATCAAGGTAa ATTaTCAGCA TTTAGCTTTA 60

ATGATCAAaC AGGTCAAGCA ACATTGATTA CTAATGAAGA TGAAAACTTC GTAAAAGATG 120

AGCAACGTGC TGGCGTAGAT GCAAATTATT ACGCTAAACA AACATATGAT TATTACAAAG 180

ACACATTTGG TCGTGAATCA TATGaCAACC AAGGTAGTCC AATTGTTTCA TTAACGCATG 240

TTAATAACTA CGGTGGTCAA GATAACAGAA ATAATGCCGC ATGGATCGGT GACAAAATGA 300

TCTATGGTGA TGGTGATGGT CGCACATTCA CAAGTTTATC GGGTGCAAAT GACGTAGTAG 360

CACACGAATT aACACACGGT GTGACACAAG AGACAGCGAA CTTAGAATAT AAGGACCAGT 420

CAGGCGCTCT AAATGAAAGC TTTTCAGATG TTTTTGGATA CTTTGTAGAT GACGAGGATT 480

TCTTAATGGG TGAAGATGTC TACACACCTG GAAAAGAGGG AGACGCTTTA CGCAGCATGT 540

CAAACCCAGA ACAATTTGGT CAACCAGCTC ATATGAAAGA CTATGTATTC ACTGAAAAAG 600

ATAATGGTGG CGTACATACG AATTCTGGAA TTCCAAATAA AGCAGCTTAT AACGTGATTC 660

AAGCAATAGG GAAATCTAAA TCAGAACAAA TTTACTACCG AGCATTAACG GAATACTTAA 720

CAAGTAATTC AAACTTCAAA GATTGTAAAG ATGCATTATA CCAAGCGGCT AAAGATTTAT 780

ATGACgAGCA AACAGCTGAA CAGGTGTATG AAGCATGGAA TGAAGTAGGC GTGGAGTAAA 840

AATATATAAA CaAGAAGAAG TAATGTTAAA CACTTATAAA TAATTAAATT TTAAATACAG 900
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ATTAGATGAG AGGAGTGTGA GGGTTGTCTG CCGAAAGACT ACTCGGCAGT CTAAAATCAT 102 0 

TACAAGTAGT AGATATGTGA TAATTAAATG CTGACTTAGA ATACAAAATT CATTTTAAAA 1080 

GTTGTCACAA AAAATTTACA TGTATTTTTA TTATCTTTTG CAAAACAAAG TGTTAAATTA 114 0 

TAAATGAaAC ATGCATGAAT TTATTTTTTA ATACAAGAAA CGTAACTACC AAAGGAGTTT 1200 

ACAATATGAA GAAAAGTAJ^ CGA1TAGAAA TTGTTTCTAC AATAGTTAAA AAGCATAAGA 12 6 0 

TTTATAAAAA AGAACAAATC ATTTCATATA TTGAAGAATA TTTTGGTGTA A 1311 



(2) INFORMATION FOR SEQ ID NO: 495: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1761 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 495:

TGCACTTTCT AAAAATACTT GCTTTACTTG TTCCAATTCC TTGTCAGACA TTGATTCGGC 60 

TAATTTATTC ATTATTGCCT CAAATACGTT AATTATGTCA TCCATTTCAG TACTATAAAA 120 


ATCAAACCAT TTTGCCGTAT CTTTTTCTCT ATTAAGCTTA TGATCACTTT GAGAGCGCTT 180 

AGCTAATTCT GCATAAATAT ATGGACAAGG TGCCATTGCA GCAATTGTAT AAATAGCATT 240 

TTCACGACTA TGCGCTTGGA AATACATATG TTTTATGTAA TGGTCGCCAC TTGGAGGCCA 300

A_ACTTTTGTT TTAATGATTT CTTCGTATGA TTCACCAACA ATTTGCGCTA AAATATCATG 36 0 

CGCAAGTACT TCACCTTCAA CCATAAATTC TATTTGCTCT ACTAAAAATT TTACGTCATT 420 

CATGCTATTC ATTTTTGGAA TTAACAAAGC ATATAAGTTT GTAAATTCTT TTAAATACGC 480

AGCATCAGCT TTTAAGTAAT GGCGTAATGC GTCAGcTCCT ATATCTCCGG ATAACATCTT 54 0 

CTGAATAAAG TCATCCTCAT AAATATCATT AATGATTGGC TTTGCAGCTT GGTACAATTT 60 0 

TTGTGAAAAT TCCATTGTAA AAAATCCTCC CTAAATAAAA AAACTACTTC CAACATGAAA 6 60 

GTAGTTTGAT GGCAATGTTG CTATACTAGC CCCATCACTT CAATAACTAC TTTCCTACGT 72 0 

TGGTACTAAC CAAATCAGGT CATAAGGGTC TGAACAATTC ATCTCAGCCA TATCATTAGG 780 

CTCCCCTAGT AGTTCCTTAG TATTCAATTG CAAATTAATC TTAGCAAACG GTTTCAACAT 840

TTTCAATTAT TGTTGCTCAG TTGTATTATT ATCTTTAAAT AATAATTCTA TAATGACATA 900 

TATTTGCGAA ATAAAAAAAC CGGAACATAT CGAGAATTCC CCGATATATT CCAATCTAAA 96 0 


AGTTACTTAT ATAACTATTA ATTAGCTATG CATAAATGGC TTATGCAGTA ACCCAATGTC 102 0 
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TTGCTGGTGA TACACCTTTA TATTTAGCAG GTGCTACTGA ATCCCAAGTT GATTGTAAGA 114 0 

ATTGATACTT ACCAGCTGCA CCTGgATGTT GGrTTTACAG CATGAATATT GCCACCTGaT 12 0 0 

TCACGTTGrG CAATTrGTTT TAGATGAGCa TTCACATTTA CTGATGAACC TTCTGATGAT 1260

TTTGATyCAG TTGGTGTTGC AGTAACTTGT GAATTGTTTG ATGTTGATGC TTGTGGTTGT 13 2 0 

TGAGTTTGAG CATTTTGTGG TGCTTCAACT TCTTGTGATT GTACTTGATT AGCTTGAACA 13 8 0 


GCTGATGGTG CAACATTATT AGTTGCAGGT GCTTGTGCAC TCATGTCTGC TCCATTAGTA 144 0 

CCTGTTGCAT GGTAATTCCA AGCAAAGTGT GTACCATCTG ATTCAAAGTG ATAAGTAAAC 1500 

CCTTCATAGT CAAATGTATA ATTATAAGCC CCAGCTTCAA TTGGTTTTTG ATTTAATGTT 156 0 


TGATCATTTG ATTGCGCCAT TTGCCTGAAA GATGCTTTAT TTAAGTCCGC TTCACnTGCA 162 0 

TGGGCTTCGT GGACCTGCAT TTCCTGGCTA CGATTCCTAA ACCTACTGGC nAAnGATGAT 16 80 

GCGAGTAATG TTTTCTTCAT AATCTTAAAA TCCTCCTACA AGTGAATTTG TGTCTCTAAA 174 0 


AGTTTTACAG TGGACGACTG T 1761 
(2) INFORMATION FOR SEQ ID NO: 496:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 794 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 496: 

TCATTTATGA AAAATGTCGG AnAGAnCCaa GaAAmaCAAT TGAgCGTGAA GAAAAAGCAA 6 0 

GACTTAAAGA AGAACAAAAG GCACGTCAAA ATGAACAGCC ACAAATAAAA GATGTGAGTG 120

ATTTTACGGA AGTGCCTCAA GAAAGAGATA TTCCAATTTA TGGGCATACT GAAAATGAAA 180 

GTAAAAGCCA GAGTCAACCA AGTCGAAAAA AACGAGTGTT TGATGCAGAG AATAGTTCGA 24 0 

ATAACATCGT AAATCATCAT CAAGCAGATC AGCAAGAACA ATTAACAGAA CAAACTCATA 300

ACAGTGTTGA AAGTGAAAAC ACTATTGAAG AAGCTGGTGA AGTTACGAAT GTATCGTATG 360 

TTGTTCCACC GTTAACTTTA CTTAATCAAC CTGCAAAACA AAAAGCAACA TCTAAAGCTG 42 0 


AAGTGCAACG TAAAGGACAA GTACTAGAGA ATACATTAAA AGATTTTGGG GTAAATGCAA 480 

AAGTGACACA AATTAAAATT GGTCCTGCAG TAACTCAATA TGAAATTCAA CCAGCTCAAG 54 0 

GGGTTAAAGT GAGTAAAATT GTAAACTTGC ATAATGATAT TGCATTAGCT TTAGCAGCAA 60 0 


AAGATGTTAG AATCGAAGCG CCAATACCTG GTCGTTCTGC AGTAGGTATT GAAGTGCCAA 66 0 
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ATAAACTAGA AGTTGGATTA GGaAGAGATA TATCAGGTGA TCCAATTACT GTTCCACTAA 7 80 

ATGAAATGCC ACAC 7 94 

(2) INFORMATION FOR SEQ ID NO: 497:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1161 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS : double 
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 497:

AGCCAGTTTT GcATTTCGTC AAATCGCAAT GAATATATTG ATTGCTTGTC AAAATTTGGA 60

AGAAATTTAT TCTTCTTTTT CTTTTCAGCA GTTATCATTC CTAATTGTCC TTTCTTTTTA 120

TCTTAGTGAT AAAGAATCCA TCTGAATTAA AGTCTTGCGG CATGATTTGT AACGTTTTGA 180

CCAACTCTCC AGTTATCGGA TGTTGAAACG GTTCAAATTC GAAGTTTTTA TTATTTTTCA 240

AAAACGTATA AATCACGTTT TCATTTTCTA GTTGCTCAAT TGTACATGTT GAATAGATGA 300

TTTCTCCACC TATTTTTACA TTGTTTTTTA CATTTTCCAA TATTTCAAGC TGTAATTCAA 360

CTAGTGACTC AATATGTTGT TTGCTTTGAG TATACTTAAT CTCCGGCTTA TGTCTCATTA 420

CACCTAATCC GCTACATGGT GCATCAACAA GTATCTTATC GTATGTTTTA TCATAAGGTT 480

TTGTCGCATC ATGTTGAAAA GCTTTAATAT TTGTTAATCG TAATTnTTTT ATATTAAAAT 540

TAATTAAGTC TATTTTGTGA TCATGTATAT CTGAAGCGTC AACTTGCCCT TCTGGCATTA 600

AAACTTCAGC AATGTGACAA GCTTTACCGC CAGGTGCACT ACATGCATCT AATACGTGAT 660

CATGTCGGTC TACATTCATA ATGTGTGCAa CAAACATTGA GCTTTTATCT TGAATTGAAA 720

CGAATCCATC TTTAAATGAA CGAGAATGAA TAATTGGTTG TCCTCCTATA TGGAGACAAT 780

AAGGTAAGTC ATGATCTTTT TCAACGTCAT AACCTTCGTC TTGCAACTTT TCAATAATAT 840

CATCTAATGA TGCTCGCGTC AGGTTGGCAC GCACAGTTGT TGATGTCGTT TCTAAAAATG 900

ACTGTAAAAT TTTTTCAGTT TCTTCGAGAC CATAATGTGT TGCCCAATGA TCTATAATCC 960

ACTTCGGCAT ACTATACTCG ATTGCCATTC TTTTTTTAGG ATCTGCAATT TCATTAAAAT 1020

CAGGTAAGTC ACTACGCATC ATTGTACGTA AAATACCATT TACGACATTA CCATTATGAT 1080

AGCCACCGCG TTCTTTTGCT ATTTCAACTG CTTCATTAAT AATGGCATGA TTTGGAACTT 1140

TATCTAnATA nACATATTGA T 1161

(2) INFORMATION FOR SEQ ID NO: 498:
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20 






(A) LENGTH: 1504 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 498:

AGCTCACGTC ATCTTCGGCG GCGCTAAATT AAAATAATCA ATTTCTGAGT TAAACTTTTA 6 0 

TTTACAACAT ACTATTACTA TACATTACAA ATTTTTAAAA TATACATTAC ACTCATTACT 120 

CAATGGmAAG CGTATGATTT CmCAGCCCCC CTAGCTTGTA GAAATCATAC TTTCCTTTTT 180 

TCAATATATA TACAACTATT AAATCCCATA AGATTGCAGA GCACATAAGT AAATTTTTTT 240

AGAGCTTGAG GTrTGTTTAG CTTAAGCAAC CCATGAGCTC AAACACTTCC TGTTCACATA 3 00 

ACACTACAAA TCGCATTATG TTGCTTAATC TTATGTTTAT ATAAATTACA CACAATAAAT 360 

AGAAAGAATG TGAACATCAT GAATAAATTA TTGCTACTCG TTACATTTAT CATTCGTGTG 42 0 

GGTTCAGGTA TTGTTATGTT AATGCAAGGC TACGAAAAAT TAACGGGCGG ATTTACGCTG 4 80 

AAAGGTTTAG TACCAGTCAT CGCTAACAAT ACTGATTCAC CAGAGTGGTA TAAGTGGTTT 54 0 

TTCGCAAATA TAGTTGCACA TACGACGTCA TTATTTGATA TTGTTGTCCC ACTCGGAGAG 60 0 

ATTGCAATTG GATTAGGTTT AATTTTTGGA GTTTTTGCAT ATGCTGCTAG TTTCTTTGGA 660 

GCCTTTGTTA TGATAAATTA TATCTTAGCA GATATGATAT TTACGTATCC TCTTCAATTA 72 0 

ACTTTCTTTA TCCTTTTACT AATGAGTCAC TCATTGTTAA AACAGATTTC ACTTAAAGAA 780 

ATCATTAATT ACTTTAGAGG TCGTAAGAAC AGAGGTGAAA AAATAGATGA CCCACTTACT 84 0 

GATCGTGGAT GATGAACAAG ACATTGTAGA CATTTGTCAA ACCTATTTTG AATATGAAGG 900

TTACAAAGTA ACAACGACAA CTAGCGGTAA AGAAGCAATT TCTTTACTAT CAAATGATAT 960 

TGATATCATG GTACTTGATA TCATGATGCC AGAAGTTAAT GGTTACGACA TTGTCAAAGA 1020 

AATGAAAAGG CAAAAATTAG ATATCCCCTT TATCTATTTA ACTGCCAAAA CACAAGAACA 10 80 

TGATACCATT TACGCCTTAA CTTTAGGTGC AGATGACTAT GTCAAAAAAC CATTTAGTCC 114 0 

AAGGGAACTC GTTTTACGTA TTAATAATTT ACTTACAAGA ATGAAGAAAT ACCATCATCA 12 0 0 

ACCAGTTGAA CAACTGTCGT TTGATGAATT AACACTTATT AACTTAAGTA AAGTtGTGaC 12 6 0 

tGTAAaTGGT CACGAaGTCC CTATGCGTAT TAAGGAATTT GAGTTATTGT GGTATTTAGC 1320 

TTCTaGAGAA AATGAAGTTA TTTCTAAATC AGAATTACTT GAAAAAGTTT GGGGATATGA 13 80 

CTATTACGAA GATGCTAATA CCGTGAATGT CCATATACAC CGTATTAGAG AhAAATTAGA 144 0 

AAAAGAGAGC TTTACAACAT ATACCATCAC AACTGTATGG GGATTAGGAT ATAAATnTGA 15 00 
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(2) INFORMATION FOR SEQ ID NO: 4 99: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1623 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 499:

ATTGAAAGCG ATAATTCGTA nTAATTGAGT TTGTTGAAAA ATTTAGGGTA ATGTAAAGAT 60 

ATAAAAGATA CATAGAyTGG AGAGATATAA AGATGTTGAA TGAGATACAA ATATTAaATA 12 0 


aTGGATACCC GATGCCTTCA GTTGGGTTAG GTGTTTATAA AATCTCTGAC GAAGATATGA 180 

CTAAAGTTGT AAATGCTGCA ATTGACGCAG GCTATAGAGC GTTTGATACA GCATACTTTT 24 0 

ATGATAATGA GGCTTCACTA GGACGAGCAT TAAAGGATAA TGGCGTCGAT AGAGAAGATT 3 00 

TCTTTATAAC AACGAAGTTA. TGGAATGA.CT ATCAAGGTTA TGAGAAAACA TTCGAATATT 3 60 

TCAACAAATC GATTGAAAAT TTACAAACTG ATTATCTTGA TTTATTTCTA ATACATTGGC 4 20 

CTTGTGAAGC AGATGGTCTA TTTTTAGAAA CATATAAAGC TATGGAAGAA CTTTACGAGC 480

AAGGTAAGGT AAAAGCAATA GGTGTATGTA ATTTTAATGT TCATCATCTA GAAAAATTAA 54 0 

TGGCTCAATC AAGTATCAAA CCAATGGTGA ATCAAATTGA GGTACATCCA TATTTTAACC 6 00 

AACAAGAATT ACAAGAATTT TGTGATCGTC ACGATATTAA AGTGACTGCA TGGATGCCTT 66 0 

TGATGAGAAA TAGAGGACTA CTAGACGACC CTGTCATTGT TAAAATTGCT GAAAAATATC 72 0 

ATAAAACACC AGCACAAGTT GTATTACGTT GGCATTTAGC ACACAATAGA ATTATTATTC 78 0 


CAAAATCTCA GACACCTAAA CGCATTCAAG AAAATATAGA TATTTTAGAT TTTAATTTAG 84 0 

AATTAACAGA AGTAGCTGAA ATTGATGCTT TAAATAGAAA TGCAAGACAA GGTAAAAATC 900 

CAGATGATGT GAAAATTGGG GATTTAAAAT AACTGGATGT TAAATTTTAC GTTTATGAAT 96 0 


GCCTTTTAAT GTGTACATTA AAATAAATGA GTTGGTTTTT ACTATTTGAT AAAACAATAC 102 0 

TCAGGTACAT TCAAAATCTT TTAAATAAAA AGGATGGACA TAGATGAAAA TTAGAGTCGT 108 0 

CATTCCTTGT TTTAATGAAG GGGAAGTCAT TACACAAACA CATCAACAAT TAACTGAAAT 1140

ACTTTCACAA GATAGTAGTG TGAAAGGCTA TGATTATAAT ATGCTTTTCA TAGATGATGG 12 00 

TAGTACGGAT ACCACTaTAG ATGAAATGCA ACATCTTGCC ACAATAGATA GGCATGTCAG 1260 


CTTTATTTCT TTTAGTAGAA ATTTTGGAAA AGAAGCAGCT ATGATTGCAG GTTACCAGCA 132 0 

TAGTACTGAA TTTGATGCAG TCATCATGAT AGATTGTGAT TTGCAACATC CACCTGAATA 13 80 
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1623 



TAGAAGTGGT GAAAATTTTA GTCGCAAAAC ATTAAGCCAT TTGTATTATA AGTTAGTTAA 150 0 

TTGCTTTGTA GAAGAAGTAC AATTTGATGA TGGTGTTGGT GATTTTAGAC TTTTAAGCCA 156 0 

AAGAGCTGTT AAATCCATTG CATCACTTGA AGAATATAAT CGnTTTTCAA AAnGGnTATT 16 2 0 

TGA 

(2) INFORMATION FOR SEQ ID NO: 500: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 605 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 500: 

AAAGTnGGTG AAnCTATATA CTTAATCTAT ATTTATATAT TAACCATTAG GGTTAAAAAT 60

TACTCTAGCA TTTATGAATA GATGGGAGTT TATTTTATTA TTATATAGGA GAGATGTTGA 12 0 

ATGACACATC GCGCACTATT AGTTGTTGAC TATTCATATG ACTTTATCGC AGACGACGGC 180 

TTACTAACAT GCGGTAAACC TGGACAAAAT ATTGAAGATT TTATTGTTTC TCGTATCAAT 24 0 

GACTTTAATT ATTATCAAGA CCATATATTC TTTTTrrTGG ATTTACATTA TTTACATGAC 3 00 

ATTCATCATC CTGAAAGTAA ATTATTCCCA CCACACAATA TCGTAGATAC AAGTGGTAGA 3 60 

GAATTATACG GTAAAGTAGG TAAATTATAC GAAACAATTA AAGCGCAACC TAATGTACAT 42 0 

TTCATTGATA AAACGCGCTA TGATTCGTTC TTTGGTACCC CGCTTGATAG TTTATTGAGa 480 

GAAAGAAGTA TTAATCAAGT CGAAATCGTT GGTGTATGTA CCGATATTTG CGTGTTACAT 54 0 

ACAGCAATTT CTGCATACAA CTTAGGtTAT AAAATTTCAG TACCTGCTGA GGGAGTGGCT 6 00 

CATTT 6 0 5 
(2) INFORMATION FOR SEQ ID NO: 501:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1739 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 501: 
TAGGTTnAAA GCATAGnTTT nTCAAAAAGA CAAATCATTC ATATATTGGA GGATATTTTG 6 0 

GTGTAAGATA TAGTGCAACC ACAATTGCTA AAGACTTGAA GGAACTAAAT ATATATCGTG 12 0 
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TGAGAGAAAA GTTTAGACAC TATTGTGAAC ATGAAGTTCT AAGTTCAATC ATCAATGGTT 24 0 

CATACATTAT CGTCAAAACC TCACCTGGTT TCGCCCAAGG CATAAACTAT TTTATCGATC 3 00 

AGCTAAATAT AGAAGAGATA TTAGGTACGG TGAGTGGAAA TGACACTACA TTAATCTTAA 36 0 

CTGCCTCAAA TGATATGGCA GAATACGTAT ATGCAAAATT ATTTAAATAG ACATGTATCA 420 

AATGAATAAT AAAAATTTGT TTCGTATCAC GTGTACTCAA GTTAGTTACC AAATATTAAC 480

TTGTGTACGC GTmTTTAT GGAAAGAAAG AATTCATAGT CATTCAATTG ACTGTATAAA 54 0 

AAACTTTATA CAACATGTTT TTATGGGTAT TTTTGAATAA AAAATGTATA TTTTGACCCA 600

AAATACCTTT ATTTATGTAT AAAAATCCAT TATTATGTAT TGTATAACAA AAAGATATGA 66 0 

AATTTTCGAC TTTCTTTATG TGAATATAAT CACATGTAAG CGTTTGAAGA TTGTCTATAC 72 0 

TCTAAATGAA TTCAAAGATA AAAGGAGGAA ATAGACATGA CAGATGGTCC AATTAAAGTA 78 0 

AATAGCGAAA TTGGAGCPTT AAAAACTGTG TTACTTAAGC GTCCTGGaAA AGAATTAGAA 840

AATTTAGTAC CTGATTATTT AGATGGATTA CTATTTGATG ATATTCCATA TTTAGAAGTA 900

GCTCAAAAAG AGCATGACCA TTTTGCGCAG GTGCTAAGAG AAGAGGGTGT TGAAGTACTT 960 

TACCTTGAGA AGTTAGCAGC TGAAAGTATT GAAAATCCTC AAGTAAGAAG TGAATTTATT 102 0 

GATGATGTAT TAGCAGAGTC TAAAAAAACA ATATTAGGTC ATGAAGAAGA AATTAAGGCA 108 0 

TTATTTGCGA CACTTTCTAA TCAAGAACTT GTAGATAAAA TAATGTCAGG GGTACGTAAG 114 0 

GAAGAAATTA ATCCGAAATG TACACATCTA GTAGAGTATA TGGATGATAA GTATCCATTC 1200 

TATTTAGATC CAAlXiCCAAA CCTTTATTTT ACTAGAGATC CACAAGCCTC AATAGGACAC 126 0 

GGTATAACAA TCAATCGGAT GTTCTGGAGA GCACXSACGAC GAGAATCAAT ATTTATTCAA 1320 

TATATTGTAA AGCATCATCC TAGATTTAAA GATGCGAATA TTCCAATCTG GTTAGATCGA 13 80 

GATTGCCCAT TCAATATTGA AGGCGGCGAT GAACTTGTTT TATCTAAAGA TGTCTTGGCT 144 0 

ATAGGCGTTT CAGAACGTAC ATCTGCACAA GCTATTGAAA AGTTAGCGCG ACGTATTTTT 1500 

GAAAATCCGC AGGCGACGTT TAAAAAAGTA GTAGCAATTG AAATTCCAAC TAGTCGAACT 1560 

TTTATGCACT TAGATACAGT ATTTACAATG ATAGATTATG ACAAATTTAC AATGCATTCA 16 2 0 

GCCATTTTAA AGGCAGAAGG CaATATGAAT ATATTTATTA TTGAATATGA TGACGTAAAT 16 8 0 

AAAGATATTG CCATCAAACA ATCTAGTCnT TTAAAAGATA CmAGAAGA CGTACTAGG 1739 
(2) INFORMATION FOR SEQ ID NO: 502:
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1745 base pairs 

(B) TYPE: nucleic acid 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 502: 
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CTGTACATAC AGCAATATCG TTAACAAATG AAAACAGTAT TTTAGGATTG TAAACCATGA 60

TAAACCTAAA ATACTGTTAT TTTTATTACT TAAATTTCTT CTTCAATGCC TTTTCAACAT 120

AAGGTGGAAC GAATTCAGAA ATATCTGCTC GATAAGCTGC AACTTCTTTA ACAATACTTG 180

AACTTATAAA TGAATAATTA GTACTAGACA TCATATATAA CGTTTCAATT TCATTGTTCA 240

ACTTTTTATT CATTGAAGTT AAGCGTAATT CATATTCAAA ATCACTGACT GCTCTTAAAC 300

CACGTATGAT TGTTTTAGCT CCTACTTGTT CACAATAATC GACTAGTAAA CCACTAAATT 360

GATGAACCTT GACATTAGGT AAATGTTTAA CAGATTGTTC AATTAAATCC ATACGCTCTT 420

CTAAACTAAA CGTACCTTCT TTTTTACTAT TTTTAAGAAC ACAGACATGA ATTTCATCAA 480

ATCTATCTGT ACTTCTCTCA ATAATGTCTA AATGACCATA AGTAATGGGG TCAAAACTAC 540

CCGGAATGAC CGCTATTGTA TGTTCCATGC TATTCTCCCT TTTCTAATAA CAATGTGTCT 600

GTCAACCCAT AATGGTAACG TTTAATCATA TTAAACGGTT GATAATCtAT TTCTTCATGA 660

TTGCTAAATT CACAAACGAT GATACCATTT TCTTTCAATA AATTAAACTC TGAAATTAGT 720

TTTAAAGCTT TATCAATGAG ACCTTTATTA TAAGGTGGAT CTAaGAAAAT GACATCAAAT 780

TGAATATCAC GTTTTGACAA TGCTTTTAAA GCTCTATCTG CATTATTTTT ATAAACTTCA 840

GATTGTGCCT CTAAATCCAA ATTCGCAAGA TTTGaTTTAA TAACTTTTAC AGCTTTAAAA 900

TTTTGaTCAA CAAAGATTAC CTTATCCATA CCTCGAGAGA GTGcTTCTAT TCCAAGCGCC 960

CCGCTTCCTG CAAATAAATC TAAACCTATA CCTGACACAT CATATAAACT ATTAAAGATA 1020

CCTTCTTTAA CTTTATCCAT AGTTGGTCTC GTATTACGGC CTTCCATACT TTCTAAAGCT 1080

TTACTTTTAT GTTTACCTGC AATGACGCGC ATGTTGTTCA CACTTCCAAT TCATTTAGTT 1140

ATTTAATATA ATTTATTGAG AAAAAGGAGA ATGATAAACC AATGAAACAA ACATTTATTA 1200

CACTTGGTGA AGGTCTAACA GATTTGTTCG AATTCATGAC GATGATTGAA TATAACCATC 1260

AACGTATTGA TAAAATTATC TATTTTCATT CACCACAAGC TGAAAATAAA AAGTCATCTG 1320

TAGCAATCAT TATGAACCCT ACAACTGGCA ATCATTTCCA AGCATTTTAT ATCATGATAA 1380

ACGCTATTAA ATATCCATAT CCAGATTCAA ATAAAAAGTT TCAAATGATA AATGATTGTG 1440

CTGAAAAATT CGACATACCA ATTTTAGGTA TCGATGTACA GCCCCCTCAA GCAT'ITCATG 1500

ATTTA7CGTT ATATTATAAT TATTTAATTA GTGTGTTAAG GCTCCAAAAA TGGATACCAG 1560

aACTTCAATA ATATTAATTA TATATTTCGT GTTTCTCTTT TTCGTAAGTT TTCTTTAAGT 1620
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TTTGATCAAC ATCTTGCTCA TTCACATACA TAATTACAAA TTTACGATCT CTATTTGAAT 174 0 

GAACG 174 5 

(2) INFORMATION FOR SEQ ID NO: 503: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1035 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 






(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 503: 

TCGTCTTTAA TCTTGCTGAC TTTAGAAGGC TTACGAGTGC GGTTACCATT TTTTGCATCT 6 0 

TTAACTGATT GAACTAAAGC TtGACGTGTA GATTTATCAG CTAAACTAAT TGCACCACCA 120 

ATTACGGCAC CAATTAAAAT ACCAGGAACA AATTTATTTT CCATAAAAAA CTACCCCTCT 180

TTCAAATTTG CATCTTTTAC GATGTAGTCT ATTAAATTAT CACAAGATGA TAATACCATG 24 0 

TCGTATACAC CTTCAAAATT ATTCGTGTAG TATGGATCTG GTACATCACT CTCTTCCATA 3 00 


TTACTAAATT CTAACAGTTT GAACAATTGT CCCTTAAGAT TAGGATTGAT AGATTTAATA 360 

TTATCAACGT TACTTTGATC CATAGCCACA ATGTAATCAA AATCATCTGT CGCTTCGAAT 42 0 

AATTCACTAA TCATGCCATC AAATGGAATA TTGTGTTTGT TGAGAATTTT TTGTGTACCT 4 80 

TCATGAGGTG GCTCTCCTAA ATTCCAGCTA CCAGTACCTC TTGAATGTAC TTTAATATCA 54 0 

TGAATATTTC TGTCTTTAAG TCTTTGTCGC ATGATTGCTT CTGCCATTGG AGAACGACAT 600

ATATTGCCAA GACAGACAAA TGCTACATCT ACCATTTTGA TTCCTCCAAA CTATGTAGTT 660 

ATATCCCCAT TTTATAGCGA CTTTAAACAA TAAGAAAGCA GATTATATAA AATTCTATTA 720 

AAGTTTATTA AATTGTGATA CTTTGATAAC ATAACTATTA TTAGAGGTGA ACATTGTGGC 780 

TATGACAAAT GAAGAGAAAG TnTTAGCTAT TAGAGAGAAG TTAAATATtG TTAATCAAGG 840

ATTATTAGAT CCTGAAAAAT ATAAAAATGC AAATGaAGAA GAATTAACAG ATATATATGA 900 

TTTTGkTCaA yCAAGAGaAA GATTGTCGCC AAGTGaAGTG mCAGCTATTG CTGaCGCTTT 96 0 

AGGACAATTG CGACACGAAT AGGAGTGGGA ATTTTGACTA ATTACAAAGA AAAGTTACAA 1020

CAATACGCTG AACTA 1035

(2) INFORMATION FOR SEQ ID NO: 504: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1284 base pairs 

(B) TYPE: nucleic acid 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 504:

AnCCTGACAA GATTCTTATA TTCATAGATA TGAGAGCTAA AAATGAAATC AACAAACAnA 60 

CAATAAAGTA AACGATAATA GCCCATATAC CATTTTGTAA CCATATTACA AATTGTGTTG 120 

TATTATAGCC ATTTCCAGCT AATAATTGCT GGATAAATGC ATTATTGTTT AATGTATTTT 180 

CAAGATTAGC AAtCGATGTG TTATTACTGA ATGAAACAAG TGCTATAAAC ATCGTAATGA 240 

CAGTAAGCAC TAATAACATC ACCCAACATA ACCAACCTAG AACTTTTTCA GTTAATCTAC 3 00 

TTACTGGACG TTTAATTTGA GTAAATTGTT CTCCAGTCAT TCGTTACAAC TCCTTATAGT 360 

ACTTATCCCG TTATTATAAC TAAATATACA GTAAATAACT ACTATTTATG ATTTTATTTT 4 20 

AATGACATTT TGAAATTCAA AAAGTTTTCA TTGTATTCAC TTAAAACTTC AGGTCCTAAA 4 80 

TCTTTATAAA CTTCAAGGCG TTCTTGCTCT TTCTTAGTCG GATAAAAACG ATGGTCGTCT 540

TTAATCTCTT TAGGCAACAA TTGTCGAGCA GCCTTGTTTG GCGTTGCATA GCCTACGAAT 6 00 

TCTGTATTTT GCTTGTTATT TTTAGCATCT AATAAAAAAT TCATAAATTT ATATGCACCC 660 

TCTTTATTTT GTGCCGTTTT TGGAATTACC ATATTGTCGA ACCATAAATT CGATCCTTCT 720 

TTAGGAATAA CATAATTATA TTTATCCCCT TCTTGCACTA GAGGTGCTGC AACACCACTC 780 

CAAACAACCG CTATGTTACC TTCATTTTGT TGAAGCATCA TGGTAATTTC ATCACCTACG 84 0 

ACACCTCTTA CTTGTGGTGC TAGTTTGGTT AAATCTCGCT CTGCTTCTTT TAAATGGTGC 900 

GAATTACGGT CATTAAGATT ATACCCAAGT TTATTCAAAC TCATGCCTAT AATCTCTCTA 960 

GCACCGTCAA CTAGTAAAAT TTGGTTTTTA AATTTAGGAT TATACAATGA CTTCCAACTA 1020 

TCAAATGATT CATTTGGATA CTTTTCTTTA TTATATAAAA TACCTACAGT TCCAAAGAAA 10 80 

TAAGGTAAAG AATATTTATT GCCTCTATCA AATGACATAT TCATATAATC TGAATCTAAA 114 0 

TTTTTAATAT TAGGTACCTT ATTATGATCT ATTGGTAACA ATAAATGATC TCTTTTCAAT 12 00 

TTTTGAACTG nATATTCACT AGGAnAAGCA ACATCATAAT GTGTACCGCC AGTGCGAATT 126 0 

TTGGnGTCCA nCGCTTCATT TGAA 1284
(2) INFORMATION FOR SEQ ID NO: 505: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5763 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 
ATAATTAACA TCTTTTGCTA TATACCACCA GTTTGATACA TAAAATATCG CAGCAATAAT 60

ATCATGTTTA ACCCTAATGA TATTATCTGA TTTTAATAAT AAGGTTGCTG TCCCTACAAC 120 


CATTAATAAA ACTATGACTG CTGGTAATAA ACGTTTTAAA CGACGTATCC AAAAGCTTTT 180 

CAATTTAATG ATACCTGTGT CATCATACTC TTTGAGTAAT AAGCTTGTAA TTAAATAACC 24 0 

AGAGATCACA AAAAATGTAT CCACACCTAA AAAGCCACCT GtCAACCATT GCTTATTTAA 3 00 


GTGGTAAATA ATAATTCCTA GAACAGCGaT TGCCCTCAAA CCATCGAGCC CTGGAATATA 3 60 

TCTCATTTTC TTATACTTTG TAAAACCCTT TGTTTTGTTC ATTTTTGCAT TCTTCCCTTT 42 0 

TAAAACTGTT CTTCTTAGAT GCTTAATTAA ATTTAGTTAT GCTGTTTAAA AGAATATTGA 480

AATGCATATG TATATTATTG AATTACGACA TCATCAAAAT CATATTGACT AAAATACTGT 54 0 

TAAATTAAAA AAATTACCAA TGATGATTCT TACTTCGAAA TCCAATTTGT AATGCAACTC 6 00 

GGCAACTTAA AAACTATGAA GTATTATGTA TTGTAATATA ACTGTAATAT AAATTCAATT 660

TATTATAAAA ATTTTCAAGA AAATATTCAA CTAGAAAATG AATTGTGCAC TCTTGGAAGT 720 

GCAAGTCACT GTCTTAATTC ATATTTTTTG AAACAAGTTA GATATAAATT TTCAAAATAA 7 80 

AATCAGAAAC TAGAACATAA ATAAGGCTCC CTTCAAAATT TTCATTTTTC AATGTCTACT 840 

TTGAAGGGAG CTTATTCACA ATGAATTATA CTCTACAATG TTATATTGAC TGCGGGCCCA 900 

AACACAGAGA ATTTCGAAAA GAAATTCTAC AGGCAATGCA AGTTTATGTT AGCTCACACC 960 


AAGTGCAATC TTAGCGTAAC GTGACATCAT ATCTTTTGTC CAAGGTGGAC TCCATACGAT 1020 

ATTCACTTCA GTATCCTGAA TTTCAGGAAT CTCTGCTAAT ACTGTTTTAA CTTGaTCAAT 1080 

AATTTGAGGT CCCATTGGAC ATCCCATTGA TGTTAAAGTC ATATCAACTG TACATACGCC 1140 

TTCATCATCA ACATTCACTT TGTATACTAA ACCCAAATTA ACGATATCAA TTCCTAATTC 12 00 

AGGGTCAATT ACCATTTCTA ATGCACCTAA GATACTATCT TTCAATGCCT CTTCCATCCA 1260 

TATCACCTCT TTAATGTCAT ATTATTCATA ATATATCAAA TATCCGACAA AACGCCAATA 1320

AAATGCTATG ATGTATCTAT ATGAACTAAG CAACTTATGA GGAGAGAGAT ATGCAACCAC 13 80 

ATTTAATATG TCTAGACTTA GACGGAACAT TATTAAACGA TAACAAAGAA ATTTCATCAT 144 0 

ATACTAAACA AGTATTAAAT GAATTACAAC AACGTGGaCA CCAAATTATG ATTGCGACTG 150 0 

GCAGACCTTA TCGTGCAAGT CAAATGTATT ATCATGAATT AAATTTAACG ACACCAATTG 1560 

TTAATTTTAA TGGCGCTTAC GTACATCACC CTAAAGATAA AAACTTCAAA ACTTGCCATG 1620 


AAATTTTAGA TTTAGGCATC GCACAAAACA TTATTCAAGG ATTACAACAA TATCAAGTAT 16 80 

. aAA^AT^ATr TTTTCATTAA CAATCATGAT CCAAGATTAT 1740 
AAGAATCCCC TACCTCAATT TTAATTGAAG CCGAAGAAAG TAAAATACCT GAAATCAAAA 1860

ATATGCTTAC TCATTTTTAT GCCGATCATA TTGAGCATCG ACGCTGGGGC GCACCATTCC 1920 


CTGTCATTGA AATTGTAAAA CTTGGTATTA ATAAAGCAAG AGGCATTGAG CAAGTTAGAC 1980 

AATTTTTAAA TATTGACCGA AATAATATTA TTGCATTCGG TGATGAAGAT AATGATATTG 204 0 

AAATGATTGA GTACGCGCGT CACGGTGTTG CTATGGAAAA TGGTTTGCAA GAACTTAAAG 2100 


ATGTAGCGAA CAATATTACA TTCAACAATA ATGAAGATGG CATTGGTCGA TATTTGAATG 216 0 

ATTTCTTTAA TTTAAATATT AGATATTACT GTTAATTTAT AACTAATCAT TTTATAATAT 2220 

TTTAAAACAA TAGGAGGTAA GTTACGATGC CCAAAATAGT CGTAGTCGGA GCAGTCGCTG 2280

GCGGTGCAAC ATGTGCCAGC CAAATTCGAC GTTTAGATAA AGAAAGTGAC ATTATTATTT 234 0 

TTGAAAAAGA TCGTGATATG AGCTTTGCTA ATTGTGCATT GCCTTATGTC ATTGGCGAaG 24 00 

TTGTTGAAGA TAGAAGATAT GCTTTAGCGT ATACACCTGA AAAATTTTAT GATAGAAAGC 2460

AAATTACAGT AAAAACTTAT CATGAAGTTA TTGCAATCAA TGATGAAAGA CAAACTGTAT 2 520 

CTGTATTAAA TAGAAAGACA AACGAACAAT TTGAAGAATC TTACGATAAA CTCATTTTAA 2 5 80 


GCCCTGGTGC AAGTGCAAAT AGCCTTGGCT TTGAAAGTGA TATTACATTT ACACTTAGAA 264 0 

ATTTAGAAGA CACTGATGCT ATCGATCAAT TCATCAAAGC AAATCAAGTT GATAAAGTAT 2700 

TGGTTGTAGG TGCAGGTTAT GTTTCATTAG AAGTTCTTGA AAATCTTTAT GAACGTGGTT 27 6 0 


TACACCCTAC TTTAATTCAT CGATCTGATA AGATAAATAA ATTAATGGAT GCCGACATGA 2 8 20 

ATCAACCTAT ACTTGATGAA TTAGATAAGC GGGAGATTCC ATACCGTTTA AATGAGGAAA 2 8 80 

TTAATGCTAT CAATGGAAAT GAAATTACAT TTAAATCAGG AAAAGTTGAA CATTACGATA 2940

TGATTATTGA AGGTGTCGGT ACTCACCCCA ATTCAAAATT TATCGAAAGT TCAAATATCA 3 00 0 

AACTTGATCG AAAAGGTTTC ATACCGGTAA ACGATAAATT TGAAACAAAT GTTCCAAACA 3 060 

TTTATGCAAT AGGCGATATT GCAACATCAC ATTATCGACA TGTCGATCTA CCGGCTAGTG 3120

TTCCTTTAGC TTGGGGCGCT CACCGTGCAG CAAGTATTGT TGCCGAACAA ATTGCTGGAA 3180 

ATGACACTAT TGAATTCAAA GGCTTCTTAG GCAACAATAT TGTGAAGTTC TTTGATTATA 324 0 

CATTTGCGAG TGTCGGCGTT AAACCAAACG AACTAAAGCA ATTTGACTAT AAAATGGTAG 3300

AAGTCACTCA AGGTGCACAC GCGAATTATT ACCCAGGAAA TTCCCCTTTA CACTTAAGAG 33 60 

TATATTATGA CACTTCAAAC CGTCAGATTT TAAGAGCAGC TGCAGTAGGA AAAGAAGGTG 34 2 0 


CAGATAAACG TATTGATGTA CTATCGATGG CAATGATGAA CCAGCTAACT GTAGATGAGT 3480

TAACTGAGTT TGAAGTGGCT TATGCACCAC CATATAGCCA CCCTAAAGAT TTAATCAATA 3 54 0 

GTTAGAATTA TGTTGGACTG GTACTACTAT CCAGTCCATT TTTTATGTTT AACATTTTTA 3660

GAATCAAAAA AGACATAAGG TCTTGGACTA ATAATTGTCC ATGCCTTATG TCATATACTA 3720 


TATGTCTTAT CAATTAGCCA ATACCGAATA ATTTTGATAT AGGsCCTAAC GGTAGAATGA 378 0 

CACCTAATAC CATTGTGATG ATAATTAATG CAATTGTTAT CCAAAACATT GTGTGACTTT 384 0 

GTTCATGTCT CTTTCTTTTA GCAATCGACA CTTCCATCAA TCCAACTACT GCAACACCAC 3900 


ACAGCATTTT CAATGTAAGC AACATATGAT TTGCCCCX3CC ATTCATAAAT GACTGAATTA 3960 

ATATCCAAAA TCCTGAAATT AACGTCAACA GCATAAATAA GCGTAAAATC ATGTGCAACG 4020 

GTTTGAAAAA TGGTGATCTG CCTTGATTTT TTGAAATGTT TAAGTATGTA GCGATAAATA 4080

AAATAATCGC TAATACCCAA CTTAATATAT GTAAATGTAA CATACTGATT CCCCCCACTT 414 0 

TAATTATTTA TATTATTAAA TTAAAGCTTC TTGGGATTAA TACCCACTTG CTTGTAATTT 4200 

AATCATGATT TGATTATACA CGAATATATA TTCTACCACA CTTCTATATT TGAGAGGAAG 4260

AACATGACAT TTTATTCCTT ATTAGAATAT TGTGAATCTG CTGTAAAATA ATCAACTACT 4320 

TCTTAATATC AATATTTCAC TTTCATCTCA AAATGGTAAC ATTATAAATA ATTTATCTTT 43 80 


AACACCTTTT TAGAAAAGGA AGAAAAAACT AACCAATCTA TATAAAGACT GGTTAGCTTT 44 4 0 

TTAAATGATA ATTATTTAGC GATATAAGTT GTCAGCGTTC CAATATTATC AATAGTCACT 4 500 

TTAACTTCAT CACCTGGTTG TAAAAATTTA GGTGGTTGCA TACCTGCACC AACGCCTGCT 4 560 


GGTGTACCAG TTGCAATAAT ATCTCCCGGA TGTAGTGCAA CATATTTTGA AATTTCTTCT 4 620 

ATTAATTCAT CAATTTTAAG AATCATTTCG CCAGTGTTAC CATCTTGTCT AATTTCATTG 46 8 0 

TTAACTTTTG TAACAATATT TACATTTTCA GGTAATGGTA GTTCGTCTTT AGTAACGATA 4740 

TAAGGACCCA TTGGGCAACC GCCAGTTAAA CTTTTTGATA AAAATGCTTG ATCTTGTTCA 4800

CTTTGTGCTT TGCGATCAGT GATATCGTTA ATAATTGTAT AGCCGTAAAC ATAATCTAAA 4 860 

GCTAATGCTT TTGGAATCTT TTCACCAGAC TTACCAATAA CAATACCTAA TTCACCTTCA 4920

TAATCTAATT GATCAGTAAT ATCTTTATGA TTTGGAATTG TTGCATTATC TCCTGTTAAA 4 980 

GATGACGCTG CTTTTGTAAA TACATATAAT TTTTCCACTT CATGATTTAA TTCGTTCGCA 504 0 

TGATCTTTGT AATTTCTACC AAAAGCAATC ACATTATTCG GAGGTGTTAC TGGTGGTAAA 5100 

AATTCAATGT CATTAAATGA AATTTTATAG TCTTCAGCTT TGCCGCTATC TTCTGCTGCT 516 0 

ACAACTGCTT TACGTACTTG TTCTTGAAAA TCTAAAGTAT GATTTTGTTG TAAACCAGCT 522 0 


AACAATGTTT TAGGATGGAA ATCTCCTTCT GCAAAGTCAG CAAATACTTG TGTTAAATCC 5280

r^ATAr-^^^CA-^ f^TTCGCGTTT TACTTTAArO CCATATGAAG TTTTGTCATT ATACTTGAAT 5 34 0 
TTCGTTATCA AATAACAAAT AAATAAGTAA GACAATTTTG AAAATGAGTT GTGTTCATTC 5460

TGCTACAAGG ACTTTGCACT TAATCGAAAT TATTTTTTAT TCTTTTGAAA ATCAAAATAC 552 0 

TATAGTTGCA ATGTACCAAA ITTGAAGAAG TATAAATAAC CTTTAACTTC TTTATTAAGA 558 0 

ATCGTTTGAA GCGTATTTTG ATAATATTTC ATCTGTATCT TATATTTATT TTTTAATTGT 564 0 

GTACCAATTT CTTCATCTGT CATCCCACGG CGACGATTAA ATGCATCGGT TTTATAGTCT 570 0 

ACAAAATAAT GCACACCATC TTTAACAAAG ATTAAGTCAA TCATACCTTG AATAATTGAG 57 6 0 

ACG 5763 
(2) INFORMATION FOR SEQ ID NO: 506:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 422 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 506: 


CCATATGATT TTGTGCAATA ACTCTTTTTC TTTCTTCTTT TCGTAAAAAG TTGTACATCG 6 0 

CTTTTGTGTT TAAGAGACTA TTGTTTTCTT TAGGTTTTTG AACTTCACTC AGTGTATTTC 12 0 

TAGTTGTTAA CACTAAAATT CCAACTGTTG TATCTTTGTA TCTAGCCATA ACTTTATTCA 180


GATGTTTGTC ATTTGTAATA ACTACGACAT AATTAAACAC TTCATAATAA TCATTAATTT 24 0 

GATTATCTAA TCTATCCAGC TTATCTAATT CTGTTTTAAT CTCATAGACA ACGCCTTTGC 3 00 

CGTTTAACAA TATAAAATCA GCaATACTTT TCCCTATGGG CATCTCAGAA AGTGCAGTAG 360 

TTGTATTAAT AGAATGTCGT CCTAGAAGGA GTkTATTAAG TATnGTGTTT TTGTAAAAAT 42 0 

AT 422 
(2) INFORMATION FOR SEQ ID NO: 507:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1188 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 507: 


GCTTATGTAT TCTCAAAATA TTTATGTGAT ACGCAAAGGA GACATGGTTA TTCGACCAGC 6 0 

ATTTGATGAT GACGATCAAA GAAACGGTAG TGAAATAATT CGGTTTGACA AAACGCGTAT 120 
TTATCTTGGC AAGAAAGCAG AGACAAACCG CATTACTGGC ATTTCTAGTA AACCACCTAT 240

TTTACTAACA CCATTATTTC CAACTTATTT TTTCCCAACA CATTCTGACA GACAAAATGA 3 00 

AAATATTTGG TTAAATATGC ATTATATCGA AAGTATTAAA GAATTAAAAA ATCGTAAATG 3 60 

TAAAGTGACA TTTATTAATA ATGAATCAAT CATTCTTCAT GTTTCATACC ACAGTTTATG 4 20 

GCATCAATAT AACAATTCCA TTTTTTACTA TTACATGGTA GATAAACAAT CTCGCATGAT 4 80 

ATCAAAAAAT CCCGACCAAC CAATAGATTA TAATAAAGCC ACATTGAATG TGTTTGAAGC 54 0 

ATTGACACGC TATTCTTTAT TTGAAGATAA ATAAATTGTT TATTTTTAAA ATATGCGGAA 6 00 

TGTTTTATAA ATATAGTGTA AATGTTCTGC ATATTTTTTT AAGGTATCTA TTGCAAATTA 6 60 

ACTTAATCTT GTTATAATAA TATTTGTGCT TGaTATTCAA ACACATACAA ATTAATCCAC 72 0 

AGTAGCTCAG TGGTAGAGCT ATCGGCTGTT AACCGATCGG TCGTAGGTTC GAGTCCTACC 7 80 

TGTGGAGCCA TTGGAAACGT ACTCAAGTTG GCTGAAGAGG CGCCCCTGCT AAGGtGTAGG 840

TCGCGAAAgG CGCGAGGGTT CGAATCCCTC CGTTTCCGTT ACTTGCTAAA ATGGTATATA 900 

CCATTTTAnC TTTTTTGTTT ACTTATATAT AATGAATGAG AATTTCACTG TTCTTTTATA 960 

TCAATTTTAA AATTCTAAAA ACCTTTCCTA GATAATCTTC TCTAAGAAAG GTTTTTATAC 102 0 

TTGTTGAACT TATAATTAAT TTATTACATA GCAATATTTA CCTGmTTA ACTATAAAAT 1080 

TATCACTACA TGAAATACGA TAATTCGGaT CTCTTAACTT CTCTGCaATT AATGtACTCA 114 0 

TTGgTTTCAT CGTATGATTC ATGTATAATA GCATTTkTTA AATAATTC 1188
(2) INFORMATION FOR SEQ ID NO: 508: 






(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 840 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 508: 

CCCAACTTCG GTTATAAGAT CCCTCAAAGA TGATGAGGTT AATAGOTTCG AGGTGGAAGC 6 0 

ATGGTGACAT GTGGAGCTGA CGAATACTAA TCGATCGAAG ACTTAATCAA AATAAATGTT 12 0 

TTGCGAACAA AAtCACTTTT ACTTACTATC TAGTTTTGAA TGTATAAATT ACATTCATAT 180 

GTCTGGTGAC TATAGCAAGG AGGTCACACC TGTTCCCATG CCGAACACAG AAGTTAAGCT 24 0 

CCTTAGCGTC GATGGTAGTC GAACTTACGT TCCGCTAGAG TAGAACGTTG CCAGGCATAA 3 00 

-^T^-^-^AAT^-^A CAGTAGCTCA GTGGTAGAGC TATCGGCTGT TAACCGATCG GTCGTAGGTT 3 60 
GGCGGTAACA CGGGTTCGAG TCCCGTAGgA GTCATACAAG CAGAAGTGAA ATATCGCTTC 480

TGTTTTTTTA TTACATATTT ATTGTTGAGG AAGGTTGTCC GAGCTGGCCG AAGaGCACGC 540

CTGGaAAGTG TGTAGGCGTC ACAAGCGTCT CAAGGGTTCG AATCCCTTAC CTTCCGTAAA 600

GGcGCTTAAA TTGGTTTTAC CCATTTTAAG CGCTATTTTT ATTTTGGACT CAATCCCTTG 660

ATATATCTGC ATTTGAGCTA TTATCCTCAT TTTTACACTT CTTATTTATT TATATCCATT 720

TAAAATTTTT TAGCCACAAT GTGACTAATT TTTGaTGAAT AATCCTAATT TTAGtCACAA 780

GATTTTGAAG TTTAGTCACA AAACAAATCA TTCAGATTTT TTTCyATAAA TTTAGTTTCA 840



(2) INFORMATION FOR SEQ ID NO: 509: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1373 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 509:

TTCTATGAAA CTATCAGCTG TTTCTTTTTC CTTGGTAAAA ATCGTTTTAA CGCCATGATA 60

CATGTATTCT CTTTCAaCAC CCGATTGGTC ATAAGCATCG PAPJiCACT^ TAATTTCATC 120

TGAAATAACT GCATTATAAT TTGCAATTGC ATCTATTAAT TGCATTCTAG CTTCTTCTAA 180

ATTCTCTTTT GCAATGGCGC TTAGCGTTGG TGATTGTCCT ATCATATTGT ATCCATCAAT 240

GATTAAGTAA CGTTCTTTCA TTATATTTCT CCAACATCAT GTCTTTTTCG AAATACTTCG 300

TACATCATTA AACTTGCTGC AACCGAAGCA TTCAAACTGT TTACATGTCC AACCATTGGA 360

ATCTTAATAT AAAAATCGCA TTTATCACTT ACTAGGCGAC TCATACCCTG TCCTTCGCTA 420

CCAATTACAA TAGCCAATGA CATGTCCGCT TCTAGATTTC TATAATCTGT TGCATTATTA 480

GCTTCAGTGC CAGCTACCCA AAAGCCATTA TCTTTTAGTT CATCGATAGT TTTAGCTAAA 540

TTTGTCACTC GAATAACTGG TACATGTTCA ATTGCACCTG TTGAGGCTTT TGCAACTGTT 600

TGCGTTAGTG TAACTGAACG ACGTTTAGGA ATAATAACAC CATCAACTCC CGTTGCATCG 660

GCTGTTCTTA AAATTGATCC CAAGTTATGT GGGTCTTCTA AGCCGTCTAA TATAAGTACT 720

GTCAATAAAC CTTCTTTTTC TTTTTGCTGT TTTAAAAATT GATCGAAGTC AGCATATTCA 780

TATGGTGCAA TAAGCGCTGC AACACCCTGA TGTGGTGCAT TTGCTAAAAA ATCTAATTTA 840

GATTTTGGTA CAGTTTGAAC AATGATTTTT TGATCTTTTG CATTTTTTAA AATTTCATTA 900

ATTTGTTGCT TTTTAATACC TTCTTGAATC AATATCTTAT TTAyCGGATG CCCAGTAATA 960

TTCGTTTACT ATTGTTATTA TTTTATTTAA TAATGCCTCT AATCGTTCTT CTCTTTTTTC 1080
TAAATAaAGA AAACCTATCA CTGCTTCTAT CkCTGAACTT TTACGATATG TTTGAACATC 114 0 


AGTGTTTTTA GCTTTAGTAT GACTTTTCGC GTTACGCCCT CGCTTCAAAA TATCCATTTC 1200 

TTCGTCTGTA AACCATTCTT GCTCCATTAA ATATTCTAAC GTTTGCGCCT GACTTTTGGC 1260 

AGATACATAT TTTTTAGACA TTTGATGTAG TTTATTAGGC TTACTTTTAA GCTTTAAAAC 13 20 


GATATAGGTA CGTACATATT GATCTAaGAC TGCGTCnCCC ATATATGCTA AGG 1373
(2) INFORMATION FOR SEQ ID NO: 510:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 717 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 510: 

TCAAGTGCCA ACTACACAAC TGATTTACAT TCTTTAGGTC AATATGTACA AGAAGGCCGT 60 

CGCTTCTTAT TCGAAACAGT TGTAAAAGTA AATCATCCTA AATATGATAT TACTATTGAA 120 

AAAGATAGTG ATGATCTAGA CGGATTAAAT TATTTAGCTG GTAAAACAAT CGACGAAgTT 180 

AACACAAAAG CATTCGAAGG TACATTATTA GCGCATACTG ATGGTGGTGT TCCTAACATG 24 0 


GTAGTGAACA TTCCACAATT AGATGAAGAA ACTTTCGGTT ACGTCGTATA CTTCTTCGAA 3 00 

CTTGCTTGTG CAATGAGTGG ATACCAATTA GGCGTAAATC CATTTAACCA ACCTGGTGTA 36 0 

GAAGCATATA AACAAAACAT GTTCGCATTA tTAGGTAAAC CTGGTTTTGA AGaCTTGAAA 42 0 


AAAGrATTAG rAGAACGTTT ATAAAATACA TTACTTCAAA GATTAGTGAA GTTTGAAAAG 4 80 

ATAGAACTAG ACGTTAACTA TTTAAAGCAT ATTTTCGAGG TTGTCATTAC AAATGTAAAA 54 0 

ATGTAATGAC AACCTCGTTT TTATTTATAT GCAAGAACTA GGTTACTAGC TAaTGTGACA 600

AGATGTTAAG AGAAAATTAA aGATAAAATA ATATCTGCCT TACAATAATA TTGTTATACT 660 

ACTAGnGACT GATTTATTAG CATGATTACA TGTTAATGTT TCTTTACTTA GTAATTA 717 
(2) INFORMATION FOR SEQ ID NO: 511: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2700 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 
AATCTAATTT TTCCTCCTAA TCTCGGATTG GATAGTGGTC AATTTTTCTG GCCCGCCATT 60

TTAGCGTTTG TTCTAACTGG GATTGGTTTA CCATTATTAG GTGTGATTGT AGGTGCACTT 120 


GATAAAGAAG GATATATTGG CGCATTAAAT AAAATTTCAC CTAAATTTTC AATATTGTTC 18 0 

TTAATCATCA TTTATTTGAC TATAGGACCA CTTTTTGCAA TACCTAGAAC TGCATCTACA 24 0 

TCTTTTGAAA TGACAATTAC ACCAATTATA CATAGCAATA GTAGTATCGC TTTATTTATA 3 00 

TTTACGATTA TCTACTTCAT AGTCGTTTTG TATATTTGTT TAAATCCATC TAAGTTAATC 36 0 

GATCGTATTG GTTCATTATT AACACCATTA TTATTGATTA CTATTTTAGC GATGATTATT 420 

AAAGGATACT TAGACTTTAG CGGTAATAGT GCTGGAAAGG GCAATGAAGC ACTATATCAT 480

TCTAATTTTT CAAGTTTTGC TGAAGGCTTT ACACAAGGCT ATTTAACAAT GGATGCCATT 54 0 

GCAGCAATTG CTTTTTCAAT GATTGTTGTT AATGCAGTAA AACTAACAGG CATTACTAAA 6 00 

ACAAATCAAA TATTCAAACA AACTTTGACT GCTGGTTTAA TTGCAGCCGT AGCTTTAATT 660 

TTCATATATA TTTCATTAGG TTATATTGGT AATCATATGC CAGTAAGTGA CATGACGTTA 72 0 

GATCAATTGA AATCCAAAGA TCGAAACATT GGGACATATT TATTAACGAC AATGGCTTCA 780 


ACAGGATTTG GTTCATTCGG AAAATATTTA TTGGGCATCA TTGTGGCGCT GGCATGTCTA 84 0 

ACTACAGCAT GCGGGCTTAT TGTTGCAGTT TCTGAATATT TCCATAGAAT CGTACCTAAA 90 0 

GTATCATACA AAGCATTTGT ATTAGTTTTC ATTTTAATGA GTTTTATTAT TGCTAACCAA 960 


GGTTTAAATG CTGTTATCTC AATGTCAATT CCGGTATTAA GCATTGTATA CCCAGTAGCA 102 0 

ATAACTGTTG TATTATTAAT TTTAATTGCC AAATTCATAC CGACAAAACG CATTTCACAA 1080 

CAAATTCCAG TTATTATCGT ATTTATATTG TCGATTTTCA GTGTTATTAG TAAGTTAGGT 114 0 

TGGcTGAAAA TTAACTTTAT AGAATCATTG CCTCTAAGAG CGTATTCTTT AGAGTGGkTC 1200 

CCAGTAGCAA TTATTGCAAC GATATTAGGC TATCTAGTCG GCATATTTGT AAAACAAGAT 1260 

CCAATTAAAT ATCAACAGGA ATAACGAATA ATATAAAAGA GGTTGGGACA TAAATCCCTA 1320

AAAAAACAGC AGTAAGATAA TTTTCAATTA GAAAATATCT TACTGCTGTT CTCTATTTAT 1380 

ACAATACTTC GTATTGAATG GCTTCGCTTT CCTAGGGTGC CGTCTCAGCC TCGGTCTTCG 144 0 

ACTGGCACTG CTCCCTCAGG AGTCTCGCCA TTAATACTAC GTATTAACGT GTAATTTTAC 1500 

TTTGAAATAC TTTAAAAAAA TAAGACACTT TGCCCAACTT GCACATAAAT GTAAAATTCA 1560 

ATAAAATAAA TTTTCTGTGT TGGATCCCTT CGTATAATTT AATAAATACT ACTAAACTAA 162 0 


ATTAACGAGG TGCCTTATGT ATAPiPJiATTA TAACATGACC CAACTTACAC TACCAATAGA 16 8 0 

AACTTCTGTT AGAATTCCTC AAAATGATAA TACGCGATAT GTTAATGAAA TTGTTGAAAC 174 0 
AAGATATGCA TACCGTAATG ATAGATATAG TTTTAAACGT GACTTCAAGC TATATGAATG 1860

TGATGACTGT TCATCATGTT CTTTGAGACA TCAATGCATG AAGCCAAATT CGAAATCCAA 1920 


TAAGAAAATT ATGAAGAATT ATAATTGGGA ATACTTTAAA GyCCAAATTA ATCAAAAGCT 198 0 

TTCTGAACCA GAAACGAAAA AAATCTATAG TCAAAGAAAA ATTGATGTAG AGCCTGTTTT 204 0 

TGGATTTATG AAGGCTATTT TGGGTTTCAC TCGAATGTCA GTTCGAGGAA TAAATAAAGT 2100 


TAAACGAGAG CTAGGTTTTG TATTAATGGC ACTTAATATA AGGAAAATAG CAGCTCAACG 2160 
AGCTGTACAT TATAAAATAC ATATCAAAAA AGCTGATTTC TATCAAATAA TTAATAGAAA 2220 
TCAGCTTTTT TACATTGCCT AAGAATTTAA TGTCCCAAGC CCTTTTTATC GAATAACTTA 2280

TTGTAAACCT TGTCTTTCTT GGTTATTGTT TTCGTTATTT TTTTCGTGTT TTTGTTTCCA 234 0 

CTCTTTTTGA GTCATTACAT CGTCAACTTG CATGTTAACT TCAACTACAT CTAAACCAGT 24 00 

AATATATTTA ACTTGTTCTT TAACTAATTC AGTTACTTTA CGGAAGATTT TTGGTGCTGA 24 60 

TTCACCATAT TCTAAGATTA CTTTTAAGTC TACAGCAGCT TGTTTTTCAC CAACTTCAAC 252 0 

AGATACACCT TGAGTAACAT TGTTGCCACT TGAGAATGCA TTAGTGAATG TATCAGTTAA 25 80 


GCCACCTTTC ATATCTAAGA TACCTTTAAC TTCACX3TGCA GCGATACCAG CAATTTTTTC 264 0 

AACAACTTCA TCAGAGAATG TTAATTTGnT TTTGAATTGA GGCTCCTGAT TTnGTTCnTG 27 00 

(2) INFORMATION FOR SEQ ID NO: 512:


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1135 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 512: 

ATCAAATACA ATAATTAAAA TAaTGGCTAA TACaCCTAaA ATaATAATcG AGTTaGAGTA 60

GCCTAAGCGA yCACCcTAAC aGTArACATT CTAGGCATAT ATTTaTCTTT AGACATTGAA 12 0 

GCCGCTAACA TCGGAAAAGC TGTAAATCCA GTGTTGGCTG CTAATACCAA AATCATCACG 180 

GTCGTTGCCT GTACAAAATA GAAGGCAGCA TTATCACCAA ATATTTGCAT AGCTAATTGT 24 0 

GATAAAACTG TCGTTTCCGT TTGTGGcAAA ATTCCATAAA CATATGCTAA ACCAACGATA 3 00 

CCAACTAATA AAAACGCTAA AATTGAACCC ATAGCAATTA ACGTTTTCAC AGCATTTTTA 36 0 

GCACTTGGCT CTCTAAAATT GGTTACCGCA TTTGAAATAG CTTCAACACC CGTTAACGAT 420 

— ^^^^^^T^-AA-"^^^ -^T-AATA^r AAGAATAATG TTACTCCAGG AACCGCAGTT 480 



AATATTAATA TCACTAACCC TATAATGAAA AGATATACTG GATAGGATAA TACGGTGGCA 6 00 

GATTCAGTTA AACCACGTAA ATTTAATATT AAAATAAAAA GTACAAGTAw ACATGCAATC 6 60 

AGTrCkTTAT GCCCATATAA ACTTGGGAAT GCAGCAACAA ATGCATCAGC ACCAGATGAT 72 0 

ATmCTAACAG CGACAGTCAG TATGTAATCG ACTAATAATG AGCCTCCTGC AAGCAATCCC 7 80 

CATTTTTCTC CTAAATTGGT CTTGGACACC ATATACGCGC CGCCACCTTT AGGATATGCA 84 0 

TAAATAATTT GCCTATAAGA CATAATTAAA GCAGCTAATA AGATCAGAAC AGCACCTGCA 90 0 

ATCGGTAAAG TATACCAAGT TGCAACTGCA CCCACTACTG ATAGTGTAAT CAGTATTTGT 96 0 

TCGGGACCGT AAGCCACTGA AGATAATGCA TCCGACGAAA GAATCGCTAA CCCTTTAAAC 1020

TTCGATATCT TTTCGTCTTT TAGTTCTCTG TTTTTTTTAG GTTGCCCTAT AATAAGtCTT 108 0 

TTAAATTGAT kGAcATAAtC TCCTATTCCT TTTTATAGTT TTnGATGGAA ATCAC 113 5 
(2) INFORMATION FOR SEQ ID NO: 513: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 918 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 






(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 513: 

riATGGTAAA ACTTTAACAC AAGCATTACA TAAATCGCAA TCATTTATGT GGCAAAAACG 6 0 

TTGAATTTAA GCCATTTCTA TTCATTTTAC GTAATTATTA GCCGTATATG TGTAATAATA 12 0 

CACATTTTAT TCAGATTTTT TATCGCGCTC CATTAAATCT TTTACGCATT CTTTTACTGA 180 

GATATTTTCA AATAATACTC TATATAATGC ATTTGTAATT GGCATATCCA CATTTTTTTC 240 

TTTAGCTAAA TGATAAACTG ATTTAGTTGT ATAAATACCT TCAACAACCA TATTCATTTC 300 

AGATAATGCT TGATCCATTG ATTCACCTTG TCCAAGTTTA TATCCTAATG TGAAATTCCG 360

AGAATGTGTT GATGTGCAAG TAACGATTAA GTCACCGATA CCACCTAAAC CTAGAAATGT 4 20 

CATAGGATCG GCACCTAACT TTTCACCTAA TCTACTAATT TCCGCTAAGC CACGAGTCAT 48 0 

TAATGCAGCT TTTGCATTAT CACCGTAGCC AATTCCAGCT ACGATACCAC TTGCTACTGC 54 0 

GATGATATTT TTCAATGCAC CACCAAGTTC AACACCAATC AAGTCATCAT TCGTGTACAC 6 00 

ACGCAAATAA TCATTCATAA ATAAATCTTG CGTTAATTTA CTTACACTTT TATCTTTTGA 66 0 

TGAAGCAGCA ACTGTAGTTG GTTGCTTGAC TACAACTTCT TCCGCATGAC TTGGCCCTGA 72 0 

CAACACGCCA ATACCTGCAT TATATTCAGG TGAAATAGAA TCTTCAATCA TTTCTGACAC 780 
CAGCTTATCA TTAATTTGAG AAGCAACTTC TCGCATTGCT TTAGTAGGTA AAGCCATTAA 900
GTAAATATCT GCAAATTG 918 
(2) INFORMATION FOR SEQ ID NO: 514: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 587 base pairs
(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 514: 
CAATACTCGG TATTATAGAT AGTCCTACTA CATTTATATG GGTTTGTATC ATTACATTAA 6 0 

TTGCACATCA ACTTGAAGGT AATATCATTA CACCAAATGT AATGGGTAAA TCTTTAAGTA 120 
TCCATCCTTT AACAATTATC GTTGTTATTT TAGCAGCAGG AGATTTAGGT GGCTTTACAT 180

TAGTTCTGAT TGCAGTGCCA TTATATGCTG TACTTAAAAC GGTTGTTAGT AATATTTTCA 24 0 

AATACCGCCA ACGCATTATT GATAAAGCAA ACAGTAATGT TAAGGACTAA TTCTGTGGAT 3 00 

GTCTTTTAAG AATATAAGAT ACTATCGCAT CAAAAGTTGA AACTACAGCT TTTGAGGCGA 3 60 

TTTTTTTGTG CATAAAAAAT CAGTCmAATG AAATATCAAA TAATTTTCCA TCAGTCCGAT 42 0 

TATTATAAAA GCAAAAAAGC TTTGCTCACA TATATAATAA CGTGAACAAA GCTGTTGAAT 4 80 


GATATTATTT AATTGCGTGG AATCCGCTAT CTACATGAAT ATTTTCACCT GTAACGCCAC 54 0 

TTGATAAATC ACTTAATAAG TAAGCCGCAG TTTTACCTAC TTCTACT 587 
(2) INFORMATION FOR SEQ ID NO: 515:


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 812 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 515: 

GGTTCATTCC AAAAAAGTAC GCGATAATTA GCGAAGAAGA ATTTGAAAAT TTAAATGTTG 6 0 

TTAAACCAAA TAAAAATAAT GTTTTCTGGT CAGTTATAGG AAGTTCGGTT TTGTTTGGAG 12 0 

TTACTTTAAG GAAATACATA CATGTTTTTG ATGTTCAATT AGATAAGCTA GTTGTAATGA 180 

TATTGTGTGC TCTCGCTTTA ATTTGTGTTA TAGTTTTTTA TTTTAACTTA AATAGAAAGC 24 0 

TTAAGTTAAA AGTGTTTGAT ACAAATATTG AAAAAnATAA GAGAGTTATA TAAAwAcCAA 3 00 
TTTCATTAAT TGCCCTTATG ACAATCGAAC CTCAAAATAT AATAATATTT ATTTATTGGA 420

TTATGATGAC AATGCTTTTC nTTTGTTAA ATATGACTTC GATAGGTAAT GAAAAAGTTC 4 80 

GCGTTATAAT GAAAAATAAT TGATTACATT TAAAATATTC TAAATGTTGT CGACACAATC 54 0 

CTTTTAAGAC GCTAGTAGAA TTTAAATGAC TTCTAATGTA TATGAAAGTG TATCAATATA 60 0 

AAACCAATTG AAAAGAAGTG GAGACATTGC TTTGTGAAAC TGAAAATATT AATAAGAATC 66 0 

CCAAATATAG AATTATCAAA TACAAAGATG AATATTTGAT GATTGATTTA GTAAGTACAT 72 0 

GGTTAGCACT CTTTTTCCCA ATGATTAATT GGCTGATTCC AAAAAAGTAC GTCAAAATCA 780 

GCGAAAAAGA TTTTGAAACT TTAAACATTG TG 812 
(2) INFORMATION FOR SEQ ID NO: 516:



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 526 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 




(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 516: 

GTTCTATTAT TTCTTCCAAT GGTAATGGAT TTTCGTAAAA TGATTGATAT AAGTTGATAA 60 

TCTCAATAAC TTTTAATTGA TCTGGAAACA TCGTTTTTTG GAACATTATA CTGCGATTTT 12 0 

CACTTTGTAA TAACTTGTCT TTATCAAATA TCTCACCAGA ATTAGCATTA ACATTACCAA 180 

TTAATATATC AATTAACGTT GACTTTCCAG CACCATTTTT TCCAATTAAA GCGATACATT 24 0 

TACCTTGTTC AATATCGAAC GAAATATTTT TTAGAACACA TCTTTTATTA AATGACITGT 3 00 

TGATATTAGA TATTTGAATC AATTTAATCA CCTCTATTTT TTTCTTAATT TAATATTAGT 360 

AAATTTATTA GATTTAAAAT AGAACACTTT GTCATAGATT TGAAATGACA AATGTCATTA 4 20 

TTAGATkTAC ATAATATATT TATCGTtATT TTAATTTTGG GCAAAATAAA AAGAGCCTCT 480

ATAATCGrGc TCCTTACAAA TAaATTATAA AAttGGCGAA cTAAAT 526 
(2) INFORMATION FOR SEQ ID NO: 517: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4544 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 517: 
GGAACACCTA AAGAAAGAAA TGATGCATTA AACACAOAGG CTGATATCTA TGTAACCAAT 120

AAAGAAAATA CTAAATGGTT ATGCGATCAA TATAAAAAAG AATGGCCATT TGATATGGTT 180 


GTAATTGATG AACTGTCTAC ATTTAAAAGT CCTAAGAGTC AAAGGTTTAA ATCTATTAAA 24 0 

AAGAAATTAC CACTCATTAA TAGATTTATA GGATTAACAG GAACACCTAG CCCAAATAGT 3 00 

TTACAGGATT TATGGGCTCA AGTTTATITG ATAGACAGAG GCGAAAGACT TGAGTCTTCA 3 60 


TTCAGTCGTT ATCGAGAAAG GTACTTTAAA CCAACACATC AAGTTAGCGA ACATGTTTTT 4 20 

AACTGGGAGC TAAGAGACGG ATCTGAAGAA AAGATATATG AACGAATAGA AGATATATGT 4 80 

TTAAGCATGA AAGCGAAAGA TTATCTAGAT ATGCCTGACA GAGTTGATAC TAAACAAACA 54 0 

GTAGTCTTAT CTGAAAAAGA AAGAAAAGTA TATGCAGAAT TAGAAAAAAA CTATATTTTA 600 

GAATCGGAAG AAGAAGGAAC AGTTGTAGCT CAGAATGGGG CATCATTAAG TCAAAAACTA 660 

CTTCAACTAT CTAACGGTGC AGTTTATACA GATGATGAAG ATGTAAGACT TATACATGAT 720

AAGAAGTTAG ATAAGTTAGA GGAAATTATA GAGGAGTCTC AAGGCCAACC AATATTATTG 780 

TTTTATAACT TCaAACATGA TAAAGAAAGA ATACTTCaAA GGTTTAAGGa AGCAACCACA 840 

TTaGAGGATT CAAACTATAA AGAACGTTGG AATAGTGGAG ACATTAAGCT GCTTATAGCA 900 

CATCCAGCAA GTGCAGGGCA TGGATTAAAC TTACAACAAG GTGGGCACAT TATTGTTTGG 960 

TTTGGACTTA CATGGTCATT GGAATTATAC CAACAAGCAA ATGCAAGATT ATATAGACAA 10 20 


GGACAAAATC ATACGACTAT TATTCATCAC ATTATGACCG ATAACACAAT AGATCAAAGA 1080 

GTATATAAAG CTTTACAAAA TAAAGAACTA ACGCAAGAAG AATTGATGAA AGCTATTAAA 1140 

GCAAGAATAG CTAAGCATAA GTAATGGAGG TATAAGATGG GAAAGGCATC ATACGATATT 1200 


AAGCCAGGTA CATTTAAATA TATTGAGTCA GAGATATATA ACCTACAAGA GAACAAGAAA 12 60 

GAGATAAATA GATTGAGAAT GGAGATACTT AACCCAACGA AAGAGCTAGA CACTAACATT 1320 

GTGTATGGAC CGTTGCAAAA AGGTGAACCA GTTAGAACAA CTGAACTAAT GGCAACAAGG 1380

TTATTGACTA ATAAGATGTT ACGAAACCTA GAAGAAATGG TCGAAGCAGT TGAAAGTGAA 144 0 

TACTTAAAGT TACCTGAAGA TCATAAGAAA GTAATTAGGC TAAAGTATTG GAATAGAGAT 150 0 

AAGAAGCTAA AGATAGAGCA AATAGGAGAT GCATGTCACA TGCATCGTAA TACAGTTACT 1560

ACTATACGAA AGAACTTTGT TAAAGCGGTA cGwtATCATG CAGGTATCAA ATAACATTGT 162 0 

GCAAAGATTG TGCAAAAGGC CTACAAATCT GTAGTAATAT GATAGTATCG GAAAGATGTA 1680 


TAAAGTTATC TAAAAGTTAT ACGACACAAG TACACGAGGC ACATCGCTAT GCGtGTGTCT 174 0 

TTT'lTTATGr AATCAAAGAG GTGTAAGAGA TGACCAAGCA TAATAACATT TATAAGCATG 180 0 
AGATAGCATT AGATAGGGAT AATCATCTTT GTCAAATGTG TTTACGTGAA GACATAGTAA 1920

CAGATGCAAA CATAGTGCAT CATATTATTT ATGTTGATGA AGATTTTAAT AAAGCTTTAG 1980

ACTTAGATAA TTTGATGTCA GTTTGTTATA GCTGTCATAA CAAAATTCAT GCAAATGATA 2040

ATGACAAAAG TAATCTTAAG AAAATTAGAG TATTAAAAAT TTAAATAAAA AAATAATTTA 2100

TTTTTATAGC CCCCTACCCA TCGGCTTAAA ATGTTTTTTC GACGGGTACC GGCGGGGGCC 2160

CTTCGCTTGC AACGCGGATA AACTTTTATG AAAGGGGGTC TTTATATGAA ATTAACAAAA 2220

AAACAGCTGA AAGAATATAT AGAGGATTAT AAAAAATCTG ATGACATATT AATTAATTTG 2280

TATATAGAAA CGTATGAATT TTATTGTCGG TTAAGAGATG AACTTAAAAA TAGTGATTTG 2340

ATGATAGAGC ATACAAACAA GGCTGGTGCG AGCAATATTG TTAAGAATCC ATTAAGCATA 2400

GAACTGACAA AAACAGTTCA AACACTAAAT AACTTACTCA AGTCTATGGG TTTAACTGCA 2460

GCACAAAGAA AAaAGATAGT TCAAGAAGAA GGTGGATTCG GTGACTATTA AAGnTTAAA 2520

TGAACCTTCA CCAAAACTAT TAACAACATG GTATGCAGAG CAAGTCACTC AAGGGAAAAT 2580

AAAAACAAGC AAATATGTTA AAAAAGAATG TGAGAGACAC CTTAGATATC TAGAAAATGG 2640

AGGTAAATGG GTATTTGATG AAGAATTAGC GCACCGTCCT ATTCGATTCA TAGAAAAGTT 2700

TTGTAAACCT TCCAAAGGAT CTAAACGTCA ACTTGTATTA CAACCATGGC AACATTTTAT 2760

TATTGGCAGT TTGTTTGGTT GGGTTCATAA AGAAACAAAA CTGCGCAGGT TTAmAGAAGC 2820

TTTGATATTT ATGGGGCGAA AAAATGGTAA AACAACTACT ATATCTGGTG TTGCTAACTA 2880

TGCTGTTTCT CAAGATGGAG AAAACGGCGC TGAAATCCAT CTTTTAGCAA ACGTAATGAA 2940

ACAAGCTAGG ATTCTATTTG ATGAATCTAA GGCGATGATT AAAGCTAGCC CAAAGCTTAG 3000

AGAAAATTTT AGACCTTTGA GAGATGAAAT TCATTACGAT GCAACTATAT CTAAAATTAT 3060

GCCACAGGCT TCAGACAGTG ATAAGTTGGA TGGTTTAAAT ACACATATGG GCATTTTTGA 3120

TGAAATTCAT GAATTTAAAG ATTATAAATT GATTTCAGTT ATAAAAAACT CAAGAGCGGC 3180

AAGGTTACAA CCCCTTCTTA TCTACATTAC GACAGCAGGG TACCAACTAG ATGGACCACT 3240

TGTTAATATG GTAGAAGCGG GAAGAGACAC CTTAGATCGA ATCATCGAAG ATGAAAGAAC 3300

TTTTTACTAT TTAGCTTCTC TCGATGATGA CGATGATATA AATGATTCGT CGAATTGGAT 3360

TAAAGCAAAT CCTAACCTAG GTGTTTCTAT CGATTTAGCT GAAATGAAAG AAGAGTGGGA 3420

AAAGGCTAAG AGAACACCAG ATGAACGTGG AGATTTTATA ACCAAAAGGT TTAACATCTT 3480

TGCTAATAAT GATGAGATGA GTTTTATTGA TTATCCAACA CTTCAAAAAA ATAATGACAT 3540

TATTTCCTTA GATGAGTTGG AAGGTAGACC ATGTACTATA GGTTATGATT TATCAGAAAC 3600

AACACATTCT TGGATTCCTA AGCATAAAGT TGAATATTCT AACGAAAAGA TACCCTATAT 3720

AGAATGGGAA GAAGACGGAT TACTAACAAT ACAAGATAAT CCTTATATAG ACTACCAAGA 3 78 0 

TGTTTTAAAT TGGATAATAA AGATGAATGA GCATTATGTT GTCGAAAAAA TCACTTATGA 3840

TAGGGCGAAT GCTTTTAAAT TAAATCAAGA GTTAAAGAAT TATGGCTTTG AAACAGAAGA 3 900 

AAcwAGACAA GGGGCTTTGA CCTTGAGCCC TGCaTTGAAG GATCTAAAAG AAATGTTTTT 396 0 


AGATGGGAAA ATAATATTTA ATAATAATCC TTTAATGAAA TGGTATATCA ATAATGTTCA 4 02 0 

GCTGAAACTA GACAGAAATG GGAACTGGCT GCCATCTAAA CAAAGCAGAT ATCGTAAAAT 4 080 

AGATGGTTTT GCAGCATTTT TAAACACATA TACAGATATT ATGAATAAAG TTGTTTCTGA 414 0 

CAAGGGTGAA GGAAACATAG tuATTTATTAG TATTArAGAT ATAATGCGTT AAGGAGGTGA 4200 

ATGTTATCGC AAAAGAGAAT ATTGTCACAC GCATAAAGAA AAAATTGATA GACAATTGGA 4 2 60 

TTGaTCAGTC AGCTTCTAAG CTTTATGACT TTAGCCCATG GAAAAATAAA TCTTTTTGGG 4320

GTGTAATCAA TAATACGCTT GAAACTAATG AAACGATATT TTCAGCTATT ACnAAGTTAT 4380

CTAATTCGAT GGCTAGTTTG CCCTTGAAAA TGTATGAAGA TTATAAAGTA GTTAATACAG 444 0 

AAGTATCTGA TTnACTTACA GTGTCACCGA ATAATTCTCT GAGCAGTTTT GATTTTATTA 4 5 00 

ATCAAATTGA AACAATCAGA AATGAAAAAG GTAATGCATA TGTG 4544 
(2) INFORMATION FOR SEQ ID NO: 518: 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 881 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 518: 

CATGGCAGAA TATTGAAGCT GTAAAAAAAG GACATGTAAT TTCATATAAA GCAGAAGATT 60

ATTGGTTCAC AGATCCTATT ACATTAGAAC ATTTGAGAAG TAAATTAAAA AAAGAAATTT 120 

TAAATAAAAA ATAATAGAAA TAAGTTGTAA AAATTTTCTT ATGCATTGGT ACTAATGTTT 180 

TTAAGGAGTG ATTAAATGAA GCAACTGGTT GGAATTCCCG AATCAATGTT AATTCCTTTG 24 0 

ATAGCTCGAG CAAAAGAGTA CGAAAACGAA AAACCAATAA TAAAAGACGC ACTATCAAAA 3 00 

AAAATATTTG ATGGTTTAGA TGATATGTAC AAAAATGTTA CATGTGATGA CATGTCTCAA 3 60 

ATTGGAATTA GTATACGTAC TGTGATAATA GATTGTGTTA CTAAGAGGCT TATCAAGGAT 4 20 

-r-r,r.Acn— ^^A'^ror^.nT CAATATAGGT TGTGGCTTAG ATACAAGGTT TCAAAGATTT 4 80 
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ACATTTTTTA AAGAAAGTAA TAGTTATAAG ATGATATCTA AATCTATGCT AGATTACAGT 6 00 

TGGATTGATG ATGTCAAAAA TTATAAATTT TTTAATAGTA AGTCAGATAT ATTGTTTATT 660 

ATTGAAGGTG TATTGaTGTA TTTTGATGAG AGTGTAATGA CTCAATTATT GGACACTATT 72 0 

ATCAAAAAGA TGGGAGATCA TAATTTGACA TTTGCGATTg AATTTTGCTC aAAAACAATT 780

GCGAATAATA CmAAGAGACA TCAATCGGTA TCCAAGTTAT CCTCACCACC TGTTTTTAAA 84 0 

TATGGGTACC ATGATTTAnA AAAATTGGAT GAnATTTACC C 881 
(2) INFORMATION FOR SEQ ID NO: 519: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 3122 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 519: 

TGAATaaAAA TATATTAATA GATAAACACA AATGTGTCcA AATACCCCTA GAGGTATTTG 6 0 

ACnAGTTCCA TCCAACTGTT TAAAATACCC CTACAGGTAT TTTTAGGGAG GTTATTATGA 120 

AACAATACGG AGAAAaGTTT ATCGATGAAT TTAGTAAAGC AGAATTGGAA AAACTAGCCA 180 

AGCAAGGGCA ATTAATTGAC GTTAGAACAG AAGAGGAGTA TGCATTAGGA CATATCAATG 24 0 

GTTCCATACT TCATCCTGTT GATGAGATTG AGTCATTCAA TAAAGAAAAA AATAAAACCT 300 

ATTATGTAAT CTGTAGAAGT GGTAACAGAA GTGCTAATGC TAGTAAATAT TTAGCTAAAC 3 60 

AAGGTTATAA CGTTATAAAT CTTGATGGTG GTTATAAAGC TTATGAAGAA GAAAACGATA 420 

GTTATGATAC ACAAGAAGAA TATAAAAGTA TAGAAATTAA AGCAGATCGT AAACAATTTA 4 80 

ACTATCGTGG TCTTCAATGT CCAGGGCCAA TTGTAAAAAT TAGTCAAGAA ATGAAGAATA 54 0 

TTGAAGTAGG TGACCAAATT GAAGTCAAAG TCACAGACCC TGGATTCCCT AGTGACATTA 600

AAAGTTGGGT GAAACAAACA AGGCATACTT TAGTTAAGCT TGATGAAAAT AACAATGGAA 66 0 

TTAATGCGAT TATTCAAAAA GAAAAAGCAA AAGATTTAGA TATAAATTAT TCTGCTAAAG 72 0 

GTACTACAAT TGTATTATTT AGTGGAGAAT TAGACAAAGC TGTAGCAGCG TTGATTATTG 780 

CAAATGGTGC TAGAGCTGCT GGAAAAGATG TAACTATCTT CTTTACTTTT TGGGGGCTTA 84 0 

ATGCATTAAA AAAAGTGCAA ACAGTTAATG TTAAAAAGCA AGGTATTGCA AAAATGTTTG 900 

ATTTAATGTT GCCCAAAAAG AATATACGAA TGCCTCTTTC CAAAATGAAT ATGTTTGGTT 960 

TAGGAAATAT GATGATGCGC TACGTAATGA AAAAGAAAAA TGTTGATTCA TTACCAACAC 1020 



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TCATGGGTAT TCAGAAAGAA GAACTTAGAG ATGAAGTTGA GTACGGTGGT GTAGGCACTT 114 0 

ATATTGGTGC TACTGAAAAT GCGAATCATA ATTTATTTAT CTAATTAAAT CTATTAATAA 1200 

AAGGAGTTGT TATCATGTTT TTTAAACAGT TTTACGATAA TCATTTATCT CAAGCATCAT 126 0 

ATTTAGTGGG TTGTCAACGT ACAGGAGAGG CAATAATAAT AGACCCTGTT CGTGATTTAT 132 0 

CGAAATATAT AGAAGTTGCA GATTCTGAAG GTTTAACAAT TACACAAGCT ACAGAAACAC 138 0 

ATATTCATGC TGATTTTGCT TCAGGAATTC GTGATGTGGC TAAACGCTTA AATGCAAATA 144 0 

TATATGTGTC TGGCGAAGGT GAAGATGCAT TAGGGTATAA AAATATGCCA TCAAAAACAC 1500 

AATTTGTTAA ACATGGAGAT ATCATTCAAG TAGGCAATGT TAAATTAGAA GTTCTGCATA 1560

CTCCAGGACA CACGCCTGAA AGTATTAGCT TTTTACTCAC TGATTTAGGT GGTGGTTCAA 162 0 

GTGTTCCGAT GGGATTATTT AGTGGTGACT TTATTTTTGT TGGTGATATA GGTAGACCTG 16 80 

ATTTAtTAGA AAAATCTGTT CaAATAAAgG OTTCTACAGA AATTAGCGCG AAACAAATGT 1740

ATGAGTCCGT TCAAAATATT AAAAATTTAC CAGACTATGT TCAAATCTGG CCGGGTCATG 1800 

GTGCTGGAAG CCCTTGTGGT AAAGCATTAG GTGCCATACC TATATCTACA ATAGGTTATG 1860 

AGAAAATTAA TAACTGGGCA TTTAATGAAA TTGATOAGAC TAAATTTATT GAATCATTAA 1920 

CATCAAATCA ACCAGCACCA CCGCATCATT TTGCACAAAT GAAACAAGTT AATCAGTTTG 1980 

GTATGAATTT ATATCAATCA TATGATGTTT ATCCTAGTTT AGATAATAAG AGAGTAGCAT 2 04 0 

TTGATCTTCG TAGCAAAGAG GCCTTTCACG GTGGCCACAC AAAAGGAACA ATCAATATAC 2100 

CATACAACAA AAACTTTATT AATCAAATTG GTTGGTACTT AGATTTTGAA AAAGATATAG 2160 

ATGTAATTGG AGATAAATCT ACTGTTGAGA AAGCGAAACA CACTTTACAA TTAATTGGGT 2220 

TTGATAAGGT AGCAGGCTAT CGTTTGCCAA AATCAGGCAT TTCAACCCAG TCCGTTCATA 22 80 

GCGCTGATAT GACAGGTAAA GAAGAACATG TATTAGACGT ACGTAATGAT GAAGAGTGGA 234 0 

ATAATGGACA CTTAGATCAA GCAGTTAATA TTCCGCATGG TAAATTATTA AATGAAAATA 2400

TTCCTTTTAA TAAAGAGGAT AAAATATATG TACATTGTCA GTCAGGTGTT AGAAGTTCAA 24 60 

TTGCAGTGGG TATATTGGAA AGCAAAGGTT TTGAAAATGT GGTGAATATT AGAGAAGGCT 2 52 0 

ATCAAGATTT TCCAGAATCA TTAAAATAAT TTAAGGATGT GGAAAAAATG AATAAGCATT 2 5 80 

ATCAAATTGT TATTATTGGT GGCGGTACAG CAGGTGTTAC CGTAGCATCA AGACTATTAA 264 0 

GAAAAAATCA AAACTTAAAA GAGAAAATAG CAATTATAGA TCCAGCAGAC CATCATTACT 27 00 

ATCAACCATT ATGGACGTTG GTTGGTGCAG GGGTATCTAG TTTGAAAAGT TCTCGTAAAG 2760 

' — r — , - --T^pTA^' — nAA'V^TGCTA ACTGGATAAA ACAGGCTGTT TCAAGTTTTC 2820 
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TAGTAGCTCC AGGATTACAG ATTAATTGGT CTTCAATTAA AGGACTAAAA GAAAATATAG 2 94 0 

GTAAAAATGG TGTTTGCTCT AACTATTCAC CTGACTATGT TAACgAAACT TGGAACCAAA 3 00 0 

TTTCTAATTT TAAACAAGGA AATGCCATTT TTACGCATCC AAACACTCCT ATAAAGTGTG 3060

GAGGTGCgCC TATGAAAATT ATGTATTTAG CTGAAGATTA TTTTAGGAAA CATAAAATCC 312 0 

^'^ 3122 

(2) INFORMATION FOR SEQ ID NO: 520: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3982 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 520:

ATAnAGATAT AnATnAATAT ATTGAGGTCA AACGATGATA ATTAAAATTT TAACAATTCT 6 0 

ATTACTACTT TGTATATTGA GCTATTTGGT TACAAATAGA AAGAAGCCTT TTCTGTTCTT 12 0 

AAAGACACTC TTTATGGGTG TGGTATTTAT CTTTATAGGA TATATTTCAC TGGCAATATC 180 

TGCCGTAATT ATTTATGGTA TTATTCAATT TATCACAATT GATTTTGGTA GTTTTTTCTT 24 0 

AATGGGTATT ATATTGATCT TGATTTCAAG TATATTCCAA TTATTTATAG TTAGATTACT 3 00 

TTTTAGAAAA AAGAATGTCG ATTTGACAGA GGTTGTCGTT TTAsAsCATT TAATTCAATG 36 0 

GTTCTTAGTT TACTTTGCGA TCTATCAAGC AGTAAATGAA AAAATGGACA TTAATGATAT 420 

TAATATCGAC AATTTCCAAT CTGTCTTTTT TGACGTGTCT AATTTGAATT TAGTAATTCT 480 

ACCAACGTTA ATCATTAGCT GGGTCACAAT ATTTAACTAT AGAATGAGAA GTTACAAATA 54 0 

AAATCTATGA GATTATACCT TCAGACACCA ACATTCAAAT GGTGTCTTTT TTGTTGTGTG 600 

GTTTTATTTT TGAAATTCGA AAAAGTAGAG GCATGAATTT TTTGACTAGT GTATAAGTGC 660

TGATGAGTCA CAAGATAGAT AGCTATATTT TGTCTATATT ATAAAGTGTT TATAGTTAAT 72 0 

TAATAATTAG TTAATTTCAA AAGTTGTATA AATAGGATAA CTTAATAAAT GTAAGATAAT 78 0 

AATTTGGAGG ATAATTAACA TGAAAAATAA ATTGATAGCA AAATCTTTAT TAACATTAGC 84 0 

GGCAATAGGT ATTACTACAA CTACAATTGC GTCAACAGCA GATGCGAGCG AAGGATACGG 900 

TCCAAGAGAA AAGAAACCAG TGAGTATTAA TCACAATATC GTAGAGTACA ATGATGGTAC 96 0 

TTTTAAATAT CAATCTAGAC CAAAATTTAA CTCAACACCT AAATATATTA AATTCAAACA 10 20 

TGACTATAAT ATTTTAGAAT TTAACGATGG TACATTCGAA TATGGTGCAC GTCCACAATT 10 80 
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TCAAAATCTT GTGAGAGAAT TTGAAAAAAC ACATACTGTC AGTGCACACA GAAAAGCACA 1200 

AAAGGCAGTC AACTTAGTTT CGTTTGAATA CAAAGTGAAG AAAATGGTCT TACAAGAGCG 126 0 

AATTGATAAT GTATTAAAAC AAGGATTAGT TAAATAAAAC TTCAATCGTT GCTGTTATCT 1320 

GGAAATAATT AATTAAATGT TATGTTAATT TTTGTTAATG AAAAAAGTAA TCTATTTAAT 13 80 

GACAGGTTAA TGTAATTGTC CTGAAATTGA CTATATACTC AGTAAGTATC AATTTTAAGG 144 0 

AGAGCTTATA ATGAAATTTA AAAAATATAT ATTAACAGGa ACATTAGCAT TACTTTTATC 1500 

ATCAACTGGG ATAGCAACTA TAGAAGGGAA TAAAGCAGAT GCAAGTAGTC TGGACAAATA 1560 

TTTAACTGAa aGTCAGTTTC ATGATAAACG CATAGCAGAA GAATTAAGAA CTTTACTTAA 162 0 

CAAATCGAAT GTATATGCAT TAGCTGCAGG AAGCTTAAAT CCATATTATA AACGTACGAT 16 80 

TATGATGAAT GAATATAGAG CTAAAGCGGC ACTTAAGAAA AATGATTTCG TATCAATGGC 174 0 

TGATGCTAAA GTTGCATTAG AAAAAATATA CAAAGAAATT GATGAAATTA TAAATAGATA 1800 

ATAAATAAAA CAGGTTGAGA CAAAAAATGG TCTTAACCTC TTTTCAATTT GCATATGTGA 1860

TAAATTCTAT ATCAAAATGC TTATGTATAA TGAATGACAT TTAAAAGTAG GGGAGACAAA 192 0 

TATAAATACA ATAGTTCCTA GGATTACTCT CAAAATAACT ATATCAATTA TTTACTTTGC 1980

TCTCCTATTT TTTAAAATAT GTACATGTTT AAACAATCAA AAGTGTACAA TATTAAATTA 2 04 0 

TCATTTCCAG TTCTAGTGCT ATATTGGTAG TAGTTGACTA AATGAAAATA AGCTTATAAC 2100 

AAGTTTTTTC AATACTCGTG GGGCCACAAC AGAGAGAAAT AGGATCACCA ATTCCAACAG 2160 

ACAATGCAAG TTGGCGGGGC CCCAACATAG AGAAATTGGA TCACCAATTT CAACAGACAA 2220 

TGCAAGTTGG GGTGGGGCCC AACACAGAAG CTGGCGAAAA GTCAGCATAC AAAAATGTGC 2280 

AAGTTGGCGG GGCCACAACA GAGAGAAATA GGATCACCAA TTCCAACAAA CAATGCAAGT 2 34 0 

TGGCGGGGGC CCCAACATAG AAGCTGGCGA AAAGTCAGCT TACAAAAATG TGCAAGTTGG 24 0 0 

CGGGGCCCCA ACATAGAGAA ATTGGAACCC CAATTTCTAC AGACAATGCA AGTTGGGGTA 24 60 

GGACATCGAT AAAGAAATAC TTTTTCTTTA GCAATTAGTA TTTCTTATGC ATGAGCTTTA 2520 

CTCATGTATT CATTTTTTAA GTACaCATTA GCTACAGCTA ATGATAAAGA ACCACTACAT 2 58 0 

AATAAATCAT TAGTGGTTCT TTATCATTTC TATCTCACTC TTTTACTGGA AGAAAAAGTT 264 0 

TACGTTTGTA GAACATGCCA CAATACCAAA AATAATTAAG AAAAATAAGA CGATAAGCAT 2 70 0 

GATGACACTT TTCAAACAAC CTCTATCAGT TTCTTTCGAT TTTCTTTGTT GAACCTTTTT 2 76 0 

ATAATCTTCA AGTAGTTTTG CGGCTTTTTT ATTTATATGT TTATTCATGA TGTTGACTCC 2 82 0 

7TATAATATA TGTTTAATTC ATTAAAATAG TTGAAAACAT GACTTGAAAT AAAGATATAA 2 88 0 
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AATTTGATGA TGATATTTGC TTTTTATTTT CCAAATGGAA TTTACTTAAA CTGATGCATT 3 0 00 

AAAATATTAA TGAAGCACTA GAATACATAA ATGAATAGTa ATGGTGcACA GTATAGAATA 3 060 

ATTAAGGCTA TATTAAGTAT AAATATCGTT AACTGTAAGC TATCTTTAGT TTTAATATAA 312 0 

ACTATTAGGA TAATCGACGT AAGAAGAATC ATATATATTA ATGATGAAGA AGTCCATACA 3180 

AAATCCGCAT CATTTGTTGT TAATAATGGG ACTATAATTA ATCCGAAATT AATCATGCAT 3 24 0 

GCTATATATA CTATAATGTT ATACACAATG TTAATTTTTG TTCACCACCT TATACTTCTA 33 00 

TTTTAAAAAC TTCTTTATAA TGATATATTG TTTAATGTTG AAATAATTAG ATTATCTAAT 33 60 

TTTCATTTGC TTTACATGTA AAAGGCTATA TATAGTATGC TCTTTATGAT TCTAAATGCT 34 20 

TTTTAATATT TAATGCTCAT CAACATTTGG ATTTTGAATA TTCAATTCAA AAACTTTATT 3480

AGCTACGTCA ATTGTAAAAT CAGAACCATA GTTGACATGA GCTACTTTTA ATTTTCCATC 3 54 0 

TAAATAATAG ATTGCGATTG CAACATCGTA AAATTCGTCA ATGACAAATA AACTCTTTTC 3 6 00 

GTTTGTTACA ACCTCATGCT CTCCTGAGTA TACAACGTTA ATTTCCCAAT CATTAAAAAC 366 0 

CATTTGTTAA CCTCCTTGAA CATTTAAATT GATTCAACTT AAGTTTAACT TATTCATACA 3 72 0 

ACTTCGTACA ATATCTAGAT GAACATTAAT TGTATTTCTA GAAATCTTTT TCAATTATAT 3780

GTACTAATTA TACTTTTAAA TTTCTTATTT CAGTATAGTT TTAAAACGAT TTTAAAATAA 3 840 

TTCTGCAAAT ATATTAACAC ATAATGTGTT CAAAAAGTTT TGAACAATTT CAAAACTTTT 3 90 0 

ATATAAAGGG nTTGACAACA TGGATTCAAA TnTCTTATTT TAAAAATTAC CTCATATAGT 3960

GTCATGTTAG CCAATTTTTA AG 3 982 
(2) INFORMATION FOR SEQ ID NO: 521: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1353 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 521:

AGCTTGGaTG ATTTAATaTG GtCCtTTCCC AaCcTTAGAT AATGAAAGAT tTGATAATCC 6 0 

TGAATATAAA GAAGCTATGA AAAAATATcA ACAGAGATTT ATGGCTGinAG ATGAGGCTTT 12 0 

GAAGAAATTT TTTAGTGAAG AGAAAAAAAT aAAAAATGGA AATACTGATA ATTTAGATTA 180 

TCTAGGATTA TCTCATGAAA GATATGAAAG TGTATTTAAT ACTTTGAAAA AACAAAGTGA 24 0 

GGAGTTCTTA AAAGAAATTG AAGATATAAA AAAAGATAAC CCTGAATTGA AAGACTTTAA 3 00 
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GTTAGGTAAA ACATTTTATC AAAACTATAO AGATGATGTT GAAAGTTTAT ATAGTAAGTT 42 0 

AGATTTAATT ATGGGATATA AAGATGAAGA AAGAGCAAAT AAAAAAGCAG TTAACAAAAG 4 80 

GATGTTAGAA AATAAAAAAG AAGACTTAGA AACCATAATT GATGAATTTT TTAGTGATAT 54 0 

AGATAAAACA AGACCTAATA ATATTCCTGT TTTAGAAGAT GAAAAACAAG AAGAGAAAAA 600 

TCATAAAAAT ATGGCTCAAT TAAAATCTGA CACTGAAGCA GCAAAAAGTG ATGAATCAAA 660 

AAGAAGCAAG AGAAGTAAAA GAAGTTTAAA TACTCAAAAT CACAAACCTG CATCTCAAGA 720 

AGTTTCTGAA CAACAAAAAG CTGAATATGA TAAAAGAGCA GAAGAAAGAA AAGCGAGATT 780 

TTTGGATAAT CAAAAAATTA AGAAAACACC TGTAGTGTCA TTAGAATATG ATTTTGAGCA 84 0 

TAAACAACGT ATTGACAACG AAAACGACAA GAAACTTGTG GTTTCTGCAC GAAGAAAGAA 900 

ACCAACATCA CCGACTACAT ATACTGAAAC AACGACACAG GTACCAATGC CTACAGTTGA 9 60 

GCGTCAAACT CAGCAACAAA TTATTTATAA TGCACCAAAA CAATTGGCTG GATTAAATGG 102 0 

TGAAAGTCAT GATTTCACAA CAACGCATCA TiTCACCAACA ACTTCAJLATC APArGCATAA 1080 

TAATGTTGTT GAATTTGAAG AAACGTCTGC TTTACCTGGT AGAAAATCAG GATCACTGGT 114 0 

TGGTATAAGT CAAATTGATT CTTCTCATCT AACTGAACGT GAGAAGCGTG TAATTAAGCG 1200

TGAACACGTT AGAGAAGCTC AAAAGTTAGT TGATAATTAT AAAGATACAC ATAGTTATAA 12 60 

AGACCGAATA AATGCACAAC AAAAAGTAAA TACTTTAAGT GAAGGTCATC AAAAACGTTT 1320 

TAATAAACAA ATCAATAAAG TATATAATGG GCA 13 53 
(2) INFORMATION FOR SEQ ID NO: 522: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1987 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 522: 

GTCTGAGTCA GGTGCTGTTT GGTTAGATGC TGAAAAAACA AGTCCTTATG AATTTTATCA 6 0 

ATTCTGGATT AATCAATCAG ACGAAGATGT AATTAAATTC TTnAAATACT TTACTTTCTT 12 0 

AGGAAAAGAA GAAATTGATC GCTTAGAACA ATCTAAAAAT GAAGCACCGC ATTTACGTGA 180 

AGCTCAAAAA ACATTAGCTG AAGAAGTAAC TAAATTTATT CATGGTGAAG ATGCATTAAA 24 0 

TGATGCAATC CGTATTTCAC AAGCATTATT TAGTGGTGAT TTAAAATCAT TATCAGCGAA 300

AGAATTAAAA GATGGATTTA AAGATGTGCC TCAAGTGACA TTATCAAATG ACACAACAAA 3 60 
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TGTTAACAAT GGTGCGATTT ATATTAATGG TGAGAGACAA CAAGATGTTA ATTATGCTTT 4 80 

AGCACCAGAA GATAAAATTG ATGGCGAATT TACGATTATT CGTCGCGGTA AGPJkPiAPiATPi 54 0 

CTTCATGGTT AACTATCAAT AAATATAATT GCATAGCTAA ATAAATTAGA GCCTACTCAT 600 

ATTCATTCCT AAGAATGTAA TGAGTAGGCT CTTAATGTAC TTTTCTGTCT GTAAATTATC 66 0 

TAAAGAAATC TCTATCGCCT TGGCCAGGTG ATTGACGTTC TGATCGGCTT TGACGTTTTG 72 0 

GTTGTTCTTT TTGTTGTTTT AATTTCACTT TAACTTCTTT TGTTTTACCA TCACGGATAA 78 0 

CGGTAACAGT GACTGATTCA CCAGGTTTTT TATTTTCATA TAAATAGCTT CTTAAATCAA 84 0 

CATCATCTTT AATTTTCTTG CCATCAATTT CTGTAATAAT ATCACCTTTT TTAAGATCAA 900 

TATCACTATC AGCTTTGGCG ACATAAATAC CGTCTTCTCT ATCAGTATGA AGTTGCTCGC 96 0 

GCTCTTCTTC AGGAATATCT TTCAAATTAA TTAAACCAAT ACCAATCGAA GGGCGGTCAA 102 0 

TTTTACCATG TTTTACAAGT TGTTCAATTG TTACTTTAAC TTCATTACTT GGAATAGCAA 108 0 

ACCCGATACC TTCAACTTGT GTCGCAGCAA TTTTCATTGA GTTAATACCA ACTAAATTAC 114 0 

CATTAATATC TACTAATGCG CCACCTGAGT TACCTGGGTT AATAGCAGCA TCTGTTTGAA 1200 

GAACGCTAAC TTTTGTATTG CCACCAGTTG TCTCAGCGTC AATCGTACGT TCGCTTGCTG 1260

AAATGATACC AGATGTTACA GAGTTAGCAA ATTGTAATCC TAATGGGTTA CCCATTGCGA 13 2 0 

ATACGCTATC GCCAGTTTGT ACTTTTGAAG AGTTGGCAAA TTGAATCGCT TTAATACCTT 13 80 

TTGTATTTTC AATTTTAAGT ACAGCAATAT CAGTTACTGC ATCTTTACCA ACTAATTTCG 1440

CTTTAACTTG TTTTTTATTA TGTAATTGGa CTCTAATTTC ATTTGCGCCA TCAATAACAT 150 0 

GATTGTTTGT AACGATATAA GCTGAATTGT TGTTTATTTG aTAGATAACA CCTGAACCTA 15 6 0 

CTCCAGCTTC AGATGGTTTA GATGATTTGC CyTTTAATAA GTCGTCTACA CTTGATGCTT 162 0 

TTtGCATGTk AATAACTCCA ACAATTGTAG GGGAGACAGA TTTTATCATT TCATGAACGG 16 8 0 

TACCGAATTT CTTGCTTTGA CCGTCTAATT GATTGCCACC TTTATTATTT GTTGTCTGAA 174 0 

CAGTTGAACC ATCTTTATTT AAAATTGTAC TATTTAATAC TTTGCCTATA CCAAGTACTA 180 0 

GAAGTGCACC AATAATTCCA GCAATCAATG CAACGATGAC TGTnTTAAAC CATGGAAATT 186 0 

TAGGTCTTCT GTATCTAGGT GTTTGGCTAT GGTTTGTTGT AGAATGATCT GTATGATTAA 192 0 
AATCTGACAT ACTTAACCTC CATTATATGA TTTATATATG CTTTAATTAT GTCTTTTnTT 1980

TATAATT 1987
(2) INFORMATION FOR SEQ ID NO: 523:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1351 base pairs
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TO 



(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 523: 

TTAGAAGTCA AATCATTAnT GGCGTCnTAT CGAGTATACT ATTAACTTCA ACTATTTTAG 60 

CAATTGCATA TATTTTAATG TGGTTTAACG GCCATATGAC aCTAACTTTG ACCTTAACGA 12 0 

CAATAATTAC AAGCTGTTTA ACCTTATTAA TATGTAGTAT TTTTATTAAT CCACTTATAC 180 

AAAAAATTAA GCAGTTTAAT ATAAAAACTA AGCAATTTGC TAACGGAAAT TACGCAAGCA 24 0 

ATGATAAAAC GTTTAATTCA CCAAAAGAAA TTTATGAATT AAATCAATCT TTTAATAAAA 300

TGGCTTCTGA AATTACGCAA CAAATGAATC AAATTAAATC CGAACAACAA GAAAAAACAG 360 

AACTGATTCA AAACTTAGCC CATGATTTAA AAACACCTTT AGCAAGCATT ATTTCATATT 42 0 

CTGAAGGACT ACGTGATGGT ATAATCACTA AGGATCATGA GATTAAAGAG TCATACGACA 4 80 

TATTAATTAA ACAAGCAAAC AGATTATCAA CATTATTTGA TGATATGACT CATATTATCA 54 0 

CTTTAAATAC AGGTAAAACA TATCCCCCAG AATTAATACA ACTAGACCAA TTACTTGTAT 6 00 

CAATATTGCA ACCATATGAG CAACGTATCA AACATGAAAA CCGCACATTA GAAGTGAATT 660 

TCTGTAACGA AATTGATGCA TTTTATCAAT ATCGAACGCC ACTTGAGCGT ATTTTAACAA 720 

ACTTACTTGA TAATGCGCTA AAATTTTCAA ATGTTGGTAG TCGCATTGAT ATTAATATTA 7 80 

GTGAAAACGA AGATCAAGAT ACTATCGACA TTGCTATTAG CGATGAAGGT ATTGGCATTA 840 

TACCAGAACT ACAAGAACGT ATATTCGAAC GTACATTCAG AGTAGAAAAC TCTCGTAATA 900 

CAAAAACGGG TGGTTCTGGA TTAGGCTTAT ATATAGCTAA TGAACTCGCG CAACAAAATA 960 

ACGCAAAAAT CAGTGTAAGC AGTGATATAG ATGTAGGAAC TACGATGACT GTAACATTAC 1020 

ACAAATTAGA CATTACGTCA TAATCCGATT TATTTATAAA ATAAAATGCA AAGACTAAAA 1080 

AGAAGCTCCC ATTAATGAGG GCTTCTTTTT TTGTTTATTT AGAATAAACT TTATGGGTAT 114 0 

CCTTCTCATC ATTTTCAAGA CTTGAAAGAT TTGTAGCTTG AATAATATAT TTAGGACGTG 1200 

CCTTAACTTC ATAATATATC CTGCCAATAT ATTCACCTAC AACACCAATT GAAATTAACT 12 6 0 

GTATGCCGCC TAATAATAAA ATAGCTGCAA TCGTTGAAAA ATATCCCGGA ATATTAACAC 132 0 

CAGATATCAT AATATTGATG GAATAAATAG A 13 51 
(2) INFORMATION FOR SEQ ID NO: 524: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 433 base pairs 

(B) TYPE: nucleic acid 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 524: 

TTGTTGTCAG TTTAACGCAA CGTGTTACCG ACTTTTCAAG TACACAAcAA TATATATGTC 60

TATTCCATTT TTAGCCCcTG CCATTTTCAT CATTGGTGGT ATTATGTTGT TTATTTCAAC 12 0 

ATTTAATAGC TTAGATGAAA CTGCTGAAAA TAATAACAAA ATAAAGAAAC TAATGATTAA 180 

AGGACTTATC ATTATTAACA TTTCATTTAT CGTTATGATG GTTTTAACAC CATATTGGTA 240

CTTGTATTTA ATCGTCTATC TTATTTTCTT GTTGTTTTTA TTGTGGCAAA AGGTTTATAA 3 00 

ATTTTAATAC CAAAACTATT AAACACTTCT GATATTCTTA GTTCAAAATA TCAGAAGTGT 360 

TTTATAGTGT TATCTAGTTC AGATAAATAT TTCCTTACTT AAAAAAACGC CCTCCTCTTA 420

TTTTGACCCC nAT 433 



(2) INFORMATION FOR SEQ ID NO: 525: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1845 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 525: 



CCCCCACTAT GATATGCTTG GCCTATTGCG TCAATCCCCT TATATTTCGG CAATAGAGAT 60

GGATGTATAT TCAATATTTT ACCTTCAAAT GAAGCTAATA AGTCTGGACC AATTAGACGC 120

ATGTAGCCAG CTAGAATAAT CCACTCTACC TTATCTTCAT TTAATAATGT TACTAAATGT 180

TGTTCATACG CTGCTTTTGA ATCAAATTGT TTTGGTTCAT TAATATAAAC AGGAATATCG 240

TGCTTTTTTG CTCTATCTAT ACAAAACGCA TTTTGATGAT CCGTATATAG CGCCGTAACT 300

TCAATATTTT CAAGTTTTCC TGATTCAACA TGCTCAACTA TATTTkCAAA GTTACTTCCT 360

GAACCTGATG CAAAAATCGC AATTTTAACC ATTGTTATAC CCCCAACAAT TCAATTGCAG 420

TTGACTCATT TTTCACAATA TGACCAATTT GATAGGCTTC CACATTTTGT TCTGCTAAAA 480

TCTTCAAAGC GCGTGATACA TCTTTTTCAT CAACGATAAC CGTATAGCCA ATACCCATAT 540

TAAAAATGTT ATACATTTCA TTTGTGTCTA TATTGCCTTG TTGTTGTAAC CAATCAAATA 600

TTTTTGGTGT TGGAAATGAT GTAGTATCAA TTCTAGCAGC ATATCCGGCT GGCAATGCAC 660

GTGGAATATT TTCATAAAAA CCTCCACCAG TAATATGATT CATTGCCTTA ATAGAAACTT 720

CTTTTTTCAA AGCAAGTACA GGTTTGACAT ATAATTTAGT TGGTTCTAAA AAGACATCGA 780
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GCACTAAACT 


GTATCCATTT GAATGAATGC CACTTGACGC AAGCCCTATA ACAACTTGTC 900

CCTCTTTCAC TTCTGAACCA TCTACATAGT CATCCTTTTC AACTGCTCCA ACAGCAAATC 960

CAGCTACATC ATATTCGCCT TCGTGATACA TTTCACCCAT TTCAGCAGTC TCTCCACCGA 1020

TAAGTGCAGT ATTCGTTTCA ACACATGCAT CACTAATACC TTTAACAATT TGTTCAATAA 1080

CTTCAGGAAC AACTTTGTTT GTAGCAATGT AATCTAAAAA ATATAATGGT TCTGCACCTG 1140

TCGTTAAAAT ATCATTAACA CACATTGCGA CTGCATCGAT ACCTATCGAA TCATGTTTAC 1200

CATAGTCGAT AGCTAGTTTT AATTTCGTAC CTACTCCGTC TGTTCCAGAA ACTAAAACTG 1260

GCGCTGtCAT ATTTAATTGT GATAAATCAA ATGTAGCACC GAATCCACCT AAACCACCGA 1320

TAACTTCTTT ACGCATCGTA CGtTTAACaw GaCtAGaCaT TCtTTCtACA GCTTCATAAC 1380

CAGCATGAAT ATTTACACCA GATTGTTCAT ATGCTTTAGA CATTTAAATT TCCCTCGCTA 1440

TCAAAAAAGT GTTTGTTTTT AGAAATATAT TGCTTTTGTC GATGACTTAA ATGCGCTTTA 1500

TAATTTGCTT V i-k X i-vf-i * \ — <-» TAACCCTGCA GGATAATCTC CAGTGAAACT TTCTACACAT 1560

AAGCCACTAT ATGGCGCGTC ATAATCTAAA CCAATTGATT CAATTAACCC ATCTACAGAT 1620

AGATATGCTA ATGAATCAGC GCCAATATAA TCTTTAATTT CTTCAGGTGA TTTGCTTGCA 1680

GAAATTAATT CTGCCGTAGT TGAAACATCG ATTCCGTAAA AACTTGGAAA CATAAATTCC 1740

GGTGATGCTA TACGCACATG CACTTkATTT GCACCAGAAT CkTTTAACAT TTTCACAATG 1800

CGTCGAATTG TCGTACCGCG nACAATGGAA TCATCAACAA GACTG 1845



(2) INFORMATION FOR SEQ ID NO: 526: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1141 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 526:

TCATAAATnG TAAATGTTAA TCGTCATATA ATATTAATTT ACAACACCAT TTTGGTTATT 60 

TGAAGCTTGT GGCGCTTGTT GTGTGCCACC TTGATTTTGA TTTGAGTTTT GATCTGTAGC 12 0 

AGGTTGTTGT TGATTGCTGG AmtCACTGTT ATTAGTTGAm TCACTGTTCT CGTTAGATGT 180 

CTTATCTTTA TCTGTCGTAT CATTTTGTTC TTTTCTCAAT AAACTATTAT CTAAAGGCGT 24 0 

TAATGGTATT AATGAACCAT AATGATTAAT GACACGTTGA TCTAAGAAAT CATTTTTATC 3 00 

ATTAATAGGT GATAATTCTA AGTCTTTACG AAGTAAGTTT GCATATTTTT GAATGCTTTC 360 
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10 



AGTTTTAATT TCAACATCAT TTGTTaAGTA CATTTTTGCT AAAGCTTTAA TTTCAGAGTT 4 80 

AGTTAAATTA TGCTTTGCAT TTTTACCTAC AATTTGAATC ACGTTATCAA GTTTATCAAT 54 0 

AGAATCAACT TCCTGTGCTT TTTGGAATAA AATCTTAATT AATTCCATTT GACGTTGTCC 600 

ACGTTTTAAG TCTGAATCAT GATGTCTAGT TCTAGCAACT GCTAAAGCCT CATCACCATT 66 0 

TAATTTTTGG TACCCTTTTT TAATTTTAAT CTTACCAGTA TCATCTGTGT TAGGTTCATT 72 0 

TAAGTCGTAT GGCACATCAT AGTATATGCC ACCAAGCTCA TTTACAGCCT CGACAAATGC 78 0 

TTTCATATTG ACTCTCACAT AATAATCAAC AGGTACATTC ATGGTAGCTT CTACCGAATC 84 0 

CATTGCGGCA ATTGGACCAC CATATGCATG TGCATGGGTA ATCTTATCGT AATAGCCAAC 900 

TTTAGGAATG TAGCTGATAG TATCACGTGG AATACTAAGC ATTCTAATTT GATGTTTTGA 96 0 

TTGATTAAAA GTAGTTAAAA TCATAGCGTC TGATCTAGAG TGTTCAGCAT CCTGTCCTTT 102 0 

TTTTCTTCTT CCATCGTTAT CATCGATACC TAAGAAAAGA ATAGAGATAG GTTGTTCTTC 1080

GGGATGACTT TATTATCTCT TAAGTTGGAT TGACGTTAGC ATTTTTGCTG TCTTGAGAAG 114 0 

A 1141 
(2) INFORMATION FOR SEQ ID NO: 527: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1565 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS : double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 527:

GCACCATTAG CATTGGGCnC ACTGATTGGA GTAnCAGTTG TTGAAAATTC GGCGCCAACA 6 0 

AGTAAACmGG CmCaGGCaGC mATamCCCCa TATTATAcTT ATAATGGTTA TATTGGTAAT 120 

AATGCTAATT TTATTCTGGA TAAGAATTTT ATTAACGCGA TTAAGTATGA TAATGTGAAA 180

TTCAATGGTA TTAAATTAGC TAAAACGAAT ACGATAAAAA AAGTAGAGAA ATATGATCAA 24 0 

ACTTTTAAAG GTGTTTCTGC AAAAGGAAAC GAAGCAAGTC AATTGCAATT TGTAGTTAAA 3 00 

AATAATATTT CATTAAAAGA TATCCAAAAA GCTTATGGCA AGGACTTGAA AAAAGAAAAT 36 0 

GGTAAAACAA AGGAAGCTGA TAGCGGTATT TTTTACTATC AAAATGCTAA AAAGACATTA 42 0 

GGCATCTGGT TTGTCGTTGA TCATAATAGA GTTGTCGAAG TAACAGTTGG ACATACACCA 4 80 

TACAAAACAA GTAAATAAAA TAATGGCATA TTAAGGCTAG AGTGTGAGGA GTGATACCGC 54 0 

ATTCTAGTCT TTTTTATTAA ATAATAACGA TTATTGCGTC TTACATAGTT GTTTGAAATT 6 00 
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GATTAAGTAT 


ATAGAGCACT ATTTTGTATT TGTTAATATT TTCACAAAAA TAAAGCCTTG 720

ATAAATTTTA AAATATAATT AAGCTCAATT TTTAAAATTT TATTTAGCTA CAGATAACAT 780

TTTTAAAAAA GAAAAGAATC AATAAATAAA ATCAACGAAC AAAAAGTATA GAAATAAATA 840

GAAATAATCA TTTACTTTTC TGAAAAATTA AATTAATATT TTATTTATAA GCTGTTTTTA 900

AGATTTCAGG AGGAATGAAA TGTGaGGAAA TTTTCAAGAT ATGCATTTAC AAGTATGGCA 960

GCATTAACCT TGTTGAGCAC TTTATCACCA GCAGCATTAG CGATTGATTC AAAAAATAAA 1020

CCAGCTAATT CTGATATTAA ATTTGAGGTG ACTCAAAAGA GTGATGCGGT CAAAGCATTA 1080

AAAGAATTGC CTAAATCCGA AAATGTAAAA AATATTTATC AAGATTACGC TGTTACTGAT 1140

GTAAAAACTG ATAAAAAAGG ATTTACGCAT TATACATTGC AACCGAGTGT TGATGGTGTT 1200

CATGCACCTG aCAAAGAAGT GAAAGTACAC GCAGACAAAT CAGGAAAAGT CGTTTTAATC 1260

AATGGGGATA CTGATGCGAA GAAAGTAAAG CCAACGAATA AAGTGACATT AAGTAAAGAT 1320

GACGCAGCCG ACAAAGCATT TAAAGCAGTT AAGATTGATA AGAaTAAAGC GAAAAATCTT 1380

aAAGATAAAG TCATTAAAGA rAACAAAGTT GAAATCGATG GTGACAGTAA TAAATACGTT 1440

TATAATGTTG AGTTAATTAC AGTGACACCA GAAATTTCAC ATTGGAAAGT TAAAATTGAT 1500

GCTCAAACTG GCGAAATTTT AGAAAAAATG AACTTAGTTA AAGAAGCTGC AGAAACTGGT 1560

AAAGG 1565
(2) INFORMATION FOR SEQ ID NO: 528:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2870 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 528:

GGATAGGTAA ATCCATTCAA AATTGGTTGT ACTCATTTTA AATCATATTT TTTAATATAG 60 

AAACACATAA TTAAAGGAGT GATGATATTA ATGAATAAAC TTCGAGACAC TACTTTTCTA 12 0 

TCATATTTAT TTACTATCAT ATTGTGGGGA TCTGCATTTC CAATGATAAA GATTGCGTTA 180

AATGATTTTA GTGCAGAATC ATTGTCGGCA TTTCGTTTAA TTTTAGCAAC AATAATTCTC 24 0 

TTGCCGTTCG TAATTATAAA GAAATTGCCT ACCCCTGAAC TAAGAGATAT CCCTGTTATT 3 00 

TTTATTTTAG GATTTTGTGG ATTTGTGATA TATCACACAG CTTTAAATTT TGGTGAAaCT 3 60 

—rnr^-^A^.TG rA^r^TATAtC TGGrATTCyA GTCTCTACAA CGCCTATTTT TTCTAGTGCT 4 20 
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GCATTTATAG GAATATCCAT TATTTCAATA 
AGTAAAGATG ATTACACAAC TATTAATGTA 540

TTAGGTGTTT TTATTATTTT ACTTGCATCT TTTAGTGAAA GTTTGTATTT TACTTTCCAG 600

AAAAAATACA TAGAAAAATA CGGCTTCATC GCTTTCACAC TATaTACAAT AATGGCAAGC 660

TCACCATTTA TGCTTATTTT TATTCCTGAA ATCATCAACG ATATACACGG CGCCACTTTT 720

ACATCAATAG TATCGGTACT TTATTTAGCT ATATTCCCTA CTATAATTCC ATACGTTTTG 780

CTTGCTTATA TTGTGAAGTC AGTTGGTGTC TCTGATGCAA CAATGTCTCT TTATTTAACA 840

CCTATCGTTT CTTTATTATT ATCTTATCTG TTATTAGACG AGCTACCAAC AACCCTTGCT 900

ATTATAGGCG GAATTATCAC TCTACTAGGC GTTAGTTTAA GTAACTTCTT TCAAAATACA 960

TAATTATTCC AAGTCCCGCA CCTCAGAATC CAAAAACATT CGAGTGATAA AATTTTAAAA 1020

ATCAAAAATA TAAAAATGAT CTAATTTCGT CAAATTTACC AATATAAATA CTAATATTTG 1080

CAATTCACAA AGGGGTATAG TCTGAGTGTA TTCTAATACG AAAGGACTTG GTGGATATGT 1140

ATTACAGTTA TGGAAATTAT GAAGCATTTG CGCGCCCTAA AAAACCTGAA AATGTAGAAA 1200

ACAAATCCGC TTACTTAATC GGATCTGGTC TAGCTTCACT TGCTGCAGCT TGTTTTTTAA 1260

TAAGGGATGG TCAAATGGAA GGTTCGAAGA TTCATATTTT AGAAGAGTTA CCTAAAGCAG 1320

GTGGTAGTCT TGATGGTGAA AATATGCCTT TAAAAGGCTA TGTTGTCCGC GGTGGTCGTG 1380

AAATGGAGAA CCACTTTGAA TGTTTGTGGG ACTTATTCAG ATCTATCCCT TCATTAGAAA 1440

TCGATAACGC GTCTGTATTA GATGAGTTCT ATTGGCTAAA CAAAGAAGAC CCTAACTATT 1500

CTCGCTGTCG TGTTATTGAG AAACAGGGTC AACGTTTAGT CACAGACGGA GACTTCACTT 1560

TGACTAAAAC GGCGATTAAA GAAATTTTAG ATTTATGCTT AACGAATGAA GAAGATTTAG 1620

ATGATGTCAA AATAACAGAT GTATTTTCCG ATGACTTCTT TAATTCAAAC TTTTGGATTT 1680

ACTGGAAAAC GATGTTTGCA TTTGAACCGT GGCATTCTGC AATGGAAATG CGTCGCTATC 1740

TAATGCGATT CGTTCATCAT ATTAGTGGTC TCGCAGACTT TTCAGCTTTA AAATTCACTA 1800

AATATAATCA ATATGAATCT TTAGTATTAC CTATGGTTGA ATATTTAAAA TCGCATGGGG 1860

TTCAATTTGA ATACGATGTA AAAGTCGAAG ATATTAAAAT AGATGTTACG ACAAGTCAAA 1920

AAATTGCCCG AGAAATATTA ATTGACCGTA ATGGTAATGC AGAATCTATT AAACTGACTA 1980

TAAACGATCT TGTCTTTGTG ACAAACGGTA GTATTACAGA AAGCTCTACT TATGGTGATA 2040

ATGATACACC AGCGCCACCA ACTGACGAAT TAGGTGGTAG TTGGACACTA TGGAAAAATT 2100

TAGCGCGACA AAGTCCTGAA TTTGGTAATC CTGATAAGTT TTGCCAAAAT ATTCCTAAAA 2160

AAAGTTGGTT TGTTTCAGCA ACTTCTACAA CAAACAATAA AGAGATTATC GATACAATAG 2220
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TCAATGATTC TGCATGGCAA ATGAGTTTTA CAATCAATCG TCAGCAACAG TTTAAAGACC 234 0 

AACCTGAAAA TGAAATATCT ACATGGATTT ATGCCTTATA TTCAGATGTA AACGGCGATT 2400 

ATATTAAAAA GCCAATTACA GAATGTAGTG GTAATGAAAT ATGCCAAGAA TGGCTGTATC 2460

ACTTAGGTGT ATCAACTGAC AAAATTGAAG ACTTAGCAAA ACATGCATCT AATACGATTC 252 0 

CTGTTTATAT GCCATATATC ACATCTTATT TCATGACGCG TGCTATCGGC GACAGACCTT 25 8 0 

TAGTCGTCCC GCATCAATCT CAGAACTTAG CATTTATTGG TAACTTTGCA GAAACAGAGC 264 0 

GAGACACTGT ATTTACAACA GAATATTCGG TTCGTACTGC CATGGAAGCT GTTTATCAAT 2700 

TACTAAATAT AGATCGTGGT ATTCCAGAAG TCATCAATAG TCCATTTGAT CTTCGCGTCT 27 60 

TAATGGATGC CATATACGAA CTGAATGACC ACCAAGATTT GCGTGAGATT ACTAAAGATT 2820

CGAAAATGCA AAAACTCGCA TTAGCAGGAT TCCTTAAAAA GATAAAAGGT 2 870 
(2) INFORMATION FOR SEQ ID NO: 529:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 3105 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 529:

CnGTTTAGCT GCAAAAAGTA AATAACGACA CTGTATTTGG AATTTTGCAA TTAGAAACAC 60 

TTTTGGGTGA CATTAACTCA ATTTTCAGCG AGATTGAAAG CGAATACAAA ATGTCTAGAG 120 

AAGAAATTTT AATTTTACTA ACTTTATGGC AAAAAGGTTC TATGACGCTT AAAGAAATGG 180 

ACAGATTTGT TGAAGTTAAA CCGTATAAGC GTACGAGAAC GTATAATAAT TTAGTTGAAT 24 0 

TAGAATGGAT TTACAAAGAG CGTCCTGTTG ACGATGAAAG AACAGTTATT ATTCATTTCA 30 0 

ATGAAAAGTT ACAACAAGAG AAAGTAGAGT TGTTGAATTT CATCAGTGAT GCGATTGCAA 36 0 

GTAGAGCAAC AGCAATGCAA AATAGTTTAA ACGCAATTAT TGCTGTGTAA GTTTAATAGC 42 0 

ATAAAAAGAG GTTTTCATTA AGTTGAAAAC CTCTTTTTGT TGTTGGCATT AATTTTTCAA 4 80 

ATGTTGACTA CTCAATCCTA AATTATAAAT AGTATAGCGC aCAAaTGCTT AAGAAATTTT 54 0 

TTCTATGGCA CAAATGAATG GAGCATGATT ACGTTGGTTT AAAAATTGAT ATTGCAAAAC 6 00 

TTGCGCATGC TTTTGATCCA AAGTACTCAA GTAATCAAGC AATGCATGCT TCTCAATTTG 66 0 

TCCTTCGCTA TGACCATGAT ATATAACAAG TACAATAATA CCTTCAATTG ACATTAATGA 72 0 

TAGCAATGAA TTAATAGCTT GGATTGTCGT GTCAGGCTTT GTCACGATAG ATTTATCACC 780 



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TATATGATGT TCAATATTTT CATGTCCATC TTTTATTAAA GAAACATGAT TGAAATCCTT 90 0 

AACTTTATCA CGTGTATTTT CCAAAGCTAA ATCTTGAATG TCGAAACCAT AAACATGTCC 96 0 

TTCTGGTACT TGTTCGGCTA AAAATAAAGT GTCATTGCCX} TTACCGCAAG TTGCGTCTAC 102 0 

AACAATACTT TCTGGTGTTA TATGTTGTTT AATAAGTGTT TTTGAAAAAG GGAGTATACG 10 8 0 

TTCTAATTTC ATTGCTTCAC CTTAGACTTG TAACGCAAGC CTTGATAAGA ATTTCTACGT 114 0 

GCTAATTCAG CATCGATGCC ATTTAATACT TCCCATTTAT TAACACTCCA CATTGGACCT 12 00 

ACGATGATAT CTATTGGACC ATCACCGGTA ATTCGGTGAA CGATCATTTC AGGGGGAATC 12 6 0 

ACTTCTAATT GGTCCACAAC TAGGTTTGTG TACTCTTCTT GAGTCATAAA AGTTAATAAA 132 0 

CCTTTATCGT ATTGTTTTAC CATCGGTGTA CCTTTTAACA AATGAAGTAA ATGAATTTTA 13 8 0 

ATACCTTGTA CATCCATTTG TGCAcTTCTT TGGCAGTAGC CATCATCATG TCATAGTCTT 14 4 0 

CGCCAGGTAA GCCATTAATG ATGTGTGTAC ATACATTGAT ATTATGCTTA CGTAATTTTG 1500

CCACACCATC ATAATAAGTT TTCATATCAT GGGCACGATT GATTAAATCA GATGTTGACT 1560 

GATGGATTGT TTGTAGTCCT AATTCAACCC ATAAGTATGT TCGTTGATTC AAATCTGCTA 162 0 

AATATTCGAC AACATCGTCT GGTAGACAGT CAGGACGCGT ACCAATAGAT AATCCCACAA 1680

CACCCGGTTC TTTAAGTACA GGTTCGAATT TTTCTTTTAA TACTTCAACC GGTGCATGTG 174 0 

TATTTGTAAA TGCCTGAAAA TAAGCAATAT ATTTTCCTTC GTGCCATTTC TCATGCATCT 1800 

TTTCTTTAAT TTCTTTAAAT TGTACTGCGA TTGAATCTGC ACGATTACCT GCAAAGTCTC 18 6 0 

CGCTACCTGC AGCAGAACAA AATGTACATC CACCATGTGC TACAGTGCCA TCGCGGTTAG 192 0 

GACAGTCAAA CCCGCCATCC AATGCAACTT TAAATATTTT TTGTCCAAAT TTATTTTTTA 1980 

AATGGTAATT CCATGTGTGA TAACGTTTGT TTTCAAAAGC GTATTGGAAA TGATTGCCCA 2 04 0 

TATGTCATTT TCCTTTCTAT AAAAAAAGAG TTCTAAGTAC AGATTTTAAC ATATTTTAAT 210 0 

GTTATAGTGT TTATTATAGT TTGACAAAAA AGAGAGAGGA ACTATGAAAT ATGAATATAC 216 0 

CTAAATCAGT CTGGTGGCTA GTAATTGGCA TGGCGTTAAA TATTACTGGT TCCAGTTTTT 22 2 0 

TGTGGCCTTT AAATACAATT TATATGAAAC AAGAACTTGG AAAAAGTTTA ACTGTTGCTG 22 8 0 

GTTTAGTGCT AATGATAAAT TCATTTGGCA TGGTTATTGG AAACTTATTA GGTGGTTCAC 2340

TATTTGATAA ATTAGGTGGA TACAAGACGA TTTTAATTGG AACTTTCACT TGTCTTTGTA 2400 

GTACAACGCT ACTTAATTTC TTTCACGGGT GGCCTTGGTA TGCTGTATGG CTTGTAATGT 24 6 0 

TAGGGTTTGG TGGCGGAATG ATTATTCCTG CGATATACGC TATGGCTGGA GCAGTGTGGC 2 52 0 

CAAATGGCGG AAGACAAACG TTTAATGCGA TATACTTAGC GCAAAATATT GGTGTGGCTG 25 8 0 
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ATCTTATTAT GTATGTTGTG TTTGCGCTTG TCGCGGTAAC GCAATTTAAT ATTGAAATTA 2 7 00 

ATGCGAAAGT TAAATATCCA ACTCATTTAG ATATTACrGG TAAAAAGAAT AAAGCAAGAT 2760 

TTATTTCATT AGTACTAATT TGTGCAATGT TTGCAATTTG TTGGGTTGCA TATATTCAAT 2820

GGGAGTCTAC AATCGCTTCA TTTACACAAT CTATTAATAT TTCAATGGCA CAATATAGTG 2 8 80 

mTATOGAC AATTAACGGA ATAATGATTT TAGTAGCACA ACCATTAATT AAACCGATTC 2 94 0 

TCTATCTGTT AAAAGGAAAC TTAAAGAAGC AAATGTTTGT CGGCATCATC ATTTTTATGT 3 000 

TGTCGTTCTT TGTCACGAGT TTTGCCGAAA ACTTTACAAT ATTTGyTGTC GGTATGATTA 3 060 

TTTTAACTTT TGGAGaATGT TTGTATGGCC AGCAGTTCCA ACTAT 3105 

(2) INFORMATION FOR SEQ ID NO: 530:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 5532 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 530:

TAATGATTAA ACCAGATGCA GTACAAAGAA ATCTAATTGG TGAAGTAATT TCAAGAATTG 60 

AAAGAAAAGG ACTAAAACTT GTCGGTGGTA AATTAATGCA AGTACCAATG GAACTTGCTG 120 

AAACACATTA TGGTGAACAC CAAGGTAAAC CATTTTATAA TGATTTAATT TCATTTATTA 18 0 

CATCAGCACC AGTGTTCGCA ATGGTAGTTG AAGGTGAAGA TGCAGTTAAT GTATCTAGAC 24 0 

ATATTATTGG CAGCACCAAT CCTTCAGAAG CTTCACCAGG ATCAATTAGA GGTGATTTAG 300 

GTTTAACTGT TGGTAGAAAT ATCATTCACG GTTCAGATTC ATTAGAGTCT GCTGAACGTG 36 0 

AAATTAATCT ATGGTTTAAT GAAAATGAAA TTACTAGCTA TGCTTCACCA CGTGATGCAT 4 20 

40 GGTTATATGA ATAAAATATA AACTGTAAAC CTTTACGATT TATTTATAAA GGTAGAAAGG 4 80 

GTTTTGTTAT GTGGTTAGTC ATTATGaTTA TACATAACAA GGCCCGTTTT TTATGTTGTA 54 0 

GTAAATTACT TGAAAAATTT TATAGTTTTT JcGGTAAcACG TAtTaAAAaG AGAGGAATAT 60 0 

TCTTTATCAA ATGAAACTAA ACAGAGAGAA GGGGTTGTTA AAATGAAgAA TATTATTTCG 66 0 

ATTATTTkGG GGATTTTAAT GTTCTTAAAA TTAATGGAAT TACTATATGG TGCTATATTT 72 0 

TTAGATAAAC CACTTAATCC TATAACAAAA ATTATTTTTA TACTGACTCT CATTTATATT 7 80 

TTTTATGTAT TAGTAAAAGA ATTGATTATA TTTTTGaAGT CAAAGTATAA CAAAAGCGCT 84 0 

TAACATATGT ATATTTTAAT ATCATAATTT TTTTAAACGG ACTGATTAAC TTTATTAATA 90 0 
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GATACGATTA TATTAAAACG GCTAATCATT TTTAATTAAT GATTATATGA TGCAACTGTT 102 0 

TAGAAATTCA TGATACTTTT CTACAGACGA ATATATTATA ATTAATTTTA GTTCGTTTAA 1080 

^ TATTAAGATA ATTCTGACAT TTAAAATGAG ATGTCATCCA mTCTTAAT TGAGCTTGAA 114 0 

AACAAACATT TATGAATGCA CAATGAATAT GATAAGATTA ACAACATATT ATAATGTTAT 1200 

CGTGGAAGTA TGAAAGGAGC GAGTGTGTAT GAGATACCTA ACATCAGGAG AATCACATGG 1260 

ACCTCAATTA ACAGTTATTG TTGAAGGTGT ACCTGCAAAT ATAGAAATTA AGGTTGAGGA 13 20 

TATTAATAAA GAAATGTTTA AGCGTCAAGG CGGTTACGGA CGTGGACGTC GTATGCAAAT 13 80 

TGAGAAAGAT ACAGTAGAAA TAGTATCAGG CGTTAGAAAT GGTTATACAT TAGGTAGTCC 14 4 0 

AATTACTATG GTTGTAACCA ATGATGACTT TACGCATTGG AGAAAAATTA TGGGAGCAGC 1500 

TCCAATAAGT GAAGAAGAAC GTGAAAATAT GAAACGTACT ATTACAAAAC CAAGACCTGG 1560 

20 TCATGCAGAT TTGGTTGGAG GTATGAAATA TAATCATCGT GATTTACGAA ATGTGCTAGA 162 0 

GCGATCATCT GCTAGAGAAA CAGCAGCTCG AGTTGCAGTC GGTGCCTTAT GTAAAGTGTT 168 0 

ATTACAACAG TTAGATATCG ATATATACAG TCGTGTTGTT GAAATAGGTG GAATTAAAGA 174 0 

TAAAGATTTT TATGATTCAG AAACATTTAA AGCAAATCTT GATCGTAATG ATGTTCGTGT 18 0 0 

AATTGATGAC AGTATCGCAC AAGCAATGCG AGATAAAATT GACGAaGCTA AAAATGAAGG 1860 

AGATTCAATT GGCGGTGTCG TTCAAGTTGT AGTTGAAAAT ATGCCTGTTG GTGTAGGTAG 192 0 

TTATGTGCAT TATGATCGTA AGTTAGATGG TAAGATTGCA CAAGGTGTTG TCAGCATAAA 198 0 

TGCTTTTAAA GGTGTAAGCT TTGGTGAAGG ATTTAAAGCA GCTGAAAAGC CAGGTAGTGA 204 0 

GATTCAAGAT GAAATTCTAT ATAATAGTGA AATTGGTTAT TATCGTGGAT CTAATCACTT 2100 

AGGTGGTTTA GAAGGCGGTA TGTCAAATGG AATGCCAATT ATCGTTAATG GTGTAATGAA 216 0 

ACCAATTCCA ACGTTATATA AACCATTAAA TTCAGTAGAC ATTAATACTA AAGAAGACTT 222 0 

40 TAAAGCAACA ATTGAACGTT CTGATAGTTG TGCTGTTCCT GCAGCAAGTA TCGTCTGCGA 22 80 

ACATGTCGTA GCATTTGAAA TAGCAAAAGC ATTATTGGAA GAATTCCAAT CAAATCATAT 23 4 0 

TGAGCAACTT AAACAACAAA TTATTGAGCG CAGACAATTA AATATTGAGT TTTAACAACA 24 0 0 

AGAACAATTG AGGTGTAATC ATGAAATTAC AAACAACATA CCCTTCAAAT AATTATCCAA 24 6 0 

TATATGTTGA ACACGGTGCA ATTGACCATA TTAGCACGTA TATTGATCAG TTTGATCAAA 252 0 

GTTTTATATT AATTGACGAG CATGTAAATC AATATTTTGC TGATAAATTT GATGATATTT 25 80 

TATCATATGA AAATGTACAT AAAGTTATTA TTCCAGCTGG TGAAAAGACG AAAACATTTG 264 0 

AGCAATATCA AGAAACATTA GAGTATATTT TATCCCATCA TGTAACTCGT AATACAGCAA 27 00 
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ACGAGGCGTG CACTTTATAC AAGTGCCAAC GACTATACTA GCGCATGATT CTAGTGTTGG 2820 

CGGTAAAGTG GGTATTAACT CAAAGCAAGG TAAAAACCTT ATCGGTGCAT TTTATCGTCC 28 80 

AACTGCTGTG ATTTATGATT TAGTCTTTTT AAAGACGTTA CCATTTGAGC AAATATTAAG 294 0 

TGGCTATGCA GAAGTTTATA AGCATGCGTT ATTGAATGGT GAATCAGCGA CGCAAGATAT 3000 

CGAACAGCAC TTTAAAGATA GAGAGATATT ACAGTCATTA AATGGTATGG ATAAATATAT 3 060 

TGCTAAAGGT ATTGAAACGA AGCTGGATAT TGTTATTGCA GATGAAAAAG AACAAGGTGT 3120 

ACGTAAATTT TTAAATTTAG GTCATACATT TGGTCATGCT GTTGAATACT ATCATAAAAT 3180 

ACCTCATGGT CATGCAGTGA TGGTTGGCAT TATCTATCAA TTTATAGTTG CGAATGCTTT 324 0 

GTTTGATTCT AAGCATGATA TTAATCATTA TATTCAATAT TTAATACAAC TCGGCTATCC 3 3 00 

TTTAGACATG ATAACTGACT TGGATTTTGA AACGTTATAC CAATATATGC TAAGTGATAA 33 60 

AAAGAATGAT AAGCAAGGTG TACAAATGGT CTTGATTAGA CAATTTGGAG ATATCGTTGT 34 20 

ACAACATGTT GATCAACTAA CATTACAACA TGCATGTGAA CAATTAAAAA CATATTTTAA 3480

GTAGGTGAAT GAAATGGTAA ATGAACAAAT CATTGATATT TCAGGTCCGT TAAAGGGCGA 3 54 0 

AATAGAAGTG CCGGGCGATA AGTCAATGAC ACACCGTGCA ATCATGTTGG CGTCGCTAGC 3 600 

TGAAGGTGTA TCTACTATAT ATAAGCCACT ACTTGGCGAA GATTGTCGTC GTACGATGGA 3 660 

CATTTTCCGA CTGTTAGGTG TAGAAATCAA AGAAGATGAT GAAAAATTAG TTGTGACTTC 3 720 

CCCAGGATAT CAATCTTTTA ACACGCCACA TCAAGTATTG TATACAGGTA ATTCTGGTAC 3780 

GACAACACGA TTATTGGCAG GTTTGTTAAG TGGTTTAGGT ATTGAAAGTG TTTTGTCTGG 3840

CGATGTTTCA ATTGGTAAAA GGCCAATGGA TCGTGTCTTG AGACCATTGA AACTTATGGA 3900

TGCGAATATT GAAGGTATTG AAGATAATTA TACACCATTA ATTATTAAGC CATCTGTCAT 3 96 0 

AAAAGGTATA AATTATCAAA TGGAAGTTGC AAGTGCACAA GTAAAAAGTG CCATTTTATT 4020 

TGCAAGTTTG TTTTCTAAGG AACCGACCAT CATTAAAGAA TTAGATGTAA GTCGAAATCA 4 080 

TACTGAGACG ATGTTCAAAC ATTTTAATAT TCCAATTGAA GCAGAAGGGT TATCAATTAA 414 0 

TACAACCCCT GAAGCAATTC GATACATTAA ACCTGCAGAT TTTCATGTTC CTGGCGATAT 4 2 00 

TTCATCTGCA GCGTTCTTTA TTGTTGCAGC ACTTATCACA CCAGGAAGTG ATGTAACAAT 4 26 0 

TCATAATGTT GGAATCAATC CAACACGTTC AGGTATTATT GATATTGTTG AAAAAATGGG 4 320 

CGGTAATATC CAACTTTTCA ATCAAACAAC TGGTGCTGAA CCTACTGCTT CTATTCGTAT 4 380 

TCAATACACA CCAATGCTTC AACCAATAAC AATCGAAGGA GAATTAGTTC CAAAAGCAAT 44 4 0 

TGATGAACTG CCTGTAATAG CATTACTTTG TACACAAGCA GTTGGCACGA GTACAATTAA 4 50 0 
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AAACTTGTTA GGGTTTGAAT 
TACAACCAAC TAATGATGGA TTGATTATTC ATCCGTCAGA 4620

ATTTAAAACA AATGCAACAG TTGATAGTTT AACTGATCAT CGAATAGGAA TGATGCTTGC 4680

AGTTGCTTCT CTACTTTCAA GCGAGCCTGT CAAAATCAAA CAATTTGATG CTGTAAATGT 4740

ATCATTTGCA GGATTTTTAC CAAAACTAAA GCTTTTAGAA AATGAGGGAT AATATAAAAT 4800

GGAAGATATC TATAAATTAA TAGACGATAT CAATCTACAA AAACTAGAAA ATTTAGACTC 4860

TCGTGTTAAT GAAGCAATAA CTACTGACAA CGATGACGCA TTATTTATTC TAGGAGAGAC 4920

ACTTTACAAT TTTGGATTAA TGCCaCAAGG TTTGGAAGTA TTCCGCGTGT TATATCACAA 4980

ATATCCAGAC GAAAGTGAAT TGCTGATTTA TTTTATTGAA GGTTTAATGT CTGAAAATCA 5040

AACTGACGAA GCGTTAGAAT ATTTAAGTTA TGTTGAACCA TCACCTGAAA AGTTGATGTT 5100

AGAAGCAGAT TTATATCAAC AAATTAATAT GATGGAAGTT GCTATTGATA AATTACAAGA 5160

AGCACTTGAA CTAGAGCCAA ATGATCCAAT AATCCATTTT GCATTGGCTG AAATGTTATA 5220

TTATGATGGT CAATATTTAC GTGCTACCTC TGAATACGAA ACCGTTTTAG AAACTGGTGA 5280

ATATCAAGTT AATGGTGTAA ACTTATTCTC TCGTATGGCA GATTGTAGTT TACAAAGTGG 5340

kAACTATAGT GATkCcGATt CgCTTATACG ATGrAATTAA TGAAGATGAA ATGACTTCAG 5400

AAGATTATCT CAAAAGAGCC ATTTCTnACG ATAAAAATGA CATCACTCAA GAAGCAATTA 5460

AAATAATGAC TACATTACTT TCTAAAGATC CTGATTATAT TCAAgGCTAC TTGTATTTAC 5520

aATCaTTATA TG 5532

(2) INFORMATION FOR SEQ ID NO: 531:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 942 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 531:

AATTGGGTTA TACTATAGGT AAATTTAAGG AGGTAAGAAA ATGGATAAAA AAGAATTAGC 60

GAAATTTATA GGCAATAAAA TCAGATACTA TAGAACCaAA TTGaACTTAA CTCAAGATCa 120

ACTTGGAGAA AAACTCmACa CTAAAAArGC TACTATTTCA AATTATGAGA CAGGGTACAG 180

AACTCCTAAA CAAGATGATT TGTTTGAAAT TGCTCATATT TTAAATATCA GTATCGATGA 240

TTTGTTTCCT ACAAGAAATA ATAAAAAAAA CGACATCACT TCCATATACA ACAAACTCAC 300

ACCTCCCCGC CAAGAAAACG TACTTAACTA CGCAAATGAG CAATTAGATG AACAGAATAA 360
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AACTGGTGCT GGCATAGGAG AAGAATTATA TGATGACATA TTGCATGAAG AAGTATTTTT 4 80 

TAAAGAAGAC GAAACGCCAT CAAATGCTGA TTTTTGTATT TTAGTTAATG GTGATTCAAT 54 0 

^ GGAACCTATG TTAAAACAAG GAACATACGC TTTTATTAAG AAAGAAGATT CTATTAAAGA 600 

TGGTACAATT GCACTCGTTG TATTAGATGG AGTAAGTCTT ATCAAGCGTG TAGATATATG 660 

CGAAGACTAT ATTAATTTGG TATCTCTAAA TCCGAAGTAT a^TaATATCA AAGTCGCTTC 72 0 

GTTTAGTAAT ATTAAAGTAA TGGGCAAAGT TGTATTGTGA TTAATAGCGC CTATATGGCA 780 

CTTTAATATA AAAGACGTCT ATTTCAGCAG TGTTTAAAAG GAGTTTATAA TGAAAATAAC 84 0 

TAATTGCAAA ATAAAAAAAG AAACTATAGT ATATGAAGTT TTAACTAGTG GTAATCAACC 900 

ATTCACTTAT GAGTTACCTA AAGATTTATC GTCACATAAT GC 942 

(2) INFORMATION FOR SEQ ID NO: 532: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 417 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 532: 
TGGnAAATGC AAACCAAAAT ATGaTCCTCG TGcAAGTTGA AGCGGGACGT TATGAAGAAT 60 

GGGTAAAGAA TGGTTATTTT AAACCGTCAG AAGATAAATC AAAAGAAACA TATACAATTG 12 0 

TTATCCCGCC ACCAAATGTA ACTGGTAAAT TACATTTAGG ACATGCATGG GATACGACTT 180 
TACAAGATAT CATTACACGT ATGAAACGTA TGCAAGGATA CGATACGTTA TACTTACCAG 24 0 

GTATGGATCA TGCTGGTATT GCGACACAGG CAAAGGTAGA AGCTAAATTA AATGAACAAG 300 
GAATAACTAG ATATGATCTT GGTCGTGAAA AGTTTTTAGA ACAGGCATGG GATTGGAAAG 36 0 

AAGAGTATGC GTCATTTATT CGTGCGCAAT GGGCTAAATT AGGTCTAGGT TTAGATT 417 
(2) INFORMATION FOR SEQ ID NO: 533: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 733 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 533: 
^A^^^TCAA^ rCGCATTTGT TTCCACTAAA ACAGTATGCC CACTTTCTAC TAAAGCGTGC 6 0 
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10 



ATTTTCATAC CATCCACCTC CATAATCATC TTAACGCGAA CATTTTGAAA GCGCAATCAA 180 

AAATCCACAA AATTGTAAAG GTTATTACAC TGACTTTTCC GAAAATTGTG GTAAAATATA 24 0 

ATTAAGAAAG AACAAGGAGG CACTTACTAT GATTACTTAC AAAAATATTT TAATCGCAGT 300 

TGACGGTTCA CATGAAGCGG AATGGGCATT TAACAGAGCA GTTGGTGTTG CTAAACGTAA 360 

CGATGCGAAG TTAACAATTG TGAATGTAAT TGATTCAAGA ACGTATTCTT CTTATGAAGT 4 20 

TTATGATGCT CAATTTACTG AAAAATCTAA GCATTTTGCA GAAGAATTAT TAAATGGTTA 480 

TAAAGAAGTA GCTACTAACG CTGGTGTTAA AGATGTAGAA ACGCGTCTAG AGTTTGGyTC 54 0 

TCyTAAATCT ATCATTCCTA AAAAGCTTGC ACATGAAATT AATGCAGACT TGATTATGAG 600 

TGGTACATCA GGCTTAAATG CCGTGGAAAg ATTTATTGTT GGTTCTGTAT CAGAATCTAT 660 

CGTTCGTCAT GCGCCATGTG ACGTGTTAGT TGTTCGTACT GAAGAGTTAC CAGCAGACTT 72 0 

20 CCAACCACAA GTT 73 3 
(2) INFORMATION FOR SEQ ID NO: 534: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6060 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 534: 

TGATGATCCT GAAGCGCTAT TGGATAATTA CAACACTGAA GATGTTGATG CACACAATTA 60 

CAATAATATA AATCATGTTA TTTTTGCCTG CGATGCGGGT ATGGGTTCTA GTGCAATGGG 12 0 

TGCArGCATG TTACGTAATA AATTTAAAAA GGCGGGCATT AATGATATTA CAGTTACAAA 180 

TACTGCGATT AATCAATTGC CAAAAGATGC TCAATTAGTT ATCACTCAGA AAAAACTAAC 24 0 

40 TGATCGTGCT ATTAAACAAA CACCAAATGC CATCCATATT TCAGTGGATA ATTTCCTTAA 3 00 

TTCACCAAGA TATGAAGAAC TTTTAAATAA TCTAAAAAAA GATGATCAAG CATAATAATT 360 

AAATAAATTA AAAAATGGAG GATACCGCCA TGTTATTGAG TACACGTGAA AAAGAAATGA 420 

TAGCCCtAtT GATTAAGTAC CACGGtCAAt ATATCACTAT ACACGACATT GCTCAGCAAC 48 0 

TTGCGGTGTC CTCTCGTACT ATTCACCGTG AATTAAAAGG TGTTGAAGCA TATTTAACTT 54 0 

CATTTTCATT AACTTTAGAA CGCGCAAACA AAAAAGGGcT ACGCATTGCT GGCACAGATT 60 0 

CTGATTTAAA CGATTTGAAG CAATCGATTG CACAACATCA AACCATTGAC TTATCTGTTG 66 0 

AAGAGCAGAA AGTAATTATT ATATACGCTT TGATACAAGC CAAGGAGCCA GTTAAACAAT 72 0 
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TAGAGCTTGA TTTAAATAAG TACCAACTAT CTTTATCTCG AAAGCGTGGC GAAGGCATTT 840 
ACTTGGTAGG TACTGAATCA AAGAAACGTG AATTTTTAAG TCAATTAATG GTGAATAACT 900 
TAAATAGTAC TAGCGTTTAT TCAGTAATTG AAAATCATTT TGTCTTTCAT TCATTAAATC 96 0 

AAATCCACAA AGACTTTGTT GACTTAGAGC GCATTTTTAA TGTTGAAAGA CtATTaATGG 1020 

ACTACCTAAG TGCCTTACCC TACCAACTTA CCGAJ^TCAAG TTATTTAACT TTAACTGTCC 10 80 

ATATCGTGCT CTCCATTTCA CGTATAAAAA ATGGAGAGtA TGTCGCATTA AACGATGATA 114 0 

TTTATGATTC TGTACAAAAC ACATTTGAAC AC/lAAGTaGc AAGCGAACTT GCTGATAAAC 12 00 

TTGGTCAAAT ATATGACGTC ACGTTTAATC AGGCAGAAAT TGCTTTCATT ACTATCCATT 1260 

TACGTGGAGC TAAACGAAAA AATCTTAATG ATACATCATT AAATAATCGT TGTGAAGAAA 13 20 

ACAAAATTAA AGCGTTTGTT AACAAAGTAG AAATGATTTC CGGTATGACA TTTGCAGATT 1380 

TGGATACTTT AGTAGATGGA CTGACGCTAC ACCTTAATCC TGCAATCAAT CGTTTGCAAG 14 4 0 

CTAATATCGA GACC-rAiAAT CCGTTAACAG ACATGATTAA GT7CAAATAT CCAAGACTAT 1500

TTGAAAATGT AAGATTAGCT TTAAATGATT GTTGGCCTGA TTTGATTTTT CCAGAGAGTG 1560 

AAATTGCTTT TATAGTtTTA CACTTTGGTG GCTCGATTAA AAACCAAGGT AATCGATTTT 162 0 

TAAACATATT AGTCGTTTGC AGCAGTGGTA TGGGAACTAG TCGTCTATTA TCAACTCGTC 16 80 

TAGAGCAAGT TTTTAGTGAG ATTGAGCGTA TTACACAAGC ATCAGTCAGC GATTTGAAGT 174 0 

CACTAGATTT AAGTCAATAT GATGGCATTA TTTCTACTGT GAATTTAGAC ATCGACTCCC 18 00 

CCTATTTAAC GGTAAACCCA TTATTACCAG ATAGTGATAT CAGTTATGTC GCACAGTTTT 1860

TAAATACAAA GTCTACGTTC CAAGAGACGC ATGATAAATC ATCAAACATG ATTGATAAGG 1920 

ATGATGTTCA TGTTGAAACG AAAGATGTTG ATGGCAACAC ATCTTTTGAA AATGAACAAA 1980 

CTTCATACTT AACTTCAGTT TTCGAAAAAC ATTTAAGTGA CGAAAAATCA GAACAATTAT 2 04 0 

TGCATCATAT GCGTTCGGGT TTAACTTTGC TTGATTCAGT GAAAATAGTT AGTACCGAAG 2100 

TTAAACAGTG GCAAACATAT ATCGCAGATT ATCTATATCA ATGCGATGTA ATAAACGATC 2160 

CAACGTCATT CGCTGAACTA CTAGAGCAAC GATTGATTGA CAATCCAGGA TGGATATTAA 222 0 

GTCCATATCC TGTTGCAATA CCACACCTAA GAGACAATAT GATTAAACAC CCTATGATTC 2 280 

TAATCACAGT TTTAGAAGAA CCGTTAACAT TGCCTAGTAT TCAAAATGAC AATCAAACAA 2 34 0 

TTAAATATAT GATTTCCATG TTTATTTCTG ACAATGATTT TATGGCATCA CTGGTAAGTG 24 0 0 

ACTTGTCCGA ATTTTTAAGT TTGAAATTAG AATCTATTGA TACTTTTATG GAAAATCCAC 24 6 0 

AGGAACTTGA AACATTATTA AGAAACAAAT TTTTAGAACG AATTAAAAAA CAATTTATTT 2 52 0 
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TAACAGCCAA AATGAAGCAA TTGAAAAAGC AGGTAAAGCC TTAGTTGATA GTGGTGCTGT 264 0 

AACAGATGCT TATATTCAAG CAATGAAAGA TCGTGAGCAA GTCGTATCAA CATTTATGGG 2700 

AAATGGCTTA GCAATTCCTC ATGGCACAGA TGAAGCTAAA ACAAATGTGA TTCACTCAGG 27 6 0 

TTTAACATTA TTACAAATCC CTGAAGGCGT TGACTGGGAT GGCGAAGTAG TTAAAGTTGT 282 0 

CGTGGGAATT GCTGGTAAAG ATGGCGAACA TTTAGACTTG TTATCTAAAA TTGCAATTAC 2 880 

ATTTAGCGAA GAAGAAAATG TGGATCGTAT CGTTCAAGCA AAATCTGCAG AAGAAATTAA 294 0 

ACAAGTATTC GAGGAGGCAG ATGCATAATG AAAGCAGTTC ACTTTGGTGC TGGTAACATA 3 000 

^5 GGTCGTGGTT TCATTGGTTA TATTCTgCAG ACAACAATGT TAAAGTAACA TTTGCAGACG 306 0 

TCAATGAAGA AATCATTAAT GCTTTAGCTC ATGATCATCA ATACGATGTT ATTTTAGCTG 312 0 

ATGAGTCTAA AACAACGACG CGCGTGAAtA ATGCTGATGC AATTAATTCA ATGCAACCTT 318 0 

20 CTGAAGCGTT GAAACAAGCA ATTCTAGAAG CTGATATTAT TACAACAGCT GTTGGTGTTA 324 0 

ACATACTACC TATTATTGCT AAATCTTTTG CGCCTTTCTT AAAAGAAAAA ACAAACCATG 3 3 00 

TTAATATTGT TGCTTGTGAG AATGCTATTA TGGCAACTGA TACATTGAAA AAAGCAGTAC 3 360 

TTGATATTAC TGGCCCTCTT GGTAACnaTA TTCATTTTGC TAACTCAGCA GTTGATAGAA 342 0 

TTGTACCATT ACAAAAGAAT GAAAATATAT TAGACGTTAT GGTTGAGCCA TTTTACGAAT 34 8 0 

GGGTTGTTGA AAAAGATGCA TGGTATGGTC CAGAACTAAA CCATATTAAA TATGTTGATG 354 0 

ATTTAACACC ATATATTGAG CGTAAATTAT TAACTGTGAA TACAGGACAT GCATATTTAG 360 0 

CGTATgCTGG tAAATTTGCA GGTAAAGCTA CAGTTTTAGA TGCAGTTGAA GATAGTTCAA 366 0 

TTGAAGCTGG CTTACGCCGT GTTTTAGCTG AAACTAGTCA ATATATTACT AATGAATTTG 3 720 

ATTTTACTGA AGCGGAACAA GCTGGTTATG TTGAAAAAAT AATAGATCGT TTCAACAATT 3780 

CTTATTTATC TGATGAAGTA ACACGTGTCG GACGAGGTAC ATTACGTAAA ATTGGCCCTA 3 84 0 

40 AAGATAGAAT TATAAAACCA TTAACATATC TTTATAATAA AGATTTAGAA CGCACTGGTT 3 90 0 

TATTAAATAC AGCTGCATTG TTATTGAAGT ATGATGATAC AGCAGACCAA GAAACTGTTG 3 96 0 

AGAAAAATAA TTACATTAAA GAACACGGTT TAAAAGCGTT TTTAAGTGAA TATGCTAAAG 4 02 0 

''^ TTGACGATGG CTTAGCCGAT GAAATAATTG AAGCGTACAA TTCACTTTCA TAATTTATTG 4 080 

AGCTTTGTTT GAAACAAGAA GTTTCCAACG TTATTCGTTA ACAATCAGTA ATAATGTAGT 414 0 

AGTTCCCTTG AATTAACAAT ATTAAATTTC TGAACATAAA AAATACTCCC TTC7VACATAG 4 20 0 

ACACTTAACT TGTGTTATGT ATGAAAGGAG TATTTTTGCG TTAATAATTT GTTTTATTTT 4 26 0 

CGAGCCACAG CCACCTATTC AATGGCTATT GGTCATTACT AAAACAAATT CATATTAACT 4 32 0 
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TTGAATAAAT TTTATTCTTC AGTTTGTTGG TCTTTCTTAG TGAATCTTCT AATTAAGAAT 444 0 

GCCATACCTG CACCTAGAGC TAATTCAGCA TATGGTAAAT CGTCATTATG TGACATACCA 4 500 

^ GTATCTGGTA AAGTTTTAGC TTGTTGTTTA GCTTTATTAA CTTTTCCTTG TTGAGCTGAT 4 560 

TTTGTCTTAG CTTGGTGGTC GTCAGTGTTA GTTACATTAA GCATATCTTG ATTAGCACTA 4 6 20 

TTGCTTCCAT TTGAAACTGT AGCTGGAGAT GCATTGGCAC CGTCGTTTTG CGTAgyTTTA 4 680 

TTGTTTGCAG CTGAACCAAC TGATTTTTGC GTATCATTAG TATCTGCTGT TGCCGTATCA 4 74 0 

TCTTTTTGGC TAACATTAGT TGAAGTCATT TTTTCTTTTG CTTCAGAAGA TGCAGATGTT 4 800 

GATGGTTTAT TCGAAACTTC AGTATCAGCT TTGCTTGGCG ATTTATCTGC TTCGTTAGAT 4 86 0 

GCAACGTTAG TTTCAGACTT AAGTTGTCCT GCATCAGTTT GATTTGTCGT ACTTTCTTCT 4 92 0 

TTATCTTTTG ATGTATTAGA AGGTACATTT GGTTCTGTTA TGTCTGCTGA AGGCAATGTT 4 980 

20 TCAGTTGTtG ATTCAACCAT ACTTTGATTT GTTGAATCAC TACCATCTTT TTCTGCCTTA 504 0 

GCTTTATTTT CAGATTTTGG TTGTGCaaCC TTGTCATTAG TTGATTGAGA TTCAGCACTA 5100 

TTATTTACTT CAGCATTTTG TTTTGAATCA TTTACAGATG CATTATCTTT GCTATCAGCA 516 0 

GATGATGCTG CTTCTGTGCT CGCAGTTGTT GGAGCCGTTG CTGTTGATCC TGTTGGTGCA 522 0 

TTCTCGTTTG TTGCTGTAGT TGTACTATTG TTATTTGTTG TGCTTTCTGC TGGCGTTGCA 528 0 

TTATCAGTTT CTGTTACAGG TTTATCAGTT GTGCCGTTAT TAGTTGATTC TACTTCTGGT 534 0 

TTACTAGTTA CATCGTTATC CATTGTCGGA CTGTTTGTTG ATGCATCTAC ACTAGAATTG 54 00 

TTATTAGCTT GCGGTTTATC ATTTGCATCA TCAGTTGCTG ATGTTGCTGT TGTTTCACCT 5460

GTTGCCGCAT CACTATTATT TGGTGTTGTC GGAGAAGCGT CTGCTTTGCC ATTAGCTGTC 5520 

GTCTCAGATA CGTTAGGTTG TCCAGTATTT TCTGGTGTTG CATTAGCATT TGAATTTGCT 55 80 

GTTGCATCAT TATTATCTAT ACCATTATTA GTATCATTAG CATCTGGATC ATTCTGAGGC 564 0 

ACAATCGCTT CAATTGCAGG TATCGTTACA TTTTGTAATT CAGCAACTTC TGCATTTGTT 57 00 

TGTGTTTTAT CTAATTTATC AGCAAATCTG TCAAAATATC TACCTAAATC CGTACGTGCA 5760

ATTTCTTTCG CCGATGCATC TGCATCTGCA TTTTTAATTA TTTCTATTTG CTTGTTAACC 5 8 20 

ACTTCTCTGA TTGCTTCCAA AGCATTTTTC TTAACTTCAG GATTAATACG TTGTGCTTTA 5880

AGTTGTTCAA GCGCACTATT TTTGACAGTA GCGATTTCTG CATTTGTAGT TTGATCAGAA 5 94 0 

ATATCTTCAG TTGCTTTTGA TAAAATGTCT TCTAAAGCAT TCGTAAACGC TTCTTTTTCT 6 000 

TCAGTTGTAG CATCAGCGTT GACATTTACA CCTGCTTCAA TCTGGTCTAG TGCAGTTTCT 6060
(2) INFORMATION FOR SEQ ID NO: 535: 
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(A) LENGTH: 977 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 535: 



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GACTATACAA 


ACATGTTACG 
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GATCAGCGAT 
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660 


AGTCCATACT 


TTACACATTC 
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TTTGTCTTAG 


CACATGAATT 
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GGTCATTTTA 
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AAAACATCAA 
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CCATATCTTG 


AATCAGAAGC 


ATCAATGTAC 


TTTGTTGAAG 


CCCCTTCTAC 


AATGAATGAA 


B40 


ATGTTGATGG 


CCAATTATTT 


ATTTAACACA 


AGTGATAATC 


CAAGATTTAA 


GCGTtGGGTT 


900 


ATTGGCTCAA 


TTTTATCTAG 


AACATATTAT 


CATAATATGG 


tACCCmTTTA 


TTAGAAGCnG 


960 


CTTATCCACG 


GGGAGTG 










977 



(2) INFORMATION FOR SEQ ID NO: 536: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1440 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 536:

AGACAGTGAT yGaATTTCAT TTACAGTACA CAAATCATCG AAAAATTGGT AACATTCTTC 6 0 

TCTATTTTCT AACGTTAgwA TTgCATCAAA CAATTCATCT AACGCTGCAC CTCGTAATTT 12 0 

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AATCTATTAA AGTGTAGCGa TTTATATTTT ATTAAATCTG AATCGTTACT TTATTTAATT 24 0 

TTATGCTAAT CCAGCGCGTT CGAAAATAGT GTCAACTTGa TTCAAATGAT GTTTAGGATC 3 00 

GAAACATTCA TCCAATTCTT CTTTTGTTAA AACACTTGTA ATAGACTCAT CTTGTTCGAT 36 0 

TAATTCACGG AACGGTGTTT TCGTTTCCCA AGATATCATC GCTTTTGGTT GTACTTTGTC 4 20 

GTATGCTTCT TCACGAACCA TACCTTTATT AATTAATGCT AATAAGACAC GTTGTGAGAA 4 80 

AATCAGACCA AATGTTTTAT CTATGTTATT ACGCATATTA TCTTCAAATA CAGTTAAACG 54 0 

GTCCACAATA TTTGTGAACG ATTCAATGCA TAATCTAGTG CtATTGTAAC ATCTGGTAAC 600 

ATAATACGCT CAGCAGAAGA ATGAGAAATA TsTCTTTCAT GCCATAATGG CACATTCTCA 66 0 

TAAGCTGTAG TAATATAACC ACGAATGACT CTTGAAATAC CTGTGATATT TTCAGAACCA 72 0 

ATTGGATTTC GTTTATGAGG CATTGCAGAT GAACCTTTTT GGCCTTTTGC AAATGCTTCT 7 80 

20 TCAACTTCTC TCGTTTCGGT TTTTTGAAGG TTACGTATTT CAACGGCAAA TTTTTCTAGT 84 0 

GATUTCGCGA TTAATGCTAA TGTCGCAATA TAGTATGCAT GTCGATCGCG TTGCAATGT7 900

TGCGTTGATA CAGGCGCTGT GCCAATACCT AAATGTTTAC ACACATAACT TTCTATTTCA 96 0 

GGAGGAATGT TAGCAAAAGT ACCTACTGCA CCACTCATTT TCCCTACTTC AATTTCTTCT 102 0 

CTTACTTGTT TGAAACGTTG TAAGTTACGT TGCATTTCCG TGTACCACAA TGCCATTTTG 1080 

ACACCAAATG TAGTTGGTTC TGCATGCACT CCATGTGTAC GTCCCATCAT CAATGTATAT 114 0 

TTATAATTTT TTGCTTTTTC AGCTAAAACG TCGATAAATC TTTCTAAATC TTTTTCAATA 12 0 0 

ATGTCATTTG CTTGTTTAAT AACGAAACTT AAAGCTGTAT CTACAACATC AGTAGAAGTT 1260 

AAACCATAAT GTACcACTTA CGTTCTTCAC CTAGCGTTTC AGAAACTTGT CTAGTAAAGG 13 20 

CTACAACATC ATGGCGCGTT TCTTGTTCAA TTTCTTGTGC ACGTTCGACA TTTACCTTTG 13 80 

CGTTTTGACG AATTTTTTGT ACGTCAGCTT TCGGTATATG TCCTAATTCA CTCCATGCTT 144 0 
(2) INFORMATION FOR SEQ ID NO: 537:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 784 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 537: 
GATAAATCTA TnCAGTTTCC GTCCAAATAT CtGCaCCTAA AGCTTTTAaG 
TATCTgTATA ACCTCTATAA ATATGTTTAA CATTGTAAAT TGTAGTTACA 



TGTTCTACAA 



CCCTCAGCAA 
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CATGTAATGT TGATGGTTTT ATCGTTGCTG TGCCTTCGTC AACTTCAATA TTTGCACCCA 240 

TGCGCTTTAA TTCTTCAACA TGTTTAAAAC GCTCCGGATA AATCGTATCA GTTACAAATG 300 

AAGGACCATT TGCCATAAAT AATAATGGTG TAATAGGCTG TTGCAAATCA GTAGCAAAAC 360 

CTGGATATAC TAGTGTTTTA ATATCAACAA ATTGATATGG CGCATTATTA TTGATGCGAA 42 0 

TTCTTTCGTC TCTTACATCA ACATTCACAC CTAATTCACT AAATTTAGCA GTTAATGTTT 4 80 

CTACATGTTT CGGAACAATA TTATTTAATA TAACATTTTC TCCACATGCT GCAGCGATAC 54 0 

ACATATATGT GCCTGCTTCA ATTCTATCAG GTATAACTTG ATACTCAGAA CCATGTAATT 600 

^5 CTTTGACGCC ATTGATTTTA ATTGTTGATG TACCCGCTCC CTTAATATTA GCTCCCATAC 66 0 

TTGTTAAGAA GTTAGCAACA TCAACTACTT CCGGTTCTTT AGCAGCATTT TCAATTACAG 720 

TTTGTCCTGT TGCATAAACT GCAGCTAGCA TAATGtnAAT TGTTGCACCT ACGCnAACCA 780 

20 TATC 784 
(2) INFORMATION FOR SEQ ID NO: 538: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3733 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 538: 

CAATCTCCAC CAATGACACG TCGTCATTCG TGACCTCATA CCArACAAAA AACAGTCTCG 6 0 

CAATCAAGAC TGTTTTCCAC TCAATATATT CATCCATTAG CGTAATAGAT TATTTGACTT 12 0 

CTGTAGCTAC AAAGaTTTTA CGTTTTTCCC AAACGCCTGT cTTTTCATTG TAATCATCAC 180

AAGTAATTAA TGTTAATTGT TTATCTTTAC CTTTTTGTTC ATCTAGAAcT CCTACATCTG 240 

40 TAGGCTTAAC ATCTCTTATA CTTGTCATTT TATACTTACG TGTTTCATTA CCAACTTTAA 3 00 

AGTACACCAT ACTACCTTTT TTGGCTGCTT TAAGATTTGT AAATTGATAG TTCGGACGGT 360 

CAATGAAAGT GTGTCCTGCA ATTGAAATAT TTTGATCATC TAGTGATTCA TTTTCTTCTG 420 

45 CAAAGCTTAC ACCTCTATTT AATTGTTCAG GTGTTGCTGG TCCTGGATAT ACTGGTTCTT 480 

TAATATCAGC ATCTGGAATT TCAATATAGC CTGCCACTTT CGATTTATCT TTCGGAATTT 54 0 

GAGGTTTAGC TTGCTGCTTT TTATCTTTAC TCGCCTGTTC TTTTACATTT TTATCATATT 600 

GTTCAATCTT TTCATCTTTA TCTTTATCGT GAAGATAATT ATCGATATGT GGTTTAGCAA 66 0 

ACAAATATGC TGCCACTAGG aTAAGTACCA CACCAGCGAT TGTCATTAAT CGATTTGTCC 720 
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w 



AGTATACCAT TAATTTCAAA ATGACTCATA GCAATTCATT TTATACTATA AAATTTACAT 84 0 

GTATACTTTT ACGTTAGATT TCATTACACA TATTTGCATT CAAATAACGA AACGCTTTTA 900 

ATAATTACTA AGGGGGAATT GATATOATTA GATACGCTAA AAAAGAGGAT TTAAACGCTA 96 0 

TATTAGCGAT ATACAATGAT GCCATTATCA ATACTACAGC TGTTTATACT TATGAACCAC 1020 

AAACCATAGA CGAACGTGTC GCATGGTTTG AAACGAAACA ACGTAAGCAT GAGCCTATCT 10 80 

ITGTATTTGA GGAAAATGGA AGTGTCTTAG GGTTTGCGAC GTTCGGTTCA TTTAGACCTT 114 0 

GGCCAGCATA CCTATATACA ATCGAACATT CTATTTATGT CGATGCTTCA GCTAGAGGAA 120 0 

AAGGTATTGC TAGTCAATTA CTACACCATT TAATTGTGGA AGCAAAAGCT AAAGGTTATC 12 60 

GTGCGCTAGT TGCAGGCATT GATGCTTCCA ACAAAGCGAG TATTCAGTTG CATCAAAAGT 132 0 

TTGCTTTTAA GCATGCCGGC ACACTGACCA ATGTAGGTTT TAAATTTAAT AGATGGTTAG 13 8 0 

ATTTAGCATT TTACGAATTA GATTTACAAG ACTAGTAATG TTTGAATCAC ATAATATAAA 144 0 

CAAGACAACC ATGTTA7^»TTC CCTTAACATA ACAAGCCAAC ATATAAAATT TTAAACTTCT 15 00 

CAGGGGAGTG GGACAGAAAT GATAAAGAGC CACTAATGAT TTATTATGTA GTGGTTCTTA 156 0 

CACATTAGCC ACAGCTAATG TGTAcTTAAA AATAGGAATA CATGAGTAAA ACTCATGCAT 162 0 

AAGAAATACT AATTTCTATA GAAAAAGTAT TTCTTTATCG TCGTCCCACC CCAACTCGCA 168 0 

CATTATTGTA AGCTGACTTT TCGTCAgCTT cTGTGTTGGG GCCCAAAAAG CTTGTTACAA 174 0 

GCGCATTTTC GTTCAGTCAA CTACTGCCAA TATAACTTTG TAGAGCATAT TACATTGATT 1800 

TACATTGTCC CTTTTATTTA TTCTTTTCAA ATACTATCCC CATAGCTTTG ATTTAACGCT 186 0 

TTTTCTCAAT AACAAAACGA ATATAGTAGA ACATGAAAAC GATAATCATG CTGAGCGATA 192 0 

AAGATTTAAA TAATAGATTG ACCCACGTTC CCTCAGTCGT ATATCCATAT GTAATCGTTG 198 0 

TGTTAATGAT GAATGCTATA AAGATGATTG ATAGTCTTAG CATATCATCA CTCCTTTTAA 204 0 

GTTATTTTAG ATATACGGGG GCGCTTTTGC AATCACTATT TTGATTAGTA TGCATTTTCC 2100 

ATAAATCTTT CAACTTCTTC AGAGATAATT AAGAAGCATC TATCTGGTAC TAATGATCCA 216 0 

GACAGATGCT TCTTTTTTAT CAATATTTTA TTGTTATCTC ATTAATTATT TTTAACCATA 222 0 

45 TCTTCAGCTG TGCCAAAGAT TTTACGTTTA ATTGCTTCGC CAGTTGGTGT GCCTGCTAGT 22 8 0 

CCACCCAATC CAGTTTCACG TAATGATGCA GGAAGGTTAC GACCAACCTT ATCCATTGCT 2 34 0 

TCAATAACTT CATCAACAGG GATTCTACTT TCAATACCTG CTAATGCTAA ATCTGCTGAA 24 0 0 

ATTAAAGCGT TACCCGAACC AATTGCATTT CTCATAACAC AAGGAATTTC AACAAGTCCG 24 6 0 

GCTACTGGAT CACAAACTAA ACCTAATAAA TTACTTATCG CTAATGCCAT AGCGTGCCCG 2 52 0 
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GAACCAACTT CAGnTTGGCA GCCACCTGTT GCACCAGCTA CACTTGCATT GTTTGCTACG 264 0 

ACACGCCCAA ACAATGCTGA AGTGAATAAG AAATCAATCA TTTGCTCTTC TGTTAAATCA 270 0 

5 TGTGTTTTTT CTAATTTAAA AAGTGCACCG GGAATGGTAC CCGAGGAACC AGCTGTTGGC 27 60 

GTtGCACAAA TAATACCCAT CGCAGCATTG ACTTCATTTG TTGCAATGGC AcCtTTGcTG 2820 

CGTCAATCAT TTCATATCCA GACAAAGCAT GATGTGTTTC ATTATAATCA CGTAGTTTAG 2 8 80 

CAGCATCATG ACCAGTGTAG CCCGTTACAC TTTCAACCCC ATCACCTGTC GTCCCTTTGA 2 94 0 

TTACTGCGTC TCGCATGACA TCTAAATTTT GTTTCATTTG CGCTCGCACT TCATCACGTG 3 0 00 

ATTTACCGCT TAATTCCATT TCTTCTTTAA CCATGATATC CGCAAATGAC ATATTATTTT 3 060 

CTACGGCATA ATCTATAGTC TCTCTAATTG AATCAAACAT GTTTATTCCC CCTCTAATTT 3120 

ATATAGGAAA CGTTTACGTC ACTGTATTTC TCTTTAATTG TATTTAATGT TGATTCTGAG 318 0 

20 ATTGCTTTAT TTAATGGTAT TACAACCAAG CATTTATCTT CATCTATCTT AATAAATTCA 3 24 0 

TCTTTACAGT CTAATTTCAT ATCGTTGATA TCATTGATGA AATGATTTAC TTGTGCTTTA 33 00 

GTCATATTTC CGTCAACAAC TAAAATTGGT AATCCATGAT TTAAATCTAC TTCTAGTCCA 3 3 60 

25 TTThTATGAA TACCTTTAAT TTTAATTGTA CCACCACCGA TTGAAATACC GATAATTTCA 3420 

ATGTAGCGAC CATCATTACG AGATGATTTG ATATAAGCAC AGTTTGGATG TTGACCAATA 34 8 0 

CTATCGCCTT CTTCTTCGAT GATATCTATT TTAATACCAT CATCAGCTGC AATTTCTAAT 3 54 0 

GAAGATTTAA TTCGGTTATC AAATGTTGAA TATCCCATTG CTCCACCCAC AATAGCGACA 3 600 

TCTGTACCAT GTCCTTGGTG TGTTTGAGCA AATGATTCAT AATAATGTAT TTCAATATTT 3 660 

TTATATCTCC CAATATTGCG CGTGCTGAAT TCCCCTTTAC TGCACCAGCC GTATGAGAAC 3 72 0 

TTGAAGGGCC CAT 3 73 3 
(2) INFORMATION FOR SEQ ID NO: 539: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 525 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 539: 
TGGCTGTCTT CTCTATGAGT GTAGTAAGTA AGTTAACGGA TTTAACGCCA AGGCAAATAC 60 
GTTACTATGA AACACATGAA CTCATCAAAC CTGAAAGAAC AGAAGGTCAA AAACGTCTGT 120 
TCTCACTCAA TGATTTGGAA AGATTACTAG AAATTAAATC ATTATTAGAA AAAGGATTTA 180 
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AAGAGATAAG AAAAAAGATG ATTGTAGATG CCACGCAAAA GCCTATTGGA GArACTTTGC 3 00 

CAATAAATCG TGGTGATTTA TCCCGATTTA TTAAATAAAA TTTGGAGGAT TTTAAAATGC 360 

CAAAACGTAC TTTCACTAAA GACGACATTC GTAAATTTGC AGAaGAGGAA AaTGTAaGaT 420 

ATTTAAGATT ACAATTCACT GATATTTTAG GAACAATTAA AAATGTTGAA GTGCCTGTAA 480 

GCCAATTAGA AAAAGTACTT GATAACGAAA TGATGTTTGA CCGTA 525



(2) INFORMATION FOR SEQ ID NO: 540:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1408 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 540:

TTGATTTGCT ACAAAGTATC TxiCTCATTTC TGTATCCTGA AAAATCTTTA GTGTAATAAT 60

GTTGTTCAGT TTTAATATTT TCAGTCATAG TTGACTACCT CCGTATATTT TGATTTAATT 120 

AAGTTGTATA TTTTGATGAA CACTTATTGT TACTTGTTGG CGCAAGTAGC AGTTTTTTCA 180 

TTCTTCATAA AAGTATTCCT TATAGAATAT GAATGTTGCG ATACTTTGCGA ATCCTGCAAT 24 0 

TGaCCATGCT GtAGTGAAGT ATAGAAACGG CATAAGTACA ATCGCTAAGA CTGTGAAGCA 3 00 

TAGTACTGCT ACTAGGTAGC TTTTATAAAT GTTACTCATT TTCTTTTTTC AACTCCTCCA 360

TTATTCTCTG GTCTGATAAG TCGTGATAAG GGAATTTTTT CCtAGCTAAT TGGACTGGTA 4 20 

TTCTGCCTCG TATCGCAATG TATCCTTCAT CTTCAAGCTC TTTATTCAGT TCTCTTATTA 4 80 

TTTGTCCTGC TTTGGATTTA GAAACAGATA AAATTACCgC AAGTTCTTTA GCTTGCAAAC 54 0 

TATTTTTCAT CATATCTTTT CCTCCTTTAA AATAACTGTT GATTCTCTGG GTTATCTGCT 600 

TCGTAATTAT CTGCAATAAT ACTTTTAGCG AAAAAGTCCA AACTGACCTT ATATAGGTTG 660 

TTCATAGATT TCTTTACGTT AACCCCTTCC TCAAGTACAT AAGGCACCCT AAAATCATTT 720 

ATAAACAGTC CGTTTTCGTC TAAAGTAACG GTTGGTAATT CAGGTTTGTT CCGTCTATAA 78 0 

^5 ACTTCTCCTA GTGTAGGTTT TTGCTTTTCA GCTTGTTTAG TGAAGTCGGA AAATGCCTTA 84 0 

AGTAGTTTTA TTCCTGAATC AGGATCACTG TGTCGCTCAA TCGTTTCTGC TGTAGACTCT 900 

TTACTAAAAT CATTTCTATT GATTACAGGC TTTCTCGTAT TTCGTTCAAT CTTCCAAACC 96 0 

^0 TTCCACGTCA CAACTGCCAT TGTGGTGAGG AGGGTTGTTT TGTATAGTGC GTTCATTTGT 1020 

AATTCCTCCT ATTAAGTTGT TTGTTCAATT GTGTGTGTTA TTCTTCTTCG TCTAAATCAA 1080 
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CGACTTCGTA AGTGTGCTCA ATCTCGCCTG cATATGTCAC AGTAAGAGTA TCTTTGTGTG 1200 

TGTATGTTTG ACTTTTGTTy TCTtTAACTG CATAAAGTGT TAATACTATA TTGTTTAGCT 12 6 0 

TTyCTTTTTG TTCTGGTGTC ATTTACGCTC CCCCTAmATT AGCyTCATAA CCGAATTCAG 132 0 

TCATGATTTC ATGTATTTTC AATCTGCCTT TTTGTGTCCA TCTAGTTTGT AAAACTGTGT 13 80 

CTTCTCTGCC ATCAGAACGC ACAATTGT 14 08 
(2) INFORMATION FOR SEQ ID NO: 541:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 432 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 541:

GTTCGTCAAT GATTTTTATC CGATGCTATG AGCATTaTCA AATACAAAAT GCTCTCTTAA 60 

AAGCAGTTAT TGACTGAAAA TCTACTTCTA AGAGAGCACT TTATTTAATT ACTTAAGAAA 12 0 

25 TCTTGAAATT TCAATATACG ATGTTTATGA TAAGTCGCTT ATTTCATCTT TAGGCTTGTT 18 0 

ATTAGTAAGT AGTTTAATAC CACTGATTAA CCATAAAGCA AATGTAATTA TGTTACCACT 24 0 

TATTACAGCT CCAATAATCA ACAATATACC ACTCATTTTT TTGTTTTTAG ATGCTTTAAA 30 0 

CATACCGATT GCACCTAAAA TAATTGAAAT GATTCCAAAT ATGAATAGGG ATAAGAATAA 360 

TACAGTGAAA ATTGCTGCTG CTGTTTCTGc ATCAACTGGG nCAACCTCAC CATTAACTGT 42 0 

TGTTGGACAC AT 43 2 

(2) INFORMATION FOR SEQ ID NO: 542: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2426 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 542: 

ATAATCATGA AGTnGCTAAA nCGCCAAATA ATGATGGTTC TGGACATGTT GTGTTAAATA 6 0 

AATTCCTTTC AAATGAAGAG AATCAAAGCC ATAGTAATCG ACTCACTGAT AAATTACArG 12 0 

GAAGCGATAA AATTAATCAT GCTATGATTG AAAAATTAGC TAAAAGTAAT GCCTCAACGC 18 0 

AACATTACAC ATATCATAAA CTGAATACGT TACAATCTTT AGATCAACGT ATTGCAAATA 24 0 
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GTATAAAAAG TCAACGAAAT ATTATTTTGG AAGAACTTGC ACGTACTGAT GATAAAAAGT 3 60 

ATGCTACACA AAGCATTTTA GAAAGTATAT TTAATAAAGA CGAGGCAGTT AAAATTCTAA 4 20 

AAGATATACG TGTTGATGGT AAAACAGATC AACAAATTGC AGATCAAATT ACTCGTCATA 4 80 

TTGATCAATT ATCTCTGACA ACGAGTGATG ATTTATTAAC GTCATTGATT GATCAATCAC 54 0 

AAGATAAGTC GCTATTGATT TCTCAAATTT TACAAACGAA ATTAGGAAAA GCTGAAGCAG 6 00 

ATAAATTGGC TAAAGATTGG ACGAATAAAG GATTATCAAA TCGCCAAATC GTTGACCAAT 6 60 

TGAAGAAACA TTTTGCATCA ACTGGCGACA CGTCTTCAGA TGATATATTA AAAGCAATTT 720 

TGAATAATGC CAAAGATAAA AAACAAGCAA TTGAAACGAT TTTAGCAACA CGTATAGAAA 780 

GACAAAAGGC AAAATTACTG GCAGATTTAA TTACTAAAAT AGrAACAGAT CAAAATAAAA 84 0 

TTTTTAATTT AGTTAAATCG GCATTGAATG GTAAAGCGGA TGATTTATTG AATTTACAAA 900 

AGAGACTCAA TCAAACGAAA AAAGATATAG ATTATATTTT ATCACCAATA GTAAATCGTC 960

CAAGTT7ACT AGATCGAT7G AATAAAAATG GGAAAACGAC AGATTTAAAT AAGTtAGCAA 1020

ATTTAATGAA TCAAGGATCA GATTTATTAG ACAGTATTCC AGATATACCC ACACCAAAGC 1080 

CAGAAAaCGt TAACACTTGG TAAAGGTAAT GGATTGTTAA GTGGATTATT AAATGCTGAT 114 0 

GGTAATGTAT CTTTGCCTAA AGCGGGGGAA ACGATAAAAG AACATTGGTT GCCGATATCT 1200 

GTAATTGTTG GTGCAATGGG TGTACTAATG ATTTGGTTAT CACGACGCAA TAAGTTGAAA 126 0 

AATAAAGCAT AATTATATTG GGGGAAGAGC ATCTATATAT TTTTTTAAGT ATATAAGACG 1320 

TCTTATTTCC CCTTAATTTA TTGTGAAGTA TATGCAAAAT GCAATGAATA GATTGTCCAT 13 80 

CATTTTAACG TTATAATGAA TTTAACGACT TAGAACTACA CAAGTAAAGG AGAATGAAGA 14 40 

TGTCTCGAAA AACGGCGCTA TTAGTTTTGG ATATGCAAGA AGGTATAGCG AGTAGTGTAC 1500 

CTAGAATAAA AAATATTATT AAAGCGAATC AGAGAGCAAT TGAAGCAGCA AGACAACATC 1560 

GAATACCAGT CATTTTCATA CGTTTAGTGT TAGATAAGCA TTTTAATGAT GTCTCCTCGA 1620 

GTAATAAAGT GTTTTCAACA ATTAAAGCTC AAGGATATGC GATTACTGAA GCAGATGCAT 16 80 

CTACACGAAT ACTTGAAGAT TTAGCACCAC TAGAAGATGA GCCGATTATT TCTAAGCGAC 174 0 

GCTTTAGCGC ATTTACAGGT AGTTACTTGG AAGTTTATTT ACGTGCAAAT GATATTAATC 18 00 

ATTTAGTATT AACGGGTGTC TCTACAAGTG GAGCTGTATT GAGCACGGCA TTAGAAAGTG 18 60 

TAGATAAAGA CTATTATATT ACTGTTTTAG AAGATGCTGT TGGTGATAGA TCAGATGATA 1920 

AACATGACTT TATTATTGAA CAAATTTTAT CACGCTCATG TGACATTGAA TCCGTAGAGT 19 80 

CATGGAAAAG TAGTTTATAG TTAATATAAC GTCAATTAAA GCTCGGCAGT AATGTTTGAG 2 04 0 
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GAGGAACATT TGAACATAAA ATAATATATT TATATAAAAC GACCgAGGCG TTCGAACTGA 216 0 

ATGtCCTCGG GTTTAATTGA ATAGAAATCG GACTTATGAA CGAAATATGT TTAAGTCGAA 222 0 

CTCCTTGTTT ATACTTATAA ATTTTACGGG TTTAATATAA TACTTATTTA CCTGTAATAT 2280 

ATGCATAATT nCTTCAGTCG GTCAGCCTGT CGTTGCATAG TTCCTATGCA GCAAATGCAT 234 0 

ATCCTAATCC TTTAACATTG GCATTnCTGC AAATGAACGC ATAGAATCCA TTCACTGTTA 24 00 

ACTTTTTnCA ACAAATGTCT nACATG 2426 
(2) INFORMATION FOR SEQ ID NO: 543:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1874 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 543: 

GAGTTGGGGA ATGTGCTCAA AATATGCGGA CTTTATGCAT TyCGGAATTG sCCaATTGCA 6 0 

GCTTTAAGCT ATGGTCAAAA AAAGAGGGTC ACTATAGCAT CTGTTTTAGT CTTAAATCCG 120

GAAATAATCA TATTGGATGA ACCGACTGCT GGTCAAGATT TCTATCATTA TAATGAGATA 18 0 

ATGTCATTTT TAATTGAACT AAACAGACAG GGGAAGACGA TTATTATGAT TACGCATGAT 24 0 

ATGCATTTAT TGTCTGAGTA TAGTTCAAGA ACAGTTGTAT TATCAAAAGG TCAAGTCGTT 30 0 

GCTGATACCA CGCCAgTATT GGTTTTAAAT GATAAAAAAA TCTGTGAGAT TGCATCATTG 36 0 

AGACAAACAT CGCTATTTGA AATGGCCGAA TATATAGGGA TTAGCGAGCC ACAGAAATTA 42 0 

GTACAATTAT TTATTAACCA TGATAGGAAG GTGAGACGcC AATGAATCAA TATAATACTA 480 

TAGGTTTTCA CCCGGGAAAT AGTCGTATTC ATCAATTAAA TGCGACTGTT AAACTTTTAT 54 0 

TCTTATTAGT TGTTTCTATT TCTGCAATGG TGACTTATGA CACAAGATAT TTAATTTTAA 600 

TTAGTGCTTC ATCTATTTTA TTGGTCAAAT ATGCTCATAT TGAATGGAAA CAAGTTCGCT 66 0 

TTGTTGTTAA ATTCATTCTG TTTTTCACAA TAtTAAATAT TATTGCCGTG TACATATTTG 72 0 

ACCCTGAATA TGGTGTGAAG ATTTATAATC AGCGTACAGA GTTAGTCAAT GGTATTGGTC 780

GATTTACGCT AACATCACAG GAATTATTCT ATCTTTTTAA TCTAATATTA AAATATATTA 84 0 

GTACAGTTCC TTTAGCGTTA ATATTTTTAT TCACAACGAA TCCGAGTCAT TTTGCTGCAA 900 

GTTTAAATCA GCTAGGTGTG AATTATAAAA TCAGTTACGC AGTCTCACTA GCATTAAGGT 96 0 

ATATTCCAGA TATTCAAGAA ACATATTttA ATATTTCACA AGCGCAACAA GCAAGAGGAT 1020 
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T AC CTFTTAAT 


ATTTTCTAGT ATCGAAARAA TTttAPAPTa.T TAf^TAPTGPT ATPPAPTTAa 1140

GACAATTCGG ACAGTATAAA AGGAGAAfCT GGTAPrSTPAa AAAAPAAT^A AAAAAArUkT*^ 1200

ATTATGTTGT TTTGTGTTTG ACGTTAATAC TTPTGATGTT AGTAGT^TAPA TTATTPT*P*7*T' 1260

TAAATAATAG TCGATATTTC AACCCGTGGC ATTAfiTATTP ATATAAATAG TPTTTAAATA 1320

GAAATAGGAG GGAGACATTT AaT^ATAAAT APTGAAAPAT TAAATTTAAT pATT^rT'aii.r'T 1380

TCCTCGCATT TAATTGAACT TTATAATATT TGTAGTPATP PAPAAPPAAA TATAT'^/^&r'T 1440

CCCAAAGGTT TACATAATTC CAAATTAGAP APAPAAPfV^T ryiATTriAAAA 1500

CATTGGCAAC AATATCAATT TGGTTAPTTT GTATT(V;TAA AAAAAATAPA i1vjX/\Ljii\j1i 1560

ATTGGTATTT GTGGATATGA ATATCGAPAA TTAAAHPAAP A^A^R&'1*'1' AAftiJ.1Ai1i 1620

TATAAATTAC ATCCAAGTTT TGAAGGACAA GGGTAPGPAT nTPAPPPTAT 1680

ACAAATTTTG TGAATTATAT rnATPAAGAA APAflTAAAA(^ AAAiAAVjri\j1 1740

AACCAACGTT CAATAAATTT AGCAGAAAGG CTTAAATTCA AGCGAGACGA TACTATGGAC 1800

GACATTATCA ATCAAGGAGA TATTGTGTTT TAAnAATAAA ATACTATGAC ATTATCTAAA 1860

AAATAAAATT AAAA 1874



(2) INFORMATION FOR SEQ ID NO: 544:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5280 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 544: 



TCAACATTTC TAACACCAAT GTGAAAATGA TCTATGTGAT TTGCAATGGC TTGATTTGTA 6 0 

ATATGTGTGC CTAAATGACC TGTAGCACCT GTTAACATAA TATTCATTCA CTTCATCTCC 12 0 

TAATCTTTAT ATACATAACA TAATACTTAT TTGATGGTTT TCAAAACATT TGATTTTATA 180 

AAAAATTCTA ATCTGTATTT ATTGTCGACG TGTATAGTAA ATACGTAAAT ATTATTAATG 24 0 

TTGAAAATGC CGTAATGACG CGTTTTAGTT GATGTGTATC ACTAATATCA TTGAAAATTT 3 00 

TAATCaGGTA CTACGACAAT ATGATGTCTG TTTTGTGTCT GAAAGTTTTA CAGTTTTTAA 3 60 

AATAAAAATG GTATAAAGTG TGATTTGTAT AAAAAAGAGT CTCGACGGAT AAGAATTGAT 4 20 

TAATAACAGT TAGCATTTTA TTAATTACCT TAACAATGAT TCAAGTTTAG TTAAATGAGG 4 80 
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AAGCAAGTTT AGCATTGGGA ATGTTAGCAA CAGGTGTAAT TACaTCGAAT GTACAATCAG 6 60 

TACAAGCGAA aGCAGrAGTT AAaCAACAAa GTGAaTCAGA GTTAAAACAC TATTATAATA 72 0 

AmCCAATTTT AGAGCGTAAA AATGTGACTG GATTTAAATA TACTGATGAG GGTAAACACT 78 0 

ATTTAGAAGT CACAGTAGGG CAACAGCATT CTCGAATCAC TTTACTTGGA TCTGATAAAG 84 0 

ATAAATTTAA AGACGGAGAA AACTCAAATA TAGATGTGTT TATCCTTAGA GAAGGTGACA 900 

GTAGACAAGC AACAAATTAC TCAATTGGTG GCGTTACAAA ATCAAATAGT GTGCAGTATA 96 0 

TTGATTATAT CAATACGCCA ATTTTAGAAA TCAAGAAAGA TAATGAAGAT GTACTTAAAG 102 0 

ATTTTTACTA CATTTCAAAA GAAGACATCT CATTAAAAGA ACTTGATTAT AGATTAAGAG 10 80 

AACGTGCGAT TAAACAACAC GGCTTGTATT CAAATGGTCT TAAACAAGGT CAAATTACAA 114 0 

TTACAATGAA TGATGGCACA ACACATACAA TCGATTTAAG TCAAAAACTT GAAAAAGAAC 120 0 

GTATGGGTGA GTCAATCGAC GGCACTAAGA TTAATAAAAT TCTAGTAGAA ATGAAATAAT 1260

ACTTTCTAAC AACAAAGCGC TATGTTGAAT AGTGCTTGTT ATGGAAATAT ATGGAAGTTA 132 0 

AGCGACGTAC TGTTGCTTAG CTTCTTTTTT TGAGGGGAAA AGTTACAAAA CTCACACAAA 13 8 0 

CAGTCGCACC ACGCATTATC TTTTGCTTAA ATAGCTTAAT CATATTTTAT GAATAGTTAA 144 0 

AAACAGGTTA ATGTGAATAT CCGAATACAG CTCCTATAAT ATGGGTGTAT GATTCAAATT 1500 

ACGTAATAAA ACAATCTAAT TATAATAGAT TGGAGCATAC AACTATGAAA ATGAAAAATA 1560 

TTGCAAAAAT AAGTTTGTTA TTAGGAATAT TAGCAACAGG TGTAAACACT ACAACGGAAA 162 0 

AACCAGTTCA TGCCGAAAAG AAACCTATTG TAATAAGTGa AAATAGCAAA AAATTAAAAG 1680 

CTTATTATAA TCAACCTAGT ATTGAATATA AAAATGTGAC AGGTTATATC AGTTTCATTC 174 0 

AACCAAGTAT TAAATTTATG AATATCATAG ATGGTAATTC TGTTAATAAT ATTGCTTTAA 1800 

TTGGCAAAGA TAAGCAACAT TATCATACGG GTGTACATCG TAATCTTAAT ATATTTTACG 1860 

TTAATGAGGA TAAGAGATTT GAAGGTGCAA AGTACTCTAT TGGGGGTATC ACGAGTGCAA 1920 

ACGrTaAAGC TGTCGACCTA ATAGCAGAAG CAAGAGTTAT TAAAGAAGAT CATACTGGTG 1980 

AATATGATTA TGACTTTTTC CCATTTAAAA TAGATAAAGA AGCGATGTCA TTGAAAGAGA 204 0 

TTGATTTTAA ATTAAGAAAA TACCTTATTG ATAATTATGG TCTTTACGGT GAAATGAGTA 210 0 

CAGGAAAAAT TACAGTCAAA AAGAAATACT ATGGAAAGTA TACATTTGAA TTGGATAAAA 216 0 

AGTTACAAGA AGACCGTATG TCCGATGTTA TCAATGTCAC AGATATTGAT AGAATTGAAA 222 0 

TCAAAGTTAT AAAAGCATAA CACATATACT TGATGACGAA ATAAGTTGAA ATTGAAATAG 2280 

AGAGGTTAAG TGACGATCAA ACGTTGCTTA ACTTCTTTTT AATGCTTAAA AATTATTTCA 2 34 0 
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TTAATAATAC TTCAATAATT GTTAAAAGGG GTTTAATGTG ATTATCTTAG AACGCCATCT 24 6 0 

ATAATGATGT TGTATGATTC AAATTACGTA AAAAGACAAT CGAATATAAT ATAGATTGGA 2520 

GCATACAATT ATGAAAATGA GAACAATTGC TAAAACCAGT TTAGCACTAG GGCTTTTAAC 2580

AACAGGCGCA ATTACAGTAA CGACGCAATC GGTCAAAGCA GAAAAAATAC AATCAACTAA 264 0 

AGTTGACAAA GTACCAACGC TTAAAGCAGA GCGaTTAGCA ATGATAAACA TAACAGCAGG 2700 

TGCAAATTCA GCGACAACAC AAGCAGCTAA CACAAGACAA GAACGCACGC CTAAACTCGA 2760 

AAAGGCACCA AATACTAATG AGGAAAAAAC CTCAGCTTCC AAAATAGAAA AAATATCACA 282 0 

ACCTAAACAA GAAGAGCAGA AAACGCTTAA TATATCAGCA ACGCCAGCGC CTAAACAAGA 2880

ACAATCACAA ACGACAACCG AATCCACAAC GCCGAAAACT AAAGTGACAA CACCTCCATC 294 0 

AACAAACACG CCACAACCAA TGCAATCTAC TAAATCAGAC ACACCACAAT CTCCAACCAT 3 000 

AAAACAAGCA CAAACAGATA TGACTCCTAA ATATGAAGAT TTAAGAGCGT ATTATACAAA 3 060 

ACCGAGTTTT GAATTTGAAA AGCAGTTTGG ATTTATGCTC AAACCATGGA CGACGGTTAG 3120 

GTTTATGAAT GTTATTCCAA ATAGGTTCAT CTATAAAATA GCTTTAGTTG GAAAAGATGA 318 0 

GAAAAAATAT AAAGATGGAC CTTACGATAA TATCGATGTA TTTATCGTTT TAGAAGACAA 324 0 

TAAATATCAA TTGAAAAAAT ATTCTGTCGG TGGCATCACG AAGACTAATA GTAAAAAAGT 33 00 

TAATCACAAA GTAGAATTAA GCATTACTAA AAAAGATAAT CAAGGTATGA TTTCACGCGA 33 60 

TGTTTCAGAA TACATGATTA CTAAGGAAGA GATTTCCTTG AAAGAGCTTG ATTTTAAATT 342 0 

GAGAAAACAA CTTATTGAAA AACATAATCT TTACGGTAAC ATGGGTTCAG GAACAATCGT 34 80 

TATTAAAATG AAAAACGGTG GGAAATATAC GTTTGAATTA CACAAAAAAC TGCAAGAGCA 3540

TCGTATGGCA GACGTCATAG ATGGCACTAA TATTGATAAC ATTGAAGTGA ATATAAAATA 36 00 

ATCATGACAT TCTCTAAATA GAAGCTGTCA TCGGAAAAAC AAGAAGTTAA GTGACAACGG 36 6 0 

TTTACATGTT GCTTAGCTTC TTTTATTATG CGTAATGATG TAAAAAGACG AATATTCATT 3720 

TGTTTGTAAA AGTGGCATTT CTATGTCTTA AAAGTGACXSA AACTTCAAAT GTGCCAAGTG 3780 

TTGAATCACA TCAAAATCAT TTTTATTTAA CGAACATTAT GGATTTCTTA ATTTACTTAA 3 84 0 

CGATGATTCA AATATAGTTA AACAAGGTTT AATGTGAATG GAGCAATACG CCATCTATAA 3 900 

TAAAGCTGTA TGATTCAATG AATGTAATCG AACAAATCTA ATAATTACGA ATGGAGCATA 3 960 

CAACTATGAA AATAACAACG ATTGCTAAAA CAAGTTTAGC ACTAGGCCTT TTAACAACAG 4020

GTGTAATCAC AACGACAACG CAAGCAGCAA ACGCGACAAC ACTATCTTCC ACTAAAGTGG 4 080 
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CGCCTTCAAC TAAAGTGACA ACACCTCCAT CAACAAACAC GCCACAACCA ATGCAATCTA 426 0 

CTAAATCAGA CACACCACAA TCGCCAACCA CAAAACAAGT ACCAACAGAA ATAAATCCTA 4320 

AATTTAAAGA TTTAAGAGCG TATTATACGA AACCAAGTTT AGAATTTAAA AATGAGATTG 4380

GTATTATTTT AAAAAAATGG ACGACAATAA GATTTATGAA TGTTGTCCCA GATTATTTCA 444 0 

TATATAAAAT TGCTTTAGTT GGTAAAGATG ATAAAAAATA TGGTGAAGGA GTACATAGGA 4 5 00 

ATGTCGATGT ATTTGTCGTT TTAGAAGAAA ATAATTACAA TCTGGAAAAA TATTCTGTCG 4 560 

GTGGTATCAC AAAGAGTAAT AGTAAAAAAG TTGATCACAA AGCAGGAGTA AGAATTACTA 462 0 

AGGAAGATAA TAAAGGTACA ATCTCTCATG ATGTTTCAGA ATTCAAGATT ACTAAAGAAC 4 680 

AGATTTCCTT GAAAGAACTT GATTTTAAAT TGAGAAAACA ACTTATTGAA AAAAATAATC 4 74 0 

TGTACGGTAA CGTTGGTTCA GGTAAAATTG TTATTAAAAT GAAAAACGGT GGAAAGTACA 4 800 

CGTTTGAATT GCACAAAAAA TTAGAAGAAA ATCGCATGGC AGATGTCATA GATGGCACTA 4860

ATATTGATAA CATTGAAGTG AATATAAAAT AATCATGACA TTCTCTAAAT AGAAGCTGTC 4 920 

ATCGGAAAAA CAAGAAGTTA AGTGACAACG GCCTACATGT TGCTTAGCTT CTTTTGTTAT 498 0 

GTTCGATGAT TTGAGAACCC GAATTTTCGA TGGGTCCAAA TATGACGTGG AAGAGACCTG 504 0 

AATTTATCTG TAAATCCCTA TCTATCGGGT GTGAAGCACA ACGGGATCAG TTTTATTTAA 5100 

CGAACATTAT AGATTCCTTA ATTTACTTAA TAATGATTCA ATGATTATTA AACATGGTTT 5160 

AATGTGAAAG GTCAAATACG CTAACTATAA TAAAGCTGTA TGATTCAATA GACGTAAGCG 522 0 

AACAAATCTA ATAATTACGA ATGGAGCATA CAACTATGAA AATGACAGCA ATTGCGAAAG 52 80 
(2) INFORMATION FOR SEQ ID NO: 545:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 886 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 545:

AGTAAAATTG CCGGTATGAT GGACACAAAC GGTGACCTTG GTCAAGGTGA ATTAGCGATT 60 

AATCCACCtA AATCAGATTT gAACGAATTA CCTTGGGCTA CACGTAAAAA TAAACAGCCA 120 

GCTTCATCCG AAAAAgGTTC AAGTGGTCAT CATGGTAATG CAGCAATGCC TCAAACCAmA 180 

TTAGATTATC AAATATCTAT TGATAAGGTC GTTGAACAGG CGCAAAAAGC TGGTATTAAA 24 0 

AAGCCGTTTT CAATCGTATA TCCAAGTGAT AAAAATGGTA CCTTTATTGT ATCTAATACT 3 00 
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GATCAATATA GCGGTAAAAA GCTAGGTACG ATTAAATATG ATGACTACGG TATTATTGCT 420 

AAATGGTTTA CATGGGGCAT TCCGCTTCAC GAAGGTCATT TATTCGGCAT TTTAAATAAA 4 80 

ATCATTAATT TATTTGTATG TATCGCTTTA TTAGTAGCCA TTGGCATGGG GTTTGTCTCT 54 0 

TGGATAAAGC GTACAAAAAA TACTGCAGTA AAAGTACCAC ATCGCGTAAA AAAACCAGCA 600 

TCTATATCAC TCATAATATG TTTAATTGTA TTAGGATTAT TAATGCCATT ATTTOaATTA 660

TCACTTATCC TTGTATTTAT AATTGAATTA ATATTATATA TTAAAGATCG TCGTGCTAAA 720 

CAATAATGCA CTTAAAGTTT TGAACTGACG AAATTTACAA AATGgATTCT CGTCTCTCTA 780 

ATTACtTAAA ACGGGgTtCy AaTAATAAAT CgTACTGaTG GgAAAGTTTT TACTTTTTAt 840 

CTGtCCGAtT TTTTnGAAwT TGAAGATAAA AAAGCATCTA AAACGC 886 
(2) INFORMATION FOR SEQ ID NO: 546: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4336 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 546: 

GGCATTTGTG TCCTTATATA AGGAACTGTG tTAAATACAT TACTGTTGTT AAGTTGTTTT 60 

TGTAATTCAA AGAGCAGAAC AGAGTAACAT CATCAGTTGT AGTAAACGAT AATCCGGTAA 120 

AACAACTAAA TGAAATAATG AAAGTCATTT AACCTGAACA TTAAAATATA TTTGTTTTTC 180 

ATTAAGAATA ATTCAAGTAT ATTTAAATCG AGGTTAATTA TCGTATGAAA CGATGCACGT 24 0 

TATAATAAAA ATGTATGATT CAAATTACGT AATGAAAACA ATCCAATATA TTAAGATTGG 300 

AGCAAATAAA TATGAAATTT ACAGCATTAG CAAAAGCGAC ATTAGCTTTA GGAATTTTAA 3 60 

CAACAGGAAC TTTAACAACA GAAGTTCATT CAGGTCATGC AAAACAAAAT CAAAAGTCAG 4 20 

TAAATAAACA TGACAAGGAA GCATTATACC GATACTACAC TGGAAAGACT ATGGAAATGA 4 80 

AAAATATTAG TGCTTTGAAA CATGGTAAAA ACAACTTACG TTTTAAGTTT AGAGGTATTA 54 0 

AGATTCAAGT TTTACTGCCT GGAAATGATA AAAGTAAATT TCAACAGCGT AGTTATGAGG 6 00 

GGTTAGATGT TTTCTTTGTT CAAGAAAAAA GAGATAAGCA CGATATATTT TATACTGTTG 66 0 

GTGGTGTAAT ACAGAATAAT AAAACATCTG GAGTTGTCAG TGCACCAATA TTAAATATTT 720 

CAAAAGAAAA GGGTGAAGAT GCTTTTGTGA AAGGTTACCC TTATTACATT AAAAAAGAAA 7 80 

: T: AT AA ""A^"' ^ AA.'^AA^'"^ " a a A r, '"'^ ACji A A'-: ^ A '"--"^ a a ^'^-^ A ^ ^A^.'^^ ■ 
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ATAACCTTGA TTTAAGATCT AAATTAAAAT TTAAATATAT GGGGGAAGTC ATAGAAAGCA 96 0 

AACAAATTAA AGATATTGAA GTTAACTTAA AGTAAATCAT TACGAATAAT TAAAAGTAAT 102 0 

TGAAGCGGCT TAACGGTGAA ATGTAAATTG GTGCGCATAG CTTATACAAA AAGGATGCAT 10 8 0 

CAATCGATAT CGTCGTTAAG CCGTTTTGGT TTGTGTGTCA TGAATCCTAT CCCAATCTCC 114 0 

ATAAAGGTAA AATTTCCACC ACCAACATCA AAATTCTCCA CATCGCAACA TAACCAAATG 12 0 0 

TTATAATAAA TCTATTACAC AAAGAGATAA ATTACTTATT CAAAGGCGGA GGAATCACAT 126 0 

GTCTATTACT GAAAAACAAC GTCAGCAACA AGCTGAATTA CATAAAAAAT TATGGTCGAT 13 2 0 

TGCGAATGAT TTAAGAGGGA ATATGGATGC GAGTGAATTC CGTAATTACA TTTTAGGCTT 13 8 0 

GATTTTCTAT CGCTTCTTAT CTGAAAAAGC GGAACAAGAA TATGCAGATG CCTTGTCAGG 144 0 

TGAAGACATC ACGTATCAAG AAGCATGGGC AGACGAAGAA TACCGTGAAG ACTTAAAAGC 15 00 

AGAATTAATT GACCAAGTCG GTTACTTCAT TGAGCCAGAA GATTTATTCA GTGCGATGAT 15 6 0 

TCGTGAAATT GAAACGCAAG ATTTCGATAT CGAACACCTG GCGACGGCAA TTCGTAAAGT 162 0 

TGAAACATCA ACATTAGGTG AAGAAAGTGA AAATGACTTT ATCGGTCTGT TCAGCGATAT 168 0 

GGATTTGAGT TCAACGCGAC TAGGTAACAA TGTCAAAGAA CGTACTGCTT TAATCTCTAA 174 0 

AGTCATGGTT AATCTTGACG ACTTACCATT CGTTCACAGT GACATGGAAA TTGATATGTT 18 00 

AGGTGATGCA TATGAATTCC TAATTGGGCG CTTTGCGGCG ACAgCGGGTA AAAAAGCAGG 186 0 

CGAGTTCTAT ACACCACAAC AAGTATCTAA GATACTGGCG AAGATTGTCA CAGACGGTAA 192 0 

AGATAAATTA CGTCACGTGT ATGACCCAAC ATGTGGTTCA GGTTCACTGT TGTTACGTGT 198 0 

TGGTAAAGAA ACACAAGTGT ATCGTTATTT CGGTCAAGAA CGTAACAATA CTACATACAA 2040

CTTAGCACGC ATGAATATGT TATTACATGA TGTGCGTTAT GAGAACTTCG ATATCCGTAA 210 0 

TGATGACACA TTGGAAAACC CAGCCTTTTT AGGCAATACA TTTGATGCGO TTATTGCGAA 216 0 

CCCACCGTAT AGTGCGAAAT GGACTGCAGA TTCAAAGTTT GAAAATGACG AACGATTCAG 222 0 

TGGTTACGGC AAACTTGCGC CTAAGTCTAA AGCAGACTTT GCCTTTATTC AACACATGGT 228 0 

ACATTACCTA GACGATGAAG GTACCATGGC CGTTGTACTC CCACATGGTG TATTATTCCG 234 0 

AGGTGCTGCA GAAGGTGTCA TTCGTCGTTA TTTAATTGAA GAAAAGAACT ACTTAGAAGC 24 0 0 

TGTGATTGGT TTGCCAGCGA ATATTTTCTA TGGGACAAGT ATTCCAACAT GTATTTTAGT 24 6 0 

ATTTAAAAAA TGTCGCCAAC AAGACGACAA CGTACTATTT ATCGATGCAT CCAATGATTT 2520

TGAAAAAGGA AAAAATCAAA ATCATTTAAG CGATGCCCAA GTCGAACGTA TTATAGACAC 2 58 0 

ATATAAGCGT AAGGAAACAA TTGATAAATA TAGCTACAGC GCGACACTAC AAGAGATTGC 264 0 
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GATTGATTTA 


GATCAAGTCC AACAAGATTT GAAAAATATC GATAAAGAAA TCGCAGAAAT 2760

TGAGCAAGAA ATCAATGCAT ACCTGAAAGA ACTTGGGGTG TTGAAAGATG AGTAATACAC 2820

AAAAGAAAAA TGTGCCAGAA TTGAGGTTCC CAGGGTTTGA AGGCGAATGG GAAGAGAAGC 2880

AGTTAGGGGA TCTTACAGAT AGAGTAATTA GGAAAAATAA AAACTTAGAA TCGAAAAAGL 2940

CTTTAACAAT ATCCGGACAG TTAGGTTTAA TTGATCAA-AC AGAATATTTT AGTAAATCAG 3000

TTTCGTCGAA AAATCTAGAA AATTATACAC TAATAAAGAA TGGAGAATTC GCGTATAACA 3060

AAAGTTATTC TAATGGATAC CCATTAGGGG CTATTAAAAG ATTAACTAGA TATGATAGTG 3120

GTGTATTGTC CTCTTTGTAT ATTTGTTTTT CTATTAAAAG TGAAATGTCT AAAGACTTCA 3180

TGGAAGCATA TTTTGATTCG ACACACTGGT ATAGAGAAGT TTCTGGAATT GCAGTTGAGG 3240

GTGCAAGAAA TCACGGATTA TTAAATGTTT CTGTGAATGA 'i'i'ri"l"l''rACT ATTCTAATTA 3300

AATATCCAAG TTTAGAAGAA CAGCAAAAAA TAGGCAAGTT CTTCAGCAAA CTCGACCGAC 3360

AAATTGAATT AGAAGAACAA AAGCTTGAAT TACTTCAACA ACAGAAAAAA GGCTATATGC 3420

AGAAAATTTT CTCACAGGAA CTGCGATTCA AAGATGAGAA TGGTGAAGAT TATCCAGATT 3480

GGGAAAATAG CAAAATAGAA AAATATTTAA AAGAGAGAAA CGAACGTTCT GACAAAGGGC 3540

AAATGCTTTC AGTAACTATA AATAGTGGCA TTATAAAATT TAGTGAATTG GATAGAAAAG 3600

ATAATTCAAG TAAAGATAAA AGTAATTATA AAGTAGTTAG GAAAAATGAT ATTGCATATA 3660

ATTCTATGAG AATGTGGCAA GGGGCTAGTG GTAAATCAAA TTATAATGGG ATTGTTAGCC 3720

CTGCATATAC TGTGCTTTAT CCAACACAAA ATACTAGCTC ATTATTTATT GGATATAAGT 3780

TTAAAACACA TAGAATGATT CATAAATTTA AAATTAATTC ACAAGGATTA ACATCAGATA 3840

CATGGAACTT AAAATATAAA CAATTAmAAA ATATAAATAT AGATATACCT GTATTGGAGG 3900

AACAAGAAAA GATAGGTGAT TTCTTTAAAA AAATGGATAT ATTGATAAGT AAACAGAAAA 3960

TGAAAATTGA AATATTAGAA AAAGAGA/^C AATCcii11i ACAAAAAATG 1TCTTATAAC 4020

TTTGATAAAT ACATAGATTG CATAAGAATA AAATTTGTAT AATTTAACAT 4080

AAAGTAAAGT GAATTAAAAA CGAACATTAA ATTTAGGCAC TGTGAAAGCG CAGTGTCTTT 4140

TTTGTGTCGA AATTGTGTAC AGAATAAGTA GTTAAATAAA GATTAAGTTG AGATAAAGTG 4200

TTATTCGTAA ATAAAAGAGA GTAGATCGAT AGGAATTGAA TGATATTAGT TAACTATTTA 4260

TTAAATTACT TAATAATGAT TAATTTTTAG TTAAAGTAAG TTTAATGTGA AGCACGACCA 4320

TTGCTCATTA TAATGA 4336



EP0 786 519 A2 



(2) INFORMATION FOR SEQ ID NO: 547:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 487 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 547: 



TAAGCTATCT GAGATAATTG CTGATAACAT TAAACCGGCA ATTTCAGGTT TAATTTCAAA 60

GCCACGTTCT CTAAACATTT TGTATAAAAT TGTAGCTGTA CAACCAACTG GTTCAGCACG 120

ATAACATAAA GGACCAGCAG TTTCGAAATT TGCAATTCTG TGATGATCAA TTACATGCTT 180

AATTGTAGCA GAGGCAATCG TATCAGAACT TTGTTGOsU^T TCGTTATGAT CAACTAAGAT 240

AACATCTTGA CCATCTAAAT CATCTGTTAA TAATTCCGGA GCAGGTACAT TAAATGTATC 300

TAACGCGAAT TGAGTTTCTG CACTCACATC ACCTAAACGG TATGCTTTGG CTCCTGAATT 360

ACCTCGAAGT TGTTCaAATT CTGCCaTAAT AATCGCAGAT GAAATTGCAT CAGTGkCTGG 420

aTTCTTATGT CCGAAAATAT ATGTTTTAGC CAnTGTCAAA TATCTCCCTT GTAAATTGTA 480

TTCTTTA 487


(2) INFORMATION FOR SEQ ID NO: 548: 








(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 871 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 








(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 548:






TTGGTGGTGC AGCAGTTATA GCAATTGTTT TAGCATTCAT TGGTAAGTTC ACTGCATTAA 60

TTTCTTCTAT ACCTACmCCA GTTATGGGAG GAGTATCTAT ATTACTTTTC GGTATTATTG 120

CAGCAAGTGG CTTAAGAATG TTAGTTGAAA GCAAAGTAGA TTTTGCGAAC AATCGAAATT 180

TAGTTATAGC TTCTGTAATT TTAGTTGTAG GTATCGGTAA TTTAGTATTT AACTTAAAAG 240

AAATTGGTAT CAACCTTCAA ATTGAGGGGA TGGCATTAGC TGCACTTTCA GGAATTATTT 300

TGAACTTAAT CTTACCTAAA GAGAAAAAAC AAAACAATTA AGATTTACAA ATTAAGGAGG 360

GCGCTTTTAT GAATCATTTA TTATCAATGG AACATTTATC TACAGATCAA ATATACAAAC 420

TTATCCAAAA GGCAAGTCAA TTTAAATCTG GTGAACGTCA ACTACCAAAC TTTGAAGGGA 480

AaTATGTCGC AAATTTATTC TTTGAAAATT CTACTCGrAC AAAATGTAGT TTTGAAATGG 540

CAGAACTTAA GCTAGGGTTA AAAACGATTA GCTTTGAAAC ATCAACATCA TCTGTTTCAA 600
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TCATTAGACA TCCGTTTAAT AACTACTATG AAAAATTAGC GAATATTAAC ATCCCAATTG 720 

CGAATGCTGG TGATGGTAGT GGACAACATC CAACACAAAG TTTACTTGAT TTAATGACGA 7 80 

TATATGAAGA ATATGGATAT TTTGAAGGCT TGAATGTATT GATTTGTGGA GAcATTaAAA 84 0 

ATTCACGTGT CGCACtAGTA ATTACCAnAG T 871 



(2) INFORMATION FOR SEQ ID NO: 549: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 400 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 549: 

TCAATCTAAT ATATACTTCA TGACTTCCCG CCAAAGGCGC AATCGTAGGA TTAGTCTGTT 60

TATCGATAAG ATCTATTAAT ATTGTTTCTA CTTTAGATTC ACCTATTCCC GCAAATCTTA 120

ATAGTTCAGA ATGTATAATT CGATTATGGT TTATAAAATG TGACAACAAT TCATTTTTCA 180

CCATTGGTTG CATTTCTTTC GGTGGACCTG GTAATAAAAT AATTTGTTTG TTTTCAAAAT 240

TCACCATCAT TCCTGGAGCC ATGCCATGAT GATTTGTTAA TACAGTTGAA CCTTCAATTA 300

CTAAAGCCTG TTGTcTATTA TTAGGTGTCA TTTCTtGTCC TTGTTCCTCA aAAwAGCTTT 360

CAATATATTG AAAGAAGGCT CATCAATAAC TAAATCTnTA 400
(2) INFORMATION FOR SEQ ID NO: 550: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1523 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 550: 
ATACCTCCAG CTAGAATACC AGCGTATTTC ATAAAATACT TCCTCCATTC AACTATATCT 6 0 

ATATTTAATT ATTTAAATTT CGTTGCATTT TCCAATTGAA AACTCATTTT AAAATCAAAA 12 0 

CTCTAAATGT CTGTGTATTA CTTAAAATTA TACATATTTT GCTTATATTT TAGCATATTT 18 0 

TGTTTAAACC TATATTACAT TATATCAGAC GTTTTCATAC ACAAATAATA ACATACAAGC 24 0 

AAACATTTCG TTTATTATTT ATATCACTTA ACTAATTAAT TTATAATTTT TTATTGTTTT 3 00 

T;!JiLnTTATCA CTTAAAAATr* GT^^aGCRPJK TTCGTTGTGA CGCTTGTCCA TCTTCTAATH 360
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TTTGTTTTAA TGCATCAATG AGTGCTGTTT GATTTTCAAC AATTGGACCT GGCAACTCTT 4 80 

TTTTATAATC CATGTAAAAA CCTCTAAGCT CATCGCCATA TTTATCTAAG TCATATGCAT 54 0 

AGAAAATTTG CGGACGCTTT AATACACCGA AGTCGAACAT GACAGATGAG TAGTCGGTAA 6 00 

CTAACGCATC GCTGATTAAG TATAAATCCG AAATGTCTTC ATAAtCTGAA ACGTCTTTCA 66 0 

CAAAATCATC ATGTTCATCA ATACGTGTCA CAACTAAATA ATGCATGCGT AAkAAAATAA 72 0 

CATArTCATC ATCCAGCGCT TGACGCAAAG CTTCTATATC AAAGTTAACA TTAAATTGAT 7 80 

ATGAACCTTC TCGAATCGCT TCATCGTCAC GCCAAGTTGG CGCGTACATA ATCACTTTTT 84 0 

TATCTAATGG AATATTTAAT CTTGTCTTAA TACCATTAAT ATATTCAGTA TCATTGCGTT 900

TATGTGATAA TTTATCATTT CTTGGATAAC CTGTTTCCAA AATCTTATCT CGACTAACAT 960 

GAAATGCATT TTGAAATATC GATGTCGAAT ATGGATTAGG TGACACTAGA TAATCCCACC 1020 

GTTGGCTTTC TTTTTTAAAG CCATCTTGGT AATTTTGAGT ATTTGTTCCT AGCATTTTAA 10 8 0 

CGTTACTAAT ATCCAAACCA ATCTTTTTTA ATGGCGTGCC ATGCCATGTT TGTAAGTACG 114 0 

TCGTTCGCGG TGATTTATAT AACCAATCTG GTGTACGTGT GTTAATCATC CwCGCTTTCG 1200 

CTCTTGGCAT CGCTAAAAAC CATTTCATTG AAAACTTTGT AACATATGGT ACATTGTGCT 126 0 

GTTGGAATAT GTGTTCATAT CCTTTTTTCA CACCCCATAT TAATTGGGCA TCGCTATGTT 1320 

CAGTTAAGTA TTCATATAAT GCTTTGGGGT TGTCGCTGTA TTGTTTACCA TGAAAGCTTT 1380

CAAAATAAAT TAGATTCTTG TTTGGCAATT TTGATAGTAA TTTAAAAGTC GTATATATAC 144 0 

TATGTTCTAT CAATTTTTTA ATTGTATTTT TAATCATGTC GTACCTCCGA CGTGTTTTTG 1500 

TAATTATATT AATATGTATG AGC 1523
(2) INFORMATION FOR SEQ ID NO: 551: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4923 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 551: 

CAGTAAGAGA TTTTCTTAAT TGAAAATAAT CTTACTGCTT TTTTAAATTT AATTTCGAGA 6 0 

TTCAATATTA GTTTATCTCA TTGTGGCATT AATTGATAAA ATTGTTTTAA TGTTATAAAT 120 

CAAAGTCtTC TTCAACAATT TCAATGTCTG CATCAGATCT ATGATATGTG AAAAAGCTAA 18 0 

TTCTTATGCG GTCTAAATGC TCTAAATGGT GTCGATACTC TTCGATGGCT GCAACAATTT 24 0 
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5 



ATGTGGACAA TAAATCTlTi' TTCTGCGGAT TATCTAACTC ATAATCAACA TGTGTCACAT 360

TATAACGTGC 'iTrrrTAGAA AGGCTAGCTA AAATTTGCTC GTGAAAAGCA GTTAATGAAT 420

CTAAATCTAA TTTGATTTGT AATAGGAAAT TGTTATTAAG TAAATATAAG TCGTTTTGAT 480

AACGCGACAA TTTGTTTAAT ACTTCATAAG CTTGTCTAGT CGTCTGAACT ACTTCTv-iUA 540

AAAGTATTTT CTTTCTATTC TGTTGGTGAA TATCTTTTTT TGTAATAGGA LGiIClIv-VjL 600

TATAGTAATC ATAAATTTTC TCTAAu-i-iTi' CGACACGTTG TTTTAAATTA TGACTATLii 660

GTTTAATATT ATTAAACTCC GTCGTATCAT TTAATACTAA TTTAAACCAC ATAAAAATAT 720

CTGAGGATAT ATTTAATGAA TTATAGTAAA TTTTTGTTTC AAATTTAGGT GGTAGAAACA 780

CAAAGTTAAC TAGAGATGAA CTTATGACAC CAATCATTAC AAGTACAAAC CTGTAAAAGO 840

CGGTAATATA GAAAGAACCG GTATGTTGTC CCATAATGAT TAATGCTGTT ACACTCGCCA 900

AAGTAGCAAC ATGTGCTAAA TTAAATTTAA ATAAAATAGC AATAAGTACT ATGACGGTAA 960

CACCCATAAT GATAAAATTA TCACTAAAAA TTGTTACCAT TGTAACAGAT AGTATGGCAC 1020

CTATAATGTT ACCCAATGCT TGATCAGAAA CTGTTTTAAA TGAACGATAA ATACTAGGTT 1080

GCATTGCACA ACAGCACTGA CACCAGCTAA GGCTTTCAGA CCAACATCAT CCGGTAGTAA 1140

AGAAGCGATA GACATAGCTA AAATAATGGC TATACCAGTT TTAAAAATCC GAGCTCCTAG 1200

TCTCAAAAAT AATGACGCCC CTTTTAAGTT TATTGAATAT CTAATATTCG TATTCATTAC 1260

TGTTATACAC TTACTAGTTA CAAAATTCAA GCTTATTTAT AGTTGTTAAA ATAAATCATA 1320

CATAATACTG ATAGCGATGT AAAACTTTAG TCAGAGATTA AAATAGTATA AATTTGTAAA 1380

ATAAAAACTC ACATAGTGAC ATATCAAGTT AAACGTTAAT AGTTAACGAT ATAAAAltxAA 1440

TCTACTATGT GAGCATTTGC TTTATTTTAA TTCAATTAAA AATATACTTC L11AAAACj11 1500

ATTTCATTTG ACTAAAAGCA TAGTCTGCAG CTTTTAAAGT TTGTTTAATA TCTTCTTCTG 1560

TATGTTCAGT TGTTAAGAAC CAAGCTTCAA ACTTAQAAGG TGCTAAATTG ATACCTTGAT 1620

TTAACATTAA TTTGAAAAAT TTACCGAACG CTTCGCCGTC AGAATGTTCA ACTTGATCAT 1680

AATGTGTGAC TTTTTCATCT GTAAAGTACA ATGTTAAAGA TCCATAAATA LCjATTAATTb 1740

TAGCTGTGAT ATTATGTTTT TCGATTAATT TAAGTAAACC TTCTTCTAGT IvjIICjOCLIa 1800

AGCTGTCTAA TTTTTCATAA ACACCGTCTT GTTCTAGTAC TTCGACjIAA1 oIVTVT*TVr^r^T^r* 1860

CTTTCATAGA TAACGGGTTA CCAGCCATTG TACCAGCTTG ATATGCAGGT CCTAGAGGTG 1920

CTACTTGTTC CATAATATCT TGACGTCCAC CATAGCCTCC AATTGGTAAA CCACCGCCAA 1980

CAATT'-TACC AAATGCAGTT AAATCAGGGA TAACACCTAA TAAATCTTGA GCGGCACCGT 2040
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AAATTTCATT AACCTCTTCT AAAAATCCAG GTTGAGGCAT TACCATTCCA AAGTTACCAA 216 0 

CAATTGGTTC TACTAATACT GCGGCAATTT CATCACCCCA AAATTCAATT GCTTCTTTAT 222 0 

AGGCGTTAAT ATCATTGAAA GGTACAGTAA TGACTTCACG TGCGACGCTT TCTGGAACAC 22 80 

CAGCTGAGTC TGGAGAACCG AGCTGAGATG GGCCGCTACC TGCTGCAACC AATACTAAAT 234 0 

CAGAATGGCC ATGATAAGAT CCAGCAAATT TTATAATTTT ATTTCTTTTA GTATATGCAC 24 00 

GTGCAACACG AATTGTTGTC ATGACTGCTT CTGTTCCAGA ATTTACAAAG CGAATTTTCT 24 6 0 

CAAGAGATGG AATTGCATCA CGTAATTTTT TGCTGAATTC AATTTCTAAT TCAGTCGGTG 252 0 

TACCAAATAA AACACCTTTA GCAGCTTGTT CTTGAATTGC TTTAGTAATA TGAGGATGTG 2580

CATGCCCCGT AATAATTGGA CCGTATGCTT GAAGGTAATC AATAAATTTA TTGCCATCGA 264 0 

CATCATATAA ATATGCACCG TGTCCTTCTT TCATAACAAC AGGTGCACCG CCTCCTACAG 2700 

CTTTATAAGA ACGAGAAGGG GAATTGACAC CGCCTAGAAT ATATTCGTTT GAAAGTTGTT 2760

GTAAACGTTC ACTTTCACTA AAATTCATTT ATATCAACCT CTTTTAATTT AATATTTTCA 282 0 

TCTAATATCG TATCATAAAA TTATTATAAT GAAGAAAAAG GTGATTATAT GTTGCAAAAA 2 88 0 

GGAGAACAAT TTCCAATATT TAAATTAGAA AATCAAGACG GAACTGTCAT TACAAATGAT 294 0 

ACATTAAAAG GTAAAAAGGC GATTATATAT TTTTATCCTA GAGATAATAC ACCTACTTGT 3 000 

ACCACAGAAG CTTGTGACTT TAGAGACAAT TTAGAAATGT TCAATGATTT AGATGTTGCA 3060 

GTATATGGTA TAAGCGGTGA TTCAAAGAAA AAACACCAAA ATTTTATTGA GAAACACGGA 312 0 

TTGAATTTCG ATTTATTAGT AGATGAAGAT TTTAAATTAG CTAAAGAAAt GGCGTATATC 318 0 

AGTTAAAAAA ATCATTTGGC AAAGAAAGTA TGGGCATTGT AAGAACGACT TTTATAATAG 3240

ATGAACAAGG TAAAGTATTA GATGTTATCG AGAAGGTTAA GGTTAAAACA CAAATAGAAG 3300 

AACTTAAAAA CATTTTGGGG TGACATATAT GAAAGTTGTT GGGTTAAATC GTATGCGTGA 3360 

AGTTGAAACT GAATTACAAC AACGCTTTTC AGATTTAGAT TTTAAATTTT ATAAAAAAGC 342 0 

ATCAGAAATA CCTGAGAGCG ACTTGGCTGA TTTAGATATA TTAGTTGGTT ATGATGGCGG 34 80 

TATCAATGAG GCATTTTTAC GACGTTGCCC GAATTTAAAA TGGATTGCAT GGTTTGCAAC 3 54 0 

GGGTGTAAAT ACATTGCCGT TAGATTATAT TGCAGATCAC GGCATACTTT TAACTAATGG 3 6 00 

AAAAGGTGTT CAAGCTAAAC AATTATCTGA ATACATTTTA GCTTTCATTT TAGATGATTA 3 660 

TAAAAAGATG AAACTATCAT ATGATAACCA ACGACAACAT ATATATGATT CGAAAATAAC 3 72 0 

TGGTAAACGC CTATCAGGAC AAACAGTTTT ATTTTTAGGT ACAGGTGCAA TTGCTACTAG 3 780 

AACTGCGAAG TTAGCAAAGG CTTTTAATAT GAATTTAATT GGTCTGAGCA AGTCAGGTCA 3 84 0 
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TGCTGACATT ATTATAAATG CTTTACCAGA AACGCAAGAA ACGATTCaTT TaCTAAAGAA 3 960 

AAAACATTTT GAATTAATGA AAGATGAAGC ACmTTATA AATATAGGAC GAGGTAGCAT 4 020 

AGTTAAAGAA GCGCTCTTAA TAGAAGTATT AAAAAGTAAA GTTATTCGAC ATGCATATTT 4080

AGATGTGTTT GAAAATGAAC CTTTGAAACC TAATCATGAA TTATATGAAT TGGATAATGT 414 0 

AACTATAACA GCGCATATAA CTGGTAATGA TTATGAAGCA AAGTATGACT TATTAGATAT 4 2 00 

TTTTAAAAAC AATCTAGTTA ATTTTCTCAA TAAGAATGGT CTAATTGAGA ATGAAGTTGa 4 260 

TGCTAAAAAA GGCTATTAAA TGArATCATC ATGTAAATAT TGACACGCGC GCAATACTAC 4320 

AGTTATATTT aTAGTAAgTt AATaATgATT ATATAAGAaA GATGGTgATA TAGATGAGTG 4 380 

TTGAAATAGA ATCAATTGAA CATGAACTAG AAGAATCAAT TGCATCATTG CGACAAGcAG 444 0 

GCGTAAGAAT TACACCTCAA AGACAAGCAA TATTACGTTa TTTaATTTCT TCACATACTs 4 500 

ATCCAACAsC TGaTGaAATT TATCAAGCAC TTTCACCTGa TTTTCCAAAT ATAAGTGTTG 4560

CGAC7UVTATA TAATAACTTA AGAGTGTTTA AAGATATTGG AATTGTAAAA GAATTAACAT 4 620 

ATGGAGACTC ATCAAGTCGA TTCGACTTTa ATACACATAA TCATTATCAT ATTATATGTG 4 680 

AACAATGTGG TAAGATTGTT GATTTTCAAT ATCCACAGTT AAATGAAATT GAAAGATTAG 4 74 0 

CTCAGCATAT GACTGACTTT GACGTAACAC ATCATCGAAT GGAAATTTAT GGAGTTTGTA 4 800 

AAGAATGCCA AGATAAATAA TTTAACTTTG GTAGTATGAC AAATTAAAAA AGCGTTACTw 4 86 0 

ACTTCATATA AGTAAGCGTA ATATTTAAGA nGTTAAACGA CATGaAAGTt GTTTAACTTT 4 920 

TTT 4923 

(2) INFORMATION FOR SEQ ID NO: 552: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 917 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 552:

TCCGGCTTTA AAAACTTTTC CCAATTCCAG CTTGGGCCTT TGGCATTAAT ATTAACCTCC 6 0 

TGGTTCGGAT TAATTGGGAA CCTAACCnTT TAGGCAATAA TTGGTTTAGG CAACTTCCAA 12 0 

ATTGGTGGTT CAACCAACGT CTTTGGATAC CyTGcTCATT TAATTCTAAA ATGgTyrGAA 180 

CGCATTTTGG TACCCAAAAC GgTGACGTTC GTTTGCACGG TCTAATAAAT TGTCTAAGTT 24 0 
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GCGATCCATT GTTTGTTCGA AAAGAATAGC ACCCAAAATT TTATCTGGTG AGAATGAAGG 42 0 

TGAAGTTACC ACACGTGTAC GCATATCGTG AACAAGTTGG AACATTTCGT CTTCATTGCT 480 

ATATTGATCT TCGTTTACAC CATATTCTTT AAGTGCTTTT GGTGTACTAC CACCACTTTG 54 0 

GTCTAATGCG GCAATAAAGC CTTTTCCATT TTTCATTTTT TCTAATTGCT CTTTATTCAT 600 

ACTTTCCACT CCTTAACTTT TCAATACACC TCCAGTATGA TAAAAATGAG AACATTTCTC 66 0 

AAGTCATAAA CCTTGAAAAG TGTATAAAAT GTGAAAAATA ATTGTCAGTT TAATTAAAAA 72 0 

TATTATTTTA TTCTAGGTAT GACTAACGCC ATTAATGACA TAAAGAAAAT ATGTGTAATA 780 

ATCCAACCGA TTAATTCTGT CACACTAAAT TGAAAAATTG GACGTTGCGC AATAAATACT 840

AAAAAGGGAT ACAATGCTAT AAATAAGAAA AATAAAGGGA TATAACATAG ATAGTAAAGC 900 

CTTTTAGAAG TATGAAA 917 
(2) INFORMATION FOR SEQ ID NO: 553: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 432 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 553: 

ACTCCCATAT CCACTGGCAA GATTTATAAT TTGTAATTCT AAAGTATAAT AGCCTATATT 60 

TTTCAAAATC TCTTTTTTTA TGATTAACAT CCCTCTAAGT GCACCTAAAG GTTTTTATGT 12 0 

AATGTATTAA ATATTTCCAT TTATACATAT ACATCTTGTA TAAAAAGAAA GAACTCCATA 180 

TATACTCAAA AGGTATACTG AAGCTCTATA TATTATATAA AATTATTATA CTATTTTGAA 24 0 

AAACATAAGT AAATCACTAA ACATGATTTC TTTCACTTTT AGAAAACTTT TAATACTATA 3 00 

AAAGCACCCA CTCAGTCACT AGTTTGGGCA GTTATTGTAT GCCTATTGaA CTCAATGCgT 36 0 

ATATTACAAT ACCTTTTtCG CATATTCATA TAAGacTTTG CATCTTTAAG CTTAATTGCT 42 0 

ATCTCTTTCT CT 43 2 
(2) INFORMATION FOR SEQ ID NO: 554:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1374 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
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nTTGCTTTA TGTTAAATGG ATTATTaTTA GAATAAAAAA TCGGTGATGA GCTAAAAAAG 6 0 

TGTGTAGGAT GTTTTCCkAA CCCAATTTTT ACATCCGAAG ATATCGAACA ATATCTTCCT 120 

ACTTCTACAT TATTAAAATC ACTACCAAAT CCAATATAAC TGTATTCACC AATGTGAGAA 180

TTCCTGATTT TACACCATCT ATCTATATAG TTATTGCCAT CAAATTTTGA GTTTGTAATA 24 0 

TACGCCAAGC GATGAATCTT PlACP.TTCGAT TCTTTAGAGG ACTGGTTTTT CAGCA_AACCA 3 00 

ATTATCTTTT CAATCGCTAT CCTCATCGTC ATTTCCTCCA AGTATTTCGA TTGATAATAT 3 60 

CTTTATAGCT TTGAATAATT TTAACTACCT TTGTCGAAAC GTTAGTGTCT TTATAATCAA 42 0 

TAGCATCAAT CATCGGTTCG TTATTGTTTT GCATCTCTCT TGCTAGTTCA ACGGATTGGA 4 90 

TTAGATTGTT ATAGGTAATA CCACCTACAA TAACCGTACC TTTATCTAGT ACTTCCGGTC 54 0 

TTTCTGTGGA AGTTCGAATA AGGACACCAG GGAACTTCAA AATAGACGAC TCTTCTGACA 600 

ATGTTCCACT ATCTGATAGC ACAACAAATG CATCTTTTTG CAATGCATTA TAATCAAAGA 660

AACCAAATGG CTTTAACTGT TTAACTAATG GATCAAATTC AAATTTACTT TCTTCAATrr 720

TCTTCCAACT TCTTGGATGC GTTGAATAAA TCACAGGCAT TTTATACTTT TTGGCAATAT 7 80 

CATTTATCGC ATTCATTAAT GATTTAAAAT TCTTTTCATT ATCGATATTC TCTTCTCTAT 840 

GCGCAGaTAC TAAAATGTAT TGTTGCGGTT CTAATCCTAG TTTATTTAAA ACGTCACTGT 900 

GATTAATTTT ATCTCGATGC GCTTCTATCA CTTCTGTCAT CGGTGATCCT GTyACAAAGA 96 0 

TATTCGCTTT ATTGAAGCCT TCATCTAATA AATAACGTCT GCTATGTTCC GTATAAGGTA 1020 

GATTCACATC ACTGACATGG TCAACAATTT TACGATTGAT TTCTTCAGGT ACATTCTGAT 1080 

CAAAGCATCT ATTACCCGCT TCCATGTGGA ACACAGGAAT CTTTAATCGT TTAGCAGATA 114 0 

CTGCTGCTAA ACAACTATTT GTATCACCAA GAATTAAAAG TGCATCTGGT TGTTCGCGTA 12 00 

ATAAAACATC ATATGTCTTC GCAATAATAT TCCCCAtCGT TTCTCCAnGt TACTTCCAAC 126 0 

TGCCTCTAAG TAGTGGTCCG GTTGTCTTAA TTCCAAATCA TCAAAGAAAA TTTGATTCAA 1320

TGTATAATCA TAATTTTGAC CAGTGTGTAC TAATATCTGA TTAnAAATAT TGAT 1374 
(2) INFORMATION FOR SEQ ID NO: 555: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1472 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear 

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TTACCTCTAG GCATGTCCCT TTCACGGTTT GCTTATGATA ACGTTATCGA CATTGTTAAA 120 

TTGTCCTTTT TGAAAAAATA ATTCTGGTGA GATAGACGCT ACTTGGaTTT CaTCaGTTTG 180 

TAAtAACGCA GtATAAtTAC CaTTACTAAA tTGaGTTAAT CGTTCaTATA AAGTACTAAT 240

AGGATAATAA ATGTTATCCG TTAAgCGCGC CGTATAGTTC ACTTGATACG TTTCGCCTTC 300 

AACAATTGCT TGTTGGACAC GTTTAATATT AGTCATCATA ACCTCAGAAG ATTCAACAAA 3 60 

TGAAAAATGA TACTTTGATA CATAAGAAGT TTGATGTTCA TATGTTGAAT TTATGCTTTC 42 0 

CGCTTTTTCA AAACTATAAG CTGCTGCATA AATATCATCT TTAGCTAATG AATGTGTACA 4 80 

CATAGCATGA TTAAAATACT TTGCCGCTTC GTAACTTAAA TATAACGAGA CATATCTACC 540

TTGTCGTkGt GCGCTTGTGC AAAGTGTATC ACTTCTCCCA CATCAGCCAA CTTAGTAGCA 600 

ACATACTTCT TTATAAATCC CTTTAATTGA ATATGGTATT GCTTATATTC ATTTTCAGTT 66 0 

AAATAGTAGC GATAATTATA TTCTATTCTC ACAGTAATCA CCTACCTTCG ATAAAAATAA 720

TTCAACTTGT CGATAACCGT ATTCACTCAA AATAGATTCA GGATGATATT GCACACCAAA 780 

AACCGGAAAT CTAATATGCT CAAATGCCAT AATAATCGCT TCATCGTTTT TTGCTGTAAT 84 0 

CTTTAAGCAA TTTGGAAAAG TCGCTCCGTC AGCAATTAAT GAATGATAAC GCATTACATT 900 

GAAATTTTGA GGCAGTCCTT GAAAAATACC TTCATTGGTA TGGCGTAACT GTGTAGTATG 96 0 

TCCGTGTACA GGATGATAGC CGTGAATGAT ATTTCCACCA AAATAAGACA CGATACATTG 102 0 

AAATCCTAAA CATACACCTA GTATAGGTAC ACGCTGATAA AATTGTTCTA ACACTTCATT 1080 

CAAGATAGGA TAATCATCCG GaTTACCCGG CCCAGGCGAA ATAACAATTG CTTTTGGCTT 1140 

CATATTAATG ACGTCTTCTA TCAGCAGATT ATCAATACCA ACAACTTGAA CTGTTAGTTT 1200 

CGTTTGAGTC TTAATATAGT CTATTAAATT ATATGTAAAT GaATCATTAT TATCTATGAC 1260 

TAGAATCATT GTATACTCCG TTCTAAATGT GTTTTATTTT TATAATATGT ATTGGATGTA 1320 

GCTAAAACTT TAAAAGCATT GTCATTATCC TGACCTTGAT TTTAACTAAT ATATGGTATA 1380

TTCTATTCAT CGTACATAAA TGAATATCAG AGGTTCCTAG CTGAAACCCT CTATAAAAAA 144 0 

CTAGGCCATT GAAATTTCAA ACATTCGTTG GG 1472

(2) INFORMATION FOR SEQ ID NO: 556: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1054 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear
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AGAACAGCAA GGATTACTTA CTGAGGAATT AAAGAAGGAT ATTTTAAAAC AGAACAAATT 60 

ACAACGTGTT GAAGACCTAT ATAGGCCTTT TAAACAAAAG AAAAAGACAA GGGCAACTGA 120 

GGCGAAACGT AAAGGGTTAG AGCCATTAGC GATATGGATG AAGGCACGTA AACATGAAGT 180

CTCAATTGAA GAAAAAGCAC AACAATTTAT AAATGAAGAA GTGCAATCGG TTGAAGATGC 24 0 

TATCAAAGGT GCACAAGATA TTATTGCGGA ACAAATT7CA GATAATCCTA AATATAGAAC 3 00 

AAAAATTITA AAAGATATGT ATCATCAAGG TGTGTTAACT ACATCTAAAA AGAAAAATGC 360 

TGAAGATGAA AAAGGTATTT TTGAAATGTA CTATGCATAT AGTGAGCCAA TTAAACGCAT 4 20 

TGCTAATCAT AGAGTTTTAG CTGTTAATCG TGGTGAAAAA GAGAAAGTAT TATCTGTAAA 480 

GTTTGAATTC GATACGACAT CAGTAGAGGA TTTCATTGCA CGTCAAGAAA TCAATCATAA 540 

TAATGTAAAT CGCAGTTATA TTTTAGAGGC GATTAAAGAT AGCTTGAAAC GCTTAATTGT 600 

CCCTTCGATA GAGCGTGAAA TCCATGCTGA TTTAACTGAA AAAGCTGAAA ATCATGCAAT 660

AGATGTTTTT AGTGAAAACT TAAGAAATCT ATTACTGCAA CCTCCAATGA AAGGTAAACA 720 

AATATTAGGC GTAGATCCAG CATTTAGAAC AGGTTGTAAA TTAGCAGTCA TTAACCCATT 780 

CGGTACTTTT ATAGCAAAAG GTGTGATTTA TCCGCATCCA CCAGTTTCTA AAAAAGAGGC 840

AGCAGAGAAG GATTTTGTAC AAATGGTTAA AQCGTATGAT GTGCAATTAA TTGCAATTGG 900 

CAATGGTACT GCAAGTCGTG AAACAGAACA ATTTGTTGCA GATTTAATTA AAAAGCATCA 96 0 

GTTGCCAGTA CAATTCATCA TTGTCAATGA AGCGGGCGCT TCAGTATACT CAGCATCAGA 102 0 

AATTGCTAGA GATGAATTTC CTGATTTTCA AGTG 1054 
(2) INFORMATION FOR SEQ ID NO: 557: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1057 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 557: 

AAATGTCAGA ATACAAGAAA AAAATAATTG AATTAATTGA AAGTAATTTA ACAGGATATG 6 0 

AAATTTCTAA AAAAACTGGA GTTTCTCAAT ACGTACTTTC ACAATTAAGA CAGGGCAAAC 12 0 

GCGAAGTAGA TAATCTAACC CTGAATACAA CAGAAAAATT ATATGAATAT GCCAATAAAG 18 0 

TTTTGTAATT TAACTAATGT ATAAATTAAT CAAGCTATGT TTATTTGATT TAACTATTAA 24 0 

. ' ^\7\'-^r.'^ -^-^r-.n^n^A-^r: cn'^A'^'-A'^AA '^AATTAAAAT A^AAAAATA^ '^AOATTPrAA ^oo 
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AAGGGAAAAT AAGTGTTAAG TTTTAAATGA TAAAAAAGAT TGGAATGGAT CGTCTTGAAA 4 20 

TGCTCCCTTC AAAGTTTTCA TTTTTTCAAT GTCGACTTCG AAGGGGGCAT TTTCATTAAA 4 80 

TTGTTATAGC TTTTTATATT TGTATAATGA ACATATAAGT TTAAGAAGGT GCGAGTGAAG 54 0 

GAAATAAAAA AGCTCAAATG TACCAAATTG TTAATCTTAA TAAATCTCTA CTTTATAAAG 600 
ATTGAATGGA CATTCGAGCG TTAATCAGTC AGGAGGGACT TTCCCTCCTA CAATTTAATA 660 

ATAATACTTG CTTCACCACT ATACAAGGAG TGAGTTGTTA TGTTCAAAGT GAATTATTCG 72 0 

ATTTTAAGTT ATTATCCAGA ATATAATATC GCAGTAAGTT GGCAACGTTT AAGAGAAGGA 780 
AAAACAATAA AAAACAAGAT TTAATACTGC TGCGTCATGA GGCGCTTGAA CATTATTTGA 840

TGAATAAGTA TAATTTCAAC TATGATTATG CACATAAAAT TGTATCAAAA AAATACGATT 90 0 

ATTCAATTTT TATATWVaAG AAGGTGGATT AAATGCTTAC ATTAATAAAA TTGGAAAGAA 960 
GATGaACAGG tTATAATATA TGraTATATa CCTGaAGATG aTATAAGTAC CGGGTAAAGG 102 0 

GTCCCGTACC TTTTTAATTA AAAAAGTTCC AGGGGGT 10 57 

(2) INFORMATION FOR SEQ ID NO: 558:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3754 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 558:

CTGTGCTGTA TTTACtTTAA TTTGACGACA TTTGAGAAGT ATTATGATGG nTGTAGTTAT 60

TTTATGAAAG TAATGTATTA ACAATCGAAT TACTAAGTCT ATATTGGTAG GAATATCGAT 120 
GTTTAGTTTA AATGGAATGC ACTATTTAAG TTTTTAATAT GGAGATGTTT GTGACTTTTC 180 
GATGATTAAG ATTTTTATAG GTGTGCATCA TTTCAAATAA ACTTTGTGTT TAAAATTGAG 240 
CTTAGGAAAT CGATAGGTTT AGATGAGGAT ATTGTTGAAG TTATGTGTCT TGTATCCTTA 3 00 

GTTGTTATAA AAGCGCAAAA AATAGCACCG CTTTCTCTTT ATCTGTGTAG AAAGGATGCT 360 

ATTATTGTAA AACAATAGTT TTAATTTnAT TTTCTGATaT ATCATATGTr ATTcTACCTG 420 
TATCAATTTC TATCGAATTA TAACCATCAA AATTATCAAC TTTATCATCA AAGTCTATCA 4 80 

CTTTTCCAAT TAATATTTTA TTATTAGTAA GCGTTAATTT GACTAATTTG CCTATGTATG 54 0 

ACTGTAAATT CATATTTAAT CACTCCTTTT TAATATACGG AACTACATGA AAACCAGTTT 600 
TAGAATAATG AACCTTACCT AATTTCGTTT CAATATATTT ACCATTCACA TAAGATTTAC 660 
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AAACTGGAAT ATCAATAAAT TCTTTTTTAG TCATCTTTTC ACTTTCATTA GAATCTATTA 78 0 

TAGTATAACT TGGTAATAAA GATGAATTAT TTTTCTTATT ATTATATTCA TTTGTGCATA 84 0 

AAATATGACG CTTTTGCTTA ACATTATTCA ATTTCATCGT CATTTTACCA TTACTTATCA 900

TTTCAAGCAT TTCTTTTTTA GCTTTTTGTA GAAAGGATGC TATTTTGTAA TTATTTTAAT 96 0 

AATCTAATAC TTTTTATCTC AGTTTCATCA AACGAATAAA CAGCAAAATC AGTCTGTATA 102 0 

TCTACAACAA GATTTCCTGT TTCGCTCTCG AATTCATTTT CATAGTCAGT TACAAACCCT 10 8 0 

TCGTATTCTT CATTATTCAA AAGCGTGATA ATAACATCTT TACGATATGC ATCTnCAATT 114 0 

CTCAATACTT TTCACCTACT TATCAATATA AGGTACTATA TGaGcACCTG GTCTTCGAAT 1200

AATGGCACTT cCCTTTCTGG TTTCAATATA CACATTTTCG ATATGTATTT TTCCAATAAT 126 0 

TTGATTAAAA TTAATAATCT CTTTCAAATC AAATCGCTCA TCACTTAATA TTAGATTGCC 13 20 

TGTTGACATT TTTTCTCTTA ACAATTCATT CAATAAATCT ATAGAAAGTA TTGTATAGCT 1380

AGGCAATTTC TTATTATTTA AAATGGCTCT TTTTTTATTT TCATTATATA GGTGATGACC 144 0 

TAACATATAT CTATTTTATT TTCCGCAATT TATTTCTATT TTTATTTTAC CATTTTTAAT 1500 

CATTTCTTTC ATCTGGTTTT TAGCTTTTtC CTGtAATTAT GCTTCTTTTA CTTCTACTTG 156 0 

ATATTTACCt TCACGCTCTT TAAAGAACTT GTCCCGCCAA TTGCCAACAT GTGGCACTGT 16 20 

GGTACTTCTA CACCAAGGAT GCATAGGTGG CGCATTCACA CCTGGTATCA TATCTTTAAC 16 80 

TTTAAATATT TTTCCGTTAA GTGAATGACA TAATTTAGAT GTTTTACTAT CTATTTTGGC 174 0 

AACATATTTA TATTCGCCAT CTTCACCAAG TTCTTTTAAA TATGTTAACT TTTGTGCTTC 1800 

TGCATTTTCA GTAAATAGTT AAAAAAGCGT ATAAAAATAG CACCACTTTC TCTTTAkCTG 1860

TCTAAAAAGG ATGCTATTTA TCTTTTGAAT TTGAATTCTT TTTCGCTTTT TCTATACTTT 1920 

CAAATTCTTC AACTAATTCT TTAAAAGATT CACTCAATTC TTTTGCAGTT ACATTTCCAT 19 80 

CTAATTGTGA ATCTAACATA ATTAAAATCA TCTCACTTTA TATTTAATCa TATTTATACT 2040

ATAAAGTTTT TTCAATAATT TTTCAATATG GCTATCATTA ATGATATTAA TATGCGTAAA 2100 

ATATTTAGCA CAAAATTTAC TCACTATTTT ACCATGAAAC CTATTTGACT TGGTAATAAA 2160 

TTTTACTTGT CCCTTATTAG TAACGATTGT CATTGATTTT ATTGATGGAT GCTTAAAAAA 2 220 

TGTAAATAAA TCATATTCTG AAAATCCTGA CTGTCCAGGA TGGTTATGTA ACATAACAAT 2280 

TGAATTCGGT TTACTGTTAA ATAATAATTC GGTTGCTTGT TACCCTGGCA CAAAAGATAC 234 0 

ACTATCTTGA TTGACATATA CTTTTGTAAA TTTACCATCT TTTAACAAAT AAGCTACTTC 24 00 
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GAAGGTTAAA ATTTTTAAAG TAAACTTTCT TTATCCCCAT GCTACGAGTT CAGATTCAGG 258 0 

AAATAGCCCT TTACTAGTAT TTATGTATAT TCTGTCTATG GCATGAATAA AATAATTATC 264 0 

TCTTGTATTT tTTTCTAAAC TAGATTTTTC AGCATTGATA ACTTCAAGAC TATCTATATC 2700

CATTTGAATA ATACCAGGCT TAATATTTTC ATCATTATTA GGAAAATATT TATATGTAAC 2760 

ACTTTTATCA TTAATTTCTT TTATTTTTAA TATTAGCAAT CATTTCCACC TCTAATTAAT 2820 

TAAAATACTA TAATTATATT TTATTTCTGT AAGTTTATGT GCCTCTATAT AGTGTAAATT 2 88 0 

ATACTTATTC ATTAGATAGT GTTCAAGAGC TTCATGTTTC TACATTATTA TATCCATTTT 2 94 0 

TTTAATATTT TTCCCTTCTC TTAAACGTTG CCAACTTTGA GCCATATAAA AGTCAGGATC 300 0 

AAATTGTTTA AATCCACTTT CTAATAAATA CTTATTTTCA AATATATGTT CATAAACTCT 3060 

TTGAATTAAA TTTTTATTTA TATTAgTATT TTTAGCAATT TTAGAAATCT CTATCTGTTT 312 0 

ATCTCGATTT CTAACTGAAT TATAATAAAT TTGAGCATGT CTGTTCCTTT TGATACCGTA 3180

TTCATCACTT TTATTATTAA GTGCACCTGA TTCAATAAAA CAACCTTCTA CTTGATATTT 324 0 

ACCTTCACGC TCTTTAAAGA ACTTGTCTCG CCAATTGCCG ACATGTGGCA CTGTGGTACT 3300 

TCTACACCAA GGATGCATAG GTGGCGCATT CACACCTGGT ATCATATCTT TAACTTTAAA 3360

TATTTTTCCG TTGAGTGAAT GACATAATTT AGATGTTTTA CTATCTATTT TGGCAACATA 342 0 

TTTATATTCG CCATCTTCCA CCAAGTTCyT TTAAATATGT TAACTTTTGT GACTCtTCyT 34 8 0 

TTTCAACGAA TAATGAAAAA AGCATATAAA AATAGCATCG CTTTCTCTTT ATCTGTGTAG 3 54 0 

AAAGGATGCT TTAATACCaT GCTATTTTAT AATTTTcGGG AAATTcTTGC TTCTCGATAA 3600 

AGTCTCTTAC TACAGAAAAA GACTTATTAC GATATAACAT AAAATATTCT TCATTTTCTA 3660 

TTTCAGATAA TGAGAAATCT ATAACCTCTG CATCTTTTTT ATTAAAAGTT ACTGAACCTT 3 72 0 

TACCGTTACT TATATCATCT TCAGGTATAT ATTC 3 754 
(2) INFORMATION FOR SEQ ID NO: 559:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 815 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 559:

ATTTAACTAA AACTATAaAT AATCAAATGA TATTGGAAGA TATTaGCATA GATATCGAAA 60 
AAGGTAAATT GACTTCTTTA ATTGGACCTA ATGGTGCGGG TAAGAGTACT TTACTTTCAG 12 0 
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10 



CTGATTATAA AAATAATGAC TTGTCGAAAA AAATATCTAT ATTAAAACAA ACAAACCATA 24 0 

CTGAAATGAA TATTACGGTA GAGCAGTTGG TAAACTTTGG ACGATTCCCT TATTCTAAAG 3 00 

GTCGTTTGAC GAAAGAGGAT CATGATATTG TCAATGATGC GCTAGATTTG TTGCAACTAC 3 60 

AAGATATCAG AAATCGTAAT ATTAAGTCAT TATCTGGTGG ACAACGTCAG CGTGCATACA 4 20 

TTGCAATGAC AATAGCACAA GATACTGAAT ATATTTTGCT AGATGAACCA TTAAATAATT 480

TAGATATGAA GCATGCTGTT CAAATTATGC AAACGTTAAA AATGTTAGCG CATAAAATGA 54 0 

ATAAAGCGAT TGTCATTGTG TTACATGATA TTAACTTTGC GTCCTGTTAT TCAGATCAGA 600 

TTGTAGCATT GAAAAACGGA CAACTAGTTA AGTCAGATTT GAAAGATAAT GTCATTCAAA 66 0 

GTAGTGTTTT AAGTGATTTA TATGACATGA ATATTCAAAT TGAACATATA AGAAATCAAA 720 

GGATTTGTTT ATATTTTAAG GATTGATAAT TTGGAGaCAC TTTAAAGGGG TGATGCGCCA 7 80 

ATTAAAGAAG GGTTAAACGT AAAGCATTTA TTTAT 815
(2) INFORMATION FOR SEQ ID NO: 560: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 919 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 560: 
GAAACGAATA ATAAATTTAC TGAGTTATTA GTTGAAAAAG CTAATAAACA TGATGATGTT 6 0 
CTCGATAmGT TGATTAATAT TTTAAAATAA GCGATACACA CTACTAAAAT TGTATTATTA 12 0 
TTATGTTnAA TTGACnCCTC CTAAATTTGC AAAGATAGCA ATTTAGGAGG CGTGTTTATT 180 
TTTATTGACG TCTAACTCTA AAAGATATAA ATTAGACATT TACAAATGAT GTAAATAACG 24 0 
CAATTTCTAT CATCGCTGAT AACAATTCAT GGTTTAATAT GCAATGAGCA TATACTTTTT 300
AAATAGTATT ATTCACTAGT TTTAACAATC AATTAATTGG TATATGATAC TTTTATTGGT 3 60 
TATTTTTATC CCATAGTGTG ATAATTACTA TTTTTCATTC ATAATAAAGG TTTAAAGCAT 42 0 
GTTAATAGTG TGTAAGATTA ACATGTACTG AAAAACATGT TTAACATAAT GATATAAGGA 4 80 
GTGACGTACA TGATCCGTCT AGGTAAAATG TCAGATTTAG ATCAAATCTT AAATCTAGTA 54 0 
GAAGAAGCAA AAGAATTAAT GAAAGAACAC GACAACGAGC AATGGGACGA TCAGTACCCA 6 00 
CTTTTAGAAC ATTTTGAAGA AGATATTGCT AAAGATTATT TGTACGTATT AGAGGAAAAT 6 60 
'-^^r,^rrAAr^ ^AGAATnr:'-A V^.nv-GACA'T^'^ 
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TATAAAGGAG CTGCTACAGA ATTATTCAAT TATGTTATTG ATGTAGTTAA AGCACGTGGT 84 0 

GCAGAAGTTA TTTTAACGGA CACCTTTGCG TTAAACAAAC CTGCACAAGG TTTATTTGCC 900 

AAATTTGGAT TTCATAAGG 919

(2) INFORMATION FOR SEQ ID NO: 561: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 518 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 561:
ATCATATAAA CCGGCTGCTT CTAAAAACGA AAATACTGTT ACTGGACCTA AAAATTTAAA 60 
CCCGTATTGT TTTAAATCTT TAGATAGTTG TGTTGCAGTA TCATCAACTG TGATACGATC 120

AGAAGCATGT TCATACTGCA AATCTTTAGG CTTACCATTT ACATATGACC ATAAAAATTT 180 
ACTAAAACTA CCATATGCTT GTTCAATTTT TAAATACCCT TGAGCTTGAT TAACAATTGC 24 0 

TTCTAATTTT TTACGATGAT GAACGATATT TGGAAAAGTC ATTAAGCGGT CGATATCTTG 300 
AGCGGTCATT TGTGCTACCT TTTCTGGTTC GAAATCATAA AATGCTTCTT CATAGGCTTC 360 
TTTCTTTTTT AAAATAGTTA ACCAAGATAG CCCAGCATGT TGTGaTTCTA ATGCTAAAAG 42 0 

TTTAAACAAT GCCTTGCnAT CATAGAGCGG TTGTCCCCAT ACATGGATCm TGATAGTCTA 4 80 

AGTAGACTGG GATCTTTAGT ACCAAATGCG CATTCATT 518 
(2) INFORMATION FOR SEQ ID NO: 562: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1539 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 562: 

CTTATTTAAA AGTAAATCAA TCAATGTATT ATAATCCGAA TAGTCCGCAT AAAGCTGGTT 60 

TGCaGGCAAA tCAATTACTA CAACAAGCAA AAACCCAAAT TAATGCAATG rTTaATTCAA 120 

AAACAAATTA TGATGTTGTA TTCACTAGTG GTGCAcTGAA TCCAATAATC TTGCTTTAAA 18 0 

AGGTATTGCC TATCGTAAAT TTGATACAGC GAAGGAAATA ATTACATCCG TGTTAGAGCA 24 0 

TCCGTCCGTA TTAGAGGTTG TAAGATATTT GGAAGCACAC GAAGGATTTA AAGTTAAATA 3 00 
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CAAAGTCGGT TTAGTAACAT GTATGTATGT AAATAATGTA ACTGGACAAA TACAGCCTAT 4 20 

TCCACAAATG GCTAAAGTTA TAAAAAATTA TCCTAAGGCA CATTTTCATG TAGATGCGGT 4 80 

TCAAGCATTC GGCAAAATTT CAATGGATCT CAATAACATA GATAGTATTA GTTTAAGTGG 540

ACACAAGTTT AATGGTTTAA AAGGACAAGG CGTCTTACTT GTAAATCACA TTCAAAATGT 600 

TGAACCAACT GTCCATGGTG GTGGTCAAGA ATATGGTGTT AGAAGTGGAA CAGTTAATTT 660 

GCCAAATGAT ATTGCAATGG TTAAAGCGAT GAAGATAGCT AATGAAAACT TTGAAGCATT 720 

GAATGCATTT GTTACTGAGT TAAATAATGA CGTCCGTCAA TTTTTAAATA AATATCATGG 780 

AGTTTATATT AATTCTTCAA CTTCAGGTTC ACCATTCGTT TTAAATATTA GTTTTCCTGG 840 

CGTAAAAGGT GAAGTATTAG TTAATGCTTT TTCAAAATAT GACATTATGA TATCTACGAC 900 

AAGTGCTTGT TCATCTAAAC GTAATAAATT AAATGAAGTA TTGGCTGCAA TGGGATTATC 960 

AGACAAATCT ATTGAAGGTA GTATAAGATT ATCATTTGGG GCTACTACAA CTAAAGAAGA 1020

TATAGCGAGG TTTAAAGAAA TATTTATCAT CATTTATGAG GAAATTAAGG AGTTGCTAAA 1080 

ATAATGAAGT ATGATCACTT GCTTGTTAGA TACGGGGAGT TAACATTAAA GGGTTCAAAT 114 0 

AGAAAGAAAT TTGTAAATCA ATTAAGAAAT AATGTAAATA AGTCaTTAAA AGGACTTGAT 12 00 

GGGTTTGTCG TTAAAGGCaA ACGAGATCGT ATGTATATTG AACTTGAAGA CCATGCaGAT 1260 

ATAAATGAAA TAACATATCG ATTATCAAAA ATTTTCGGTA TTAAATCTAT TAGTCCAGTA 1320 

TTAAAAGTAG AAAAAACAAT AGAGGCAATA AGTGCAGCGG CAATTAAATT gCGCAGaATT 1380 

TGAAGaAAAC AGCACATTTA AAATTGATGT GAAGCGTGCC CGATTAAAAT TTCCCCAATG 144 0 

GATACGGTAT GGAATTACAG CGTGGAATTG GGGTGGTGCC AGTATTGGAG CACTTCGCCA 1500 

TATTTCCAGT GGATGTCCAA CGTCCCAGnC CCAGGAATT 1539
(2) INFORMATION FOR SEQ ID NO: 563: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 968 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 563: 

ATAnCGTATA CATGTGTTCT TTAAAATTGT GATAAGGAGT TTAGGATGGT TTATTTAAAA 6 0 

TCAATAGATG CCATTGGATT TAAGTCTTTT GCAGATCAAA CCAATGTTCA ATTCGATAAA 120 

, -—ASAT^-nr rv^--^,-'^AAA..?^ ^TAATAT^A^ AGATG^TA'^"'^ ^pn 
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5 



ATCTTCTCAG GTGCAGAACA TCGCAAAGCT CAAAATTATG CTGAAGTACA GTTAAGATTA 300

GATAATCATT CTAAAAAGCT CAGTGTTGAT GAAAACGAAG TTATTGTAAC AAGAAGATTG 360

TATCGAAGTG GTGAAAGTGA GTACTACATA AATAATGACC GTGCAAGATT AAAAGATATT 420

GCCGATTTAT TTTTAGATTC TGGATTGGGA AAAGAAGCGT ATAGCATTaT CTCGCAAGGT 480

AGAGTTGATG AAATACTAAA TGCTAAACCA AITGATAGAC GTCAAATTAT TGAAGAATCG 540

GCTGGTGTAC TTAAATATAA AAAACGTAAG GCTGAATCAT TAAATAAACT TGACCAAACA 600

GAAGATAATT TAACGAGAGT AGAAGACATT TTATATGATT TGGAAGGTCG CGTAGAACCT 660

CTAAAAGAGG AGGCAGCTAT AGCTAAAGAA TATAAGACAC TTTCACATCA AATGAAACAT 720

AGTGACATTG TAGTTACAGT sCACGATATT GATCAATATA CAAATGACAA TAGACAATTA 780

GATCAACGTT TAAATGATTT ACAAGGCCAA CAAGCAAATA AAGAAGCTGA CAAGCAACGT 840

TTAAGCCAAC AAATTCAACA ATATAAAGGT AAACGTCATC AACTTGATAA TGATGTTGAA 900

TCgCTTAATT ATCAATTAGT AAAAGCTACG GAAGCCTTTG AAAAATATAC GGGACAATTA 960

AATGTTTT 968



(2) INFORMATION FOR SEQ ID NO: 564: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 436 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 564: 

TTGTGTGTAT GCATTCAATG TGCTCTGTTT GTAAATGGCT AGCTATATAA TTTAgGATTC 6 0 

GAGGATCGTC ATCGACAACA AGACATTGCA CCATAGCTAT AAACTCCCTT ATCTTTTTCA 12 0 

TTTATTATAC ATGTAAAATA TTTTTGCGTA AAAAAACAAT TGTTCATATT GAGTTCATAT 180

TTCAACCTTA TACTGACGCT AAAGAAGAAA TAGGGAGAAG TGAATCGATA TGAAATTAGC 24 0 

GATAAAAGAG ATTATGTTTT ACAAATTTCG TTATATTTTA ATCACATTAA TCATTCTTTT 3 00 

ATTAAGTATT ATGGTGTTAT TTATTAGTGG TTTAGCTCAn GGGCTTGGTA GGGaGAATAT 36 0 

TTCGTwATTT GAACACTTTG GATAATGATG aaTATGtTGT TCaAAAAATG AAAGAGCCGC 42 0 

aAATTGaGAA ATCGCA 436 

(2) INFORMATION FOR SEQ ID NO: 565: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2554 base pairs 
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(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 565: 

ATGTTTTGTC AATATCAATT GTTTGAGATA AATCCGCTTG TATAAAACGA ACTTTATCAT 60 

CATTAAATTT GCTAGTTAAT TCATTTATAT CAGTACGATA ATATTGCACA TAAACTTCAA 12 0 

ATCCATCAGT TAATAATTGT TTGACTATCT CAGAACCAAT TGAACCAGAA CCACCTAATA 180 

CTAATGCTTT CATTACTTTT TAATCTCCAA ACGACTATCG ACTTGCTGAT CTAAATTTAA 24 0 

ATATAACGAT GACGTTTCGT TAATACTATC TAACGTGATA TTTTCAACAA TGTCTAACAT 300 

GTCAAACACG CTAACACCTT CAAAATACAA TTTAGTATAT TGATTAGCAA TATATTCAGG 3 60 

TGAGTTTAAA CTTGATATGA ATTCACCTAT AAATTGCTTT TTCAAAAGTT CAAATGCTTC 420 

TGCATCTTGG AAAtTGCCTT TTTTATCACG CAACTCATCT AATAATAATT TTTTTAATTT 480 

ATCTGGTTCT TCAGTAGCAC TTGTCACGAT TGAAAAACTA TACGTCGGCT CTAGTACAAA 540 

TTGATAACCA AATGTATCAT CGATAAGTCC TTCGTTTAAT AAATTCTGAT AAAAATCTGT 600 

TTCTTCCCCA AAAATTAACT CAAAGAATAA TGACATTTCT AAATCACGTT GTACATATTT 66 0 

TTGAGGCGCT TCTTGTAATG GTTTATTTTT AAAACCAAGC ATTAGTCTTG GTGATTGAAT 72 0 

TTTCATAGAT TCAGTAACAA ATGCTTCTTT AACATCcTCC GGTTCATCAA CAAGTCCTCG 780 

TTCGATTTTG GGTTGGTTAA CTTTATTACG AGCATCCTCG TGTTGTTTTA CTATTCgACA 84 0 

TATTGCTTCA GGATCCACAT CGCCAACAAC AAATAAAACC ATATTTGATG GATGATAAAA 900 

CGTTTCATAA CATAGATACA AATCATCTTT TGTAATATCG TATATACTTT CTACACTACC 960 

GGCAATATCA ACACGTATTG GATGTTGTTG ATACATTGCA CGCAAtGTAT TaAACATTAA 102 0 

TTTATATCCA GGTTGCTCTT GaTACATTTT TATTTCTTCT GCAATAATAC CTTTTTCTTT 1080 

ATCAACAGTT TCTTTTGTAA AATAAGGCGT TTCaACCATT GTAAGTAAAC GTTTAATGTT 114 0 

GTTTTCAATA TtATCAGTTG CACTGAACAA GTAGCTTGTA CGATCAAAGC TTGtAAACGC 1200 

ATTTGCTTGT GCGTTATCTT CAGCAAACGC AGTAAATAAG CTTCTTCTTC TTTTTCAAAT 1260 

AATTTATGTT CcTAAAAAGT GAGCAACTCC ATCAGGTACA GTAACAAATT GGTCTTGTCC 1320 

AAGGGGTTTG AATTGATTAT CTAATGAACC AAATTGTGTA GTGTAAGTGA CAAATGTCTT 1380 

TTGAAAACCT GGlTkGGGGA TAATAAATAA TCGTAAACCA TTTTCTAATt CTTGTTCGAA 1440

tACTCTTTCG TCTATTAATT CATAATAACG CTCTTTCATT ATTTATCCCC TCCTTTTGTC 1500 
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TTATGCATAA TCTCTATAAT ACTTTTCGGA CGATCTTCAG ATTCATATCG ATGAGAAATG 16 80 

ATTACTTTTT TAGCTAACTC TAATTTTTCT TCAGTGAAAT CTCCTGCTTT TATTTTTTCA 174 0 

AATTCACTTA TAATAGTGTC TTTTGCAGTT TCGTACTTAT CACTTGAAAC CCCACTCAAA 1800 

ACAAATAAAT AGCCATTTTT GCCATCAATT TGTGAATGTA TAGAGTACGC TAAACTTTGC 1860 

TTTTCTCGCA CTTCATTAAA TAAAACAGAT GAAGGATCTC CTCCAAACAT CATGTTAAAT 192 0 

ACAACAAAGG CAGCATATCC ACTTTGTCCA TATTGTGTTG GAAAACGGTA TCCCATATTT 198 0 

AATTTAGCTT GATCCACGTC ATCATATTCA ACAATATAAT CAACTTCTTC ATCGTGTAAA 204 0 

TGATGAGTAG AATGTTGGAA TTGATGTTTA TCGAATGGTT TAAGTGCAAA TTTTTCACGT 2100

ATTTGTTTCT CAACACTTTC AGGTTCTACA TTGCCGACAA CATAAACAGA ACATTGATCA 216 0 

TTATTAATCA TTGATTGATA TGTATGATAT AGTGTTTCAG CAGTAATATG TGGGATTTGT 222 0 

TCTAGTTGTC CTGTAGATAA GTATTTATAT GCTTCATTTT CAAACATATG GTCGAGTAAT 2280 

TTTAAAAACG AATATTGTGC TTTATTATCT ACCATTGCTT CTATTTTTTT GGCTAATAAT 234 0 

GTTTTCTCTT GGTTAACAAA ATTATCATTG AATGCTTTAT TTTCAATTAA TGGATTCCAA 24 00 

ATGATTTCTT GtAATAAATC TAATCCTTGa TTAAATAATG AwTCACCGkT TCyTAAATAA 246 0 

CGkkCaTTAA caATTyCTAA tGaAAATGtA ATgACaTGCT GaTCTTTGAA TTTTGAAATT 2 520 

GTACTATTCA CATACGCACC ATATAAATCG GCTA 2554
(2) INFORMATION FOR SEQ ID NO: 566:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1424 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 566: 
TnTCGCTATT TTnAATTGGT TTTGTATGGT TTAAGTTATA TCAATATACA ACAnACCCTA 6 0 

AAGCTGATAT CCCAGGTATC ATTTTTAGTA CGATTGGTTT TGGTGCTTTG TTATATGGTT 12 0 

TCTCAGAAGC TGGCAACAAA GGTTGGGGTT CAGTAGAGAT AGAAACAATG TTTGCGATTG 180 
GTATTATCTT TATTATTCTA TTCGTTATTA GAGAATTAAG AATGAAATCA CCAATGTTGA 24 0 

ATTTAGAAGT ATTGAAATTC CCaACATTTA CATTAACAAC AATTATTAAT ATGGTTGTAA 300

TGTTAAGTTT ATATGGTGGT ATGATTTTAT TACCGATTTA TTTACAAAAT TTACGCGGAT 36 0 

TcTCAGCATT AGATTCCGGA TTGTTATTAT TACCTGGTTC TCTAATTATG GGTCTACTAG 420 
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TTGCTGTAAT GACTTATGCA ACATGGGAAT TAACTAAATT AAATATGGAT ACACCATATA 54 0 

TGACAATCAT GGGTATCTAT GTACTTCGTT CATTTGGTAT GGCATTTATA ATGATGCCAA 600 

TGGTAACTGC AGCTATTAAT GCGTTACCGG GACGACTTGC CTCTCATGGT AATGCTTTCT 660

TAAATACGAT GCGTCAATTA GCAGGCTCTA TAGGTACAGC AATCTTAGTT ACTGTAATGA 720 

CAACACAAAC TACACAACAC TTATCAGCTT TTGGGGAAGA GTTAGATAAA ACGAATCCTG 780 

TTGTACAAGA TCATATGCGT GAATTAGCAT CACAATATGG CGGACAAGAA GGCGCAATGA 84 0 

AAGTGTTACT ACAATTTGTA AATAAACTAG CAACGGTTGA AGGTATTAAT GATGCATTTA 900 

TAGTTGCAAC GATATTTAGC ATCATCGCCT TAATTTTATG TTTATTTTTA CAAAGTAATA 960

AAAAAGCAAA AGCTACAGCT CAAAAGTTAG ATGCAGATAA TAGTATCAAT CATGAATAAA 102 0 

TAAAATAAAT TAATTGAAGT GTGACTAATC AAAAATTATG TTGTGGGGAC ATGATTTTTA 108 0 

AAGTATCGGT GCCAAATATG GTTATCGATA CTTTTTTTAT TTGTTGATTT ATAGAATGTT 114 0 

AGAGGAATTA TATTAAAATT TGGCATTGAC GTAGTAGGTC ATTAATAAAG AAAAAGCAGG 1200 

AAGTGGGTCA ACGAAATGAA TTTTGTGAAA ATAACATTTC TGTCCCAATC CCTACTATAT 1260 

AACATTATTT TAAACGAGGC ATGCGATTAC GGAAGAATAA GCTTATAACA AGTAAACCGA 1320 

TGCTACAGCC AAGTAAAATG ATGCCGTTAT GAATAGCGTC ACTTgCTGTA ATCACTTGAT 13 8 0 

CTGGTGGTAC ATTTAAATAA TATTTTTTGA AAACATCTGC AATT 1424

(2) INFORMATION FOR SEQ ID NO: 567: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 676 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 567: 

TCTCCCATGT TCTGCTAAAT GACGCATCAC TTTTACTTCA TGAGGCGTCA ATACACGTCC 6 0 

TTCACCAGCA TTCAAACCGA CAACATTTAA AGGCCCATAT TCAATACGAG ACAGTTTCGT 12 0 

CACTTGATGA CCAAAATGTT CGAACATTCT TCTGACTTGG CGATTACGAC CTTCTGTAAT 180 

TGTAATTTCA ACCAATGTTG TGTTTTTATC TTTATCTTGT TTCTTAACTT TCACTTCAGC 24 0 

CGCTTGCGTC ATACCATCTT CTAATTCAAT ACCTTTTTCT AGCGCTTTCA CTTCTTCTCT 300

CATTAAATAA CCTTTTAATT TCGCAACATA TTTTTTCTTA ATTTGATATC TTGGATGTGT 3 60 
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ACGTCCTCTA TCATCAGATA CACTTGTkAT CACTTGAGTT GGkTTATGGA AkAAAATGkA 
AAtTTTGTCT TCTAGTTCTA TTTTAATACC TTCAACTTCA ATCGTATCTG ATGGCTTCAC 600

TTTTGTTCCT AATTCAGTGA CAGTCGTACC ATTCACTTTC ACTTTTCCTT CAGAAATTAA 660

AGTTTCTGCC TTACGT 676

(2) INFORMATION FOR SEQ ID NO: 568: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 454 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
(D) TOPOLOGY: linear
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25 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 568: 

GAAACGGTTC TACCAAAAAA CAGTAAGGGC TAAACCCAAT CATGGTAAGA CAAAAAGTAC 6 0 

AAATAGCCAT GCCCAAGTTG AACTCGCTGT ACGCCTATTT CTTTCTAAAA AGATAATAAT 120 

AAAAGCCAAT ACTAAATTAA TGATGAATCC AATGGCTAAA ATAATAGTAA ATAACGTTCC 180 

TAAATCGTTT GAAAATGTAA ATCGCATAGT CTTTTCTCCT ATAAAGAAAG GCACAAAAAA 24 0 

ACATTTTGCA CCTTTCACGT CATATTATTT ATTCACAGAT AAAGTTAAAA TTGcATTGAA 3 00 

TTCTTCTTCA TTATTTGGGA ATGTTCTTTC TTCTATTTCT TTAATAGTAA TATTTACTAA 36 0 

TTTTAAATTT GTAGCTTCTT CAGAACTTAA AAAAGCATTA ATGTTTTTTT CTAATAACTC 42 0 

kAAAGTCTCA GCTGtAAAaG TTTTAAGTTT AATT 454 
(2) INFORMATION FOR SEQ ID NO: 569:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 894 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56 9: 

ACGATATTAC CACCCTCTGA AATAACATCA ATCATGTGAT CTGTAAAGTC CCAAGGATGA 6 0 

CTTGTTGTGA AACGAACTCT TGGAATCGCT ATTTTAGAAA TTGCTTGTAA AAGATCTCCT 12 0 

AAGTCATATT CTATATCCTG TAAATCTTTA CCATAAGAAT TTACATTTTG ACCTAAAAGC 180

GTTATTTCTT TGTAACCTTC ACGAGCAAGT TCACGTACTT CATCTATAAT GTCTTCAGGT 24 0 

CTACGGCTTC GTTCTTTACC TCTTGTAAAT GGAACAATAC AATATGTACA AAACTTATCA 3 00 
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TCAATAACGT CTCCTTCTTT AGACCATACT TCAACAACCA TTGCTTTAGA TAAGTATGCT 420 

TCTTCTAAAA TTTCTGGTAA ATGATGAATA TTATGTGTAC CAAATATCAT ATCTACATTT 4 80 

TGATACGATT TTAAAATTTT ATTCACTACT GACTCTTCTT GTGACATACA ACCACAAACA 54 0 

CCGATTAAAA TATCAGGTCG TTCTTTTTTC AAATrCTTCA AATTACCTAT TTCACTAAAC 600 

ACTTTGTTCT CGGCATTTTC TCTAATCGCA CATGTATTAA TTAAAATAAC ATCTGCAGTG 66 0 

TTAATATCAG TCXSTkGCTTG aTAGCCTAAT GCyTCmAGTA TACCAGCAAT GACCTCAGTG 72 0 

TCATGTGCAT TCATTTGACA TCCATATGTT TTAATTAAAA ATGTACGCTC GTTCCCCATA 78 0 

CCGCGATATT TTyCATCAAT TtGGgAAATC nCTATTATAA CGAACTTCTk GTtTACCnCC 84 0 

TTTTTTnCGC TCCTTTAAAA TTAAGGCGGC TGATAAACAG GTCCAAAATA TTAC 894 
(2) INFORMATION FOR SEQ ID NO: 570: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 441 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 570: 

TATCAATCCC ACAGCACATG CTGAACAAGA TCAAACATGG GAGAAGATTA AAGAACGCGG 60 

TGAACTTAGA GTGGGTCTTT CTGCAGATTA TGCACCAATG GAATTTGAGC ATACAGTTAA 120 

TGGTAAGACT GAGTATGCAG GTGTAGATAT TGATTTAGCT AAAAAAATTG CGAAAGATAA 180 

TAATTTAAAA TTAAAAATCG TCAATATGTC ATTTGATAGT TTGTTAGGAG CTCTTAAAAC 240

TGGAAAAATT GATATTATTA TTTCCGGAAT GACTTCAACG CCTGAACGTA AGAAGCAAGT 300 

TGATTTTTCA GATTCATATA TGATGACTAA AAATATCATG CTTGTAAAGA AAGATAAAGT 3 60 

TAATGAATAT AAAGATATCm AAGACTTTAA TAATAAAanA GTnGGGGCAC AAAGGGACTG 420 

AACCAGAAAA AATCGCTCAA C 441 
(2) INFORMATION FOR SEQ ID NO: 571:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1205 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double
(D) TOPOLOGY: linear
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AAGTCGTTCA TCTTTTTCTA CAAGGGTGTA AAAATAATCA ATCATATCGT ATAACGCTTC 12 0 

TTTACCAATG ATGTCATATG GTGTTGTTGT CATTTAATCA CCCATTTTCA AAAATTTACT 18 0 

GTTACGAACT TAAGTTAATA TATAACTAAT ATAACATGAT TTTAAACATT TGAAAGAAAT 24 0 

ATGCATATTT GCCAATTTAA TTTATATTGT TTGAAAGTGT TTCTTTTTTC TTGAAAAAAC 30 0 

GTTGAACTTT ATTTAAAGGT tGATGATGTT CGAGGTTTAG TTCGTTtAAT AAAGATtGGA 36 0 

ACTTTTGTAA ACCTTGATTA TAGTCTTTAA CTTCGAACTC TAACTCATAA TCCGTAGTAT 42 0 

CGAAATACTC ACTTTTATCT AAAACCAGTA AATCACCTTT ATATTTAGTT TCTTGGCGAT 4 80 

ATGTCGTTAA TGCACCAAGT ATTGATAAAG TTGTATCTTT TACACCAAAC TGTTCAACTA 540

TAATTTGACG AATGTCATCT GGAAGATTGT CGTTTGAAAT AATCAAGTTC ATCTCTGGTT 600 

TAATGTCGAC GATATAGTTG TATTCTAATA GACCAACCTT TGCTGGTGTC TTTAAAGTCA 66 0 

TTTCATATTG ATTGTCTTTA ACTCTTATGC GTAGTGCAGA GCGATGTTCC TTTAATTTGA 72 0 

AATCGGGTGT ATCAATATAG TAATTGACTT GCTTAmAAAG CACACTGTCT TTAAAATATT 78 0 

TCTCTTGCAA TTTATTATAG ATTGAtGCAG TTATCATTTG TtTAAATTCT ATtTCATGAT 84 0 

TTGTTGCCAT GATATGTATA CACCTCGTAT CAAATTCAAT TTATCTTAAC TATATTATGA 900 

ATGACAAAGT TGAATTTTAA AAGTAATTTC CTTTATCTAT TATCAATGTT AATTTGACCA 960 

TTAAAAATAG TGTTCGTAAG TGTTTTGTAT TATTGaATTG TGTTAAAATG TTATGGAATA 1020 

AGAGGAGGAT TAAGCATGsG TTTwTATATT AATGAAATTA AAATTAAAGA TGACATACTT 10 8 0 

TATTGTTATA CAGAAGATTC TATTAAAGGA TTATCTGAAG TAGGACAAAT GCTCGTTGAT 114 0 

AGTGATAATT ATGCCTTTGC GTATACATTA GATGATGGTA AAGCGTATGC TTATCTCATT 12 00 

TTCGT 1205 
(2) INFORMATION FOR SEQ ID NO: 572: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 570 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 572: 

TGAAGAAATA GCAATGATGA AATGCCGTAT GTTATATGAG ACGGGGTCAT TTCTTGAATT 60 

AAGAGAAGAA ACAATTGTCT TATTGAAAAC TGGCATACAA CAATATGATG CATTGATGAT 120 

TTATTACGTG AAAAGTTTGA TTGGTTTGGG ACAATATTTT GAAGCGGTAG AAGTAATTCA 180 
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ATTTGCTAAG TCAAAATTAA TTGAAGATGA AAAACGATTG ACTCAGTCAT TAGCTGATTT 300 

TGrTACGTTA TCAATGAGGG AACAGACXiCA CTTGATTTTG AAGTTAATAG ACAATGGTCA 360 

TTTTCAATTT CAAGAAACGG TATTATATAT ATkAAaAyCT AATaCGTACa GTtATAACCT 42 0 

CATTAGTTTA ATGATTGAGT ATTTAAGGTT CGCAAATTGT ACACAAGAAC TGACAATTGA 4 80 

AAAGTATGGT ATGGATGTAA CTTTTGTACC AGd'AATTTA AAAGGGCTAG AACATACAAC 54 0 



ACTTAAAGAA AAAGTTATAC CTAACGTTAT 570 
(2) INFORMATION FOR SEQ ID NO: 573: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 939 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 573: 
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GTTGAATGGT 


TAGCAGCTGC AGTTGTATTA TATTTCTGTG GTGTAATTGT 60 

GTATCATTCA TGTCCTTTAT TGCAATATTT ATCATTGCTG CATTATCAGG TTTAGTCAGC 120 

TTTATTCCTG GTGGTTTCGG CGCTTTCGAT TTAGTTGTAT TACTAGGATT TAAAACTTTA 180 

GGTGTCCCTG AGGAAAAAGT ATTATTAATG CTACTTCTAT ATCGTTTTGC GTACTATTTT 240 

GTACCGGTAA TTATTGCATT AATTTTATCA TCATTTGAAT TTGGTACATC AGCTAAGAAG 300 

TACATTGAGG GATCTAAATA CTTTATTCCT GCTAAAGATG TTACGTCATT TTTAATGTCT 360 

TATCAAAAGG ATATTATTGC TAAAATTCCA TCATTATCAT TAGCAATTTT AGTATTCTTT 420 

ACAAGTATGA TCTTTTTTGT AAATAACTTA ACGATTGTkT ACGATGCTTT tATATGATGG 480 

AAATCACTTA ACGTATTATA TTCTATtGGC AATTCATACT AGTGCTTGTT TATTACTTTT 540 

ACTGAATGTA GTTGGTATTT ATAAGCAAAG TAGACGTGCC ATTATCTTTG CTATGATTTC 600 

AATTTTATTA ATCACAGTGG CGACATTCTT CACTTACGCT TCATATATTT TAATAACATG 660 

GTTAGCTATT ATTTTTGTTC TGCTTATTGT AGCTTTCCGT AGAGCGAATA GGTTGAAACG 720 

CCCAGTAAGA ATGAGAAATA TAGTTGCAAT GCTTTTATTC AGTTTATTTA TTTTATATGT 780 

TAACCATATA TTTaTTGCTG GAACGTTATA TGCATTAGAT ATTTATACGA TTGAAATGCA 840 

TACATCTGTA TTGCGCTATT ACTTCTGGCT TACGATTTTA ATCATCGCTA TCATCATAGG 900 

TATGATTGCA TGGTTGTTTG ATTATCAATT TAGCAAAGT 
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(A) LENGTH: 1059 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 574: 

GAATTAATTA AATATTACAC ACAGCCTCAT TTTTCATTTT CAAATAAATG GCTATATCAA 6 0 

TATGATAATG GAAACATTTA TGTTGAACTT AnGAGATATT CATGGTCAGC ACATATATCT 12 0 

TTATGGGGCG CTGAAaGTyG GGGAAATATT AATCAGTTAA AAGATCGTTA CGTAGATGTG 18 0 

TTTGGACTAA AAGACAAAGA TACTGATCAG TTATGGTGGT CTTATAGAGA GACATTTACA 24 0 

GGTGGCGTTA CACCAGCCGC AAAACCTTCT GATAAAACTT ATAATCTTTT TGTGCAATAC 30 0 

AAAGATAAAC TACAAACGAT TATTGGTGCG CATAAAATAT ACCAAGGCAA TAAACCAGTA 36 0 

TTAACATTGA AAGAAATCGA TTTCCGTGCA CGAGAAGCGT TAATAAAAAA TAAAATATTA 42 0 

TATAACGAAA ATCGTAATAA AGGTAAGCTT AAGATCACCG GTGGCGGTAA TAACTACACT 4 80 

ATTGATTTAA GCAAAAGATT ACATTCAGAT CTAGCAAATG TTTATGTTAA AAATCCTAAT 54 0 

AAAATAACTG TTGACGTCCT CTTTGATTAG TATATGAAGG TGACTTATAC TTCATGCACT 60 0 

TTAATTCCAA ATCAGATTAT TTAAATGATA ATTTTTAAAG TGTATGATGT ATATAATAGG 66 0 

TAAAATTTTC TATATATTTA AATGGAATTG GGAGTAGGAA TGTGACAGAA ATAGTATTTT 720 

ATAAAATTTA TTCtTGTCAC TCCCCAACTT GCACATTATT GTAAGCTGAC TTTCCGCCAG 780 

CTTCTATGTT GGGGCCCCGC CAACTTGCAT TGTCTGTAGA aTTTCTTTTT GAAATTCTCT 84 0 

ATGTTGGGGC CCCGCCTATA ATTGAAAAAT GCTTGTTACA TGGGCATTTT CATTCGGTCA 90 0 

ACTACTACCA ATATAATATT GtAGaGCCTA AGACATTGAT TTATTATGTC TTAGGCTCTA 960 

TTCCTTCATT TAATGATTAA nTTATTATAG CAATACTTTA TTGTCCCATG ATTAGTGTTC 1020 

TTTTAATGAG ACATAGTAAC TATAAAGTTT AATAATCGT 1059 
(2) INFORMATION FOR SEQ ID NO: 575: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 574 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 575: 
GTTTGCTTTA GGTCCTGTTT CATATTTATA CTTCGAAGGA TTTACCTTTT TGaAGTCTGG 6 0 
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TTCAACTkGC TTTTTATTCT TTTCGAAATC AGCTGGTTGA GTAGTTATGA GTTCATTATT 180 

TTTATTAGmA TAAATCTTAC CATTAACATA TTTATAATCT TTTGTTATAA AGTCACCATT 24 0 

TCTGAATGGA ACTACTTGAT TATGACCTTT AGAGAATAAA TCAGTACCGA ACATTAAATA 30 0 

GTTCTTCGTA TCTATACCAG CCAAATGTAA AATTGTTGGC ATTACATCGA CTTGACCAGC 360 

ATATTCATTA TTGATACCAC CAGATTTACC AGGGATTTTA ATCCAGAAAC CAGTTCTGTT 42 0 

TAAATCTGTA AATTTAGCCG GTGTGATTTT TTCACCTAAT AGTTTTTCCA TGGCATTGTT 4 80 

ATGGTTTTCA GAGATACCAT AGTGGTCACC ATAAATCATA ATCACTGAAT TGTCATATAA 54 0 

TCCTTTTTTC TTCAAGTCAT TAATATATTC TTCT 574 
(2) INFORMATION FOR SEQ ID NO: 576: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 796 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 576: 
CAATGTTTTA TAGTACAATA TATTTTnAAT AATACTCGTT AAGGAGAATG ATATGATATC 60 

AATTCACGCA ATTTCGACAG GAAAAATCCA AGATTTGCCT TATAGCTCGA AAAGACCGAT 120 

GCGCTCTGCT TTAGATAAAA CTAAGATTTC ACAAACAACA TGGTTATCTT CAACTGGTTT 180 

CACTGGTGAT GAACAGGCTT ATAAAGATCA TGGTGGACCA CATAAAGCAG TTTGTGGGTT 240 

TAQUPJ^QQAT AATTATGCAC TGTATCAAGA TGATTTACCT ACACTACCTA CTCATGCGAT 300 

GTTTGGAGAG AATTTAACAT TTGATTATTT AGACGAATCT GATGTTTACT TTGGTAATCA 3 60 

ATATCGTTTA GGTGAAGCGT TAATTGAGGT TTCTGAAATT AGAGAACCAT ACTGGAAAAT 4 20 

TCAAGCAAAA TATAATATTC CTGATTTAGT GAAGCGCATG TCTACATCTG GTAAAACAGG 4 80 

TTTCTATTTC CGGGTATTAA AACAAGGCTA TGTATCTCCA AATGATCAGC TTTACTTAAT 54 0 

ACAAGAAGCA CCAATCGAAC ATCGTTTATC TGTACAACAG CTTAATGACC TTTATTATAA 6 00 

TGATAGACAA AATCAAGaTA TGTTACGATA TGCACTAAAC AATCCATTTC TGTCACCAAC 6 60 

AAGACGCGAT AAACTTCAAA AAATGTATAA CAGAACATTG GAAATAATTA CCTTTCATTn 7 20 

ATAAGTGTTA AATGAACTTT TCAAAACAnA AAGGAATCAA CTTCACACAT CGTTTGTATG 780 

AATAGTCTTA TCTATA 7 96 
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(A) LENGTH: 1095 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 577: 

AGAAATTATG ACAAAATATA AAGATGGAAA GTTAGTTTAT GCATCAGTCG AACCAGGATC 6 0 

TTACGTAATA CATAAAGATG ATGCAATTAA ATATGACGAT TATTCTAAGT TAAAAAAATT 12 0 

AAGTCAGCTA ACTAAACTTG ATCATCCAAA ACCAGTTCCA TATAGCGTaC TCAAATCAAA 180 

TCTTTCGGAG TACCTTTAAC AAGCGTTTCA TTTATGACAC ATGGATCAAA GGATACTAAA 240 

GATGAAGTGT TGCCGGCATT GGCCTATTTC ACTTTTTCAC CAAAAAATTA TGAAGACAAG 300 

TCTAATCCAG ATCCAAAAGT TTTAAATTTA GTACATATGG ATTTCTTAAA TGCATCTAGT 360 

GATTTTGGTA ACGCACATTT TGTTGTTTTA AGTAAATATA TTAAAGAGTA TGAATCAAAC 420 

TATGAAACAG CGTCAGATGA TTCTTTAAAA TAGTATTTAC TGTGTGAAAA ATAAATAGTG 480 

TACTACATTA AATAATCGCA ATAATAATCC CGATAAACAA TCAGCATTAC TGCTTATCAC 54 0 

ATAGAGTTCG TAATAACTAT AACTCTATGA TTCGCAAATA ATAAATGATT GTCATCGGGA 60 0 

TTTATTTTTA TCAATTTATA AAGTGACATT ACCTTGTTCA TCAGCAGGTT TGAAAACAGT 66 0 

AATCACTGCA CTAATAATTG CTAAAATGTG TGGGATACCT GTCCAACAGA ATATTAAGTG 720 

TAGAATACCT TGCATATTCT TGCCGGCATA AAATTTATGA ATACCAAAAC TACCTAAGAA 780 

CAATGCTAAT AAAATATAAA TAACTTTGTT TACTTGCATT TCTTTCCCTC CAGTTGAATT 84 0 

GCTTATAATG ACATTAGCTT CTCTTTTTAT TATACCCACT TTTAGTTCAA ACATTCTAGT 900 

TTAAGCATTC CCAATCATCT AAATTTCAGT TATTCAATCC TTACAATAAA TTTAGGATTA 96 0 

CATTTCAGTT GCATTGTATT ATTTTACGTG TGAAATATAC GTAATGAATC ACATGACAAy 102 0 

CTyCAAATTG AAAAATATAC ATTCTATGAT GTAAGGTCGC ATTTTTAATA TATTTACGTn 108 0 

AAAATAGTTT GGATG 1095 
(2) INFORMATION FOR SEQ ID NO: 578: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 489 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 578: 





EP0 786 519 A2 






ACTGCCPAPJi TAATCATAGC CATGTTCTAC AGCTGCTTTC GCTACAATAT CCAAACGCAT 12 0 

TTCAAAACAA GCGGTACAAC GTAAGCCGCC TTCTTTTTCA TCAGCTAATT CTTTATCCTT 180 

CACCATTTTC ACAAACTTAT GTGGTTCATA AGGTGCTTCA ATATACTTCA CATTCGCACC 24 0 

AGTCTTGCGA TTAAAATCTT CCACAAATTG TTCTTGCACT TTAGCACGTC GTAAGTACTC 3 00 

ATTTTTCGGA TGAATATTTG AATTCGCGAA ATAAATTGCA ATGTCTGCAT ATTGTGTTAA 36 0 

AAACTCTAAT GTATATGTAC TACAAGGTGC ACAACAACTA TGCAATAAGa TTTTAGGTCT 42 0 

GATTGCTTCT CTTTCCCACT GsCCGATTAA TnTCTTCAAC ACCTTGTCCA TAATTAATTT 4 80 

GTnGATTTn 489 
(2) INFORMATION FOR SEQ ID NO: 579: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1287 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 579: 

CGTACTGCAT CATTTGTTGA AAAATTTGCA AACTTCTTCA AGATGTTATG GCTTAGACTT 60 

AAAGCGATGA AGCACTACAA AGCCTTAAAT AAAGAATCTA AGAAGCAAGA ATTTGAAAAT 120 

TCATTCAAAG ATGTTCAAAA AATTATGCGT ATTGTGAATC ACAATATTAT TTTACGCTTA 18 0 

AAAGAAGAAC AAAATAGTAC AAATGTACTT GAGGTTAGCT TAGTCATTAA TCATTACTAT 24 0 

GATATGAGTC GCTCATTAAA GTGGCGTGCA CAACGTCGAA AAGAACGTCA AGAAAACAGC 3 00 

AATCAAATCA TACCGCAAGC TATGTTCCAT AACCACAAAT TGGAAGCATT GTACTTACAA 360 

CGTCATCTTT TAGATGAATT AATTCGCAAA AATAAAATCA ACAATATCGT TGCAGCTCAA 420 

ATTCGAGAAA ATATCAATTa CAACGAAATT GTCTTGTCTT TACAGTCCAA ACATTaAGCA 480 

AGaCwTaCaw TmCCCCCGTA CATATGAGAC AAAGTCATTA TCATCTCATA TGTACGGGGT 54 0 

TTTTATATTC AACATCAAAA AATCAGATTG ATGAAAAGTA AATAACCTTT CATCAATCCG 6 00 

ATTTGATTAT AGA^TCTATT TTTTAAGTTT AAATGGAATT GTACATACGT TAACATTCTT 6 60 

TTGATAAATT AAATACAATT TCATACGCAA ACTAGTTTGA TTGTGTAATA AATTATGCCA 720 

ACGTTTCTTA GTAATAAATT CTGGTATCAC TACTGTAATC ATATAGTTTT GATCGTTGGC 7 80 

TTTACGATTA ATCTTATCGA TAAAACGTGA AATTGGTCGT ATAATACTGC GATATTCCGA 84 0 
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TGCGTAATAA ATAGACTTAT CAACTGCTGT TGTAATACTT GTTATCGGCA CAATTGCTAA 1020 

ATTACGATCT ACCACGTCCA CATTAAGAAC ATCAATGTCA GAACGTAATT GTTCTGCGAT 10 80 

ATCTCGATAA TGkTTGkTAA TTTTCAAGAA GAAAATCACC ACGAACGGCA AGAAAATAAG 114 0 

TATCGGCCAT ACTTGGCTAA ATTTAGTTAT GAGTAAAATC cATAAnAACA ATAAATGTCA 1200 

CGATACCACC AAGTAAGTTC ACAGACAACT TACTTAACCA ATTCTTAGGA CGTTCATGAA 12 60 

TCCATTTAAT AACCATACCG AATTGTG 1287 
(2) INFORMATION FOR SEQ ID NO: 580: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1223 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 580: 

GCCTCCTTTC CTATATCCAT GGGTGTCCTA GTTGGGAATG GCTTATTCCC TATCCCAAAC 6 0 

ATCTGTCAAT TCAAGACTAT CACTGTATAC TAAATCGCCT nTCTATACnT CCATnCATTT 12 0 

CATTTTCCAG TTGGAACAAT TTGTTTAATT ATTTTAAATA TCATTATCAA TCCAGAAGTA 18 0 

TTTACTATTC ACTTTTACAA TAATCAATCA TTTAACTACA CATGGGTAGT CGGTGGTTTG 24 0 

CTTGGGGTTA GCTTTTTAAC TGGCAATTTA TTGTTATTGC CAAAATTAGG TGCAACATTA 3 00 

ACTGTAATTG CAACAGTTGC GGGTCAAATT ATTATGGGTG TCATTATTGA TACATTTGGA 36 0 

TTATTTGGCG CTACAATTCA TGATTTTAAT TTAATTAAAG CAATTGGAGT ATTGTTACTC 420 

ATTGTCGGCA TCGTCATAAT GAATCAATTT AACAAGAATA ATTTATTACT AACTGATCAA 4 80 

AAGTATTTAC TGTTTTGGCT TCTATTAGGA TTTATTTTTG GTTTCTTTCC ACCTATTCAA 54 0 

ACGACAATTA ATAGTGCTTT AGCTAGTCAT ACTCATTCAC CAGCCTTTGC ATCATTAGTA 600 

TCATTTACAA TTGGGTCAAT AGCGCTATTG ATTTTAACCG CTATTTTTAA TCGTTCTTTA 66 0 

AAACTAAAAA CAAGTCATTT AAAATTCGGT AAATTAAAGC CTATCTATTT TACTGGCGGT 720 

ATACTTGGCA TGGCTTTTGT AACAGCTAAC ATTATCTTAA TGCCTCATAT GGGTGCAGCA 780 

TTAACAACAC TTATTGGGAT GTTTGGCCAG ATTCTAATGG GCATATTGAT AGATCACTTT 84 0 

GGATTATTTG GTTCACCTAA AATAGCAATG ACATCCAGAA AAACTATTGG TCTATTATGT 90 0 

ATTTTGACAG GCATTATACT TTTAAGATTA TTTTAAATTA ACTTTTAGCT TATCATTTTA 960 

ACTTGTAATT ATTTTTAAAA GTGATAAGCT ATTTTTTTGT GGTCTAAAAA TCTTTAGAAA 102 0 
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CAACTCATTC TTAAGACCTA AATTAATGTT ATnTTTTAAT AATTTACACC AAATTAATAG 1140 

CAAAAATTAT GTTATTCGTG CTAATATTTC ATAGTTGGTT ATTCAATTAA TTAAAAATAA 1200 

GTCAAAATGC ACAACTTTTT ATn 1223 

(2) INFORMATION FOR SEQ ID NO: 581: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 454 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 581: 

CACCTTTTTA TCATGCTTAG TTATCAATAA ATCTATATTT TGCTGTTTTA CAATTTTTTT 60 

AACTTTATCA ATCTCATTAT CTTGGACTAA ATAAATATAT GATCTTGCAT CTGTTGCTAG 120 

ACCTTGTTCG TGTTTTTCTG ATAAAACATA TGTGATGGAA GCGTGAATAA TAATGCCTAA 180 

TGTAACAAAA CTGATAATTA ATATACTGCT TATCAATAAC ATTAAGCGGT GGTGAAACTT 240 

CATCATTGTT CTTTAGGTCT TTCCAATTTA TAGCCTAAGC CACGCACAGT TTTAATAAGT 300 

TGTGGCTTCT TAGGATTATC TTCTAATTTA TCTCTTAAAT GACTGATATG TACATCAACA 360 

ATTCTTGAGT CTCCTGCAAA TTCATAATTC CATACCGTAT TTAACATATG CTCTCTCGTA 420 

ATGACTCTGC CTTGTCTTTC TATCAAATAA AGCA 454 

(2) INFORMATION FOR SEQ ID NO: 582: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 452 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 582: 

TTTATAAGAT TTTATTTACA AATAATTGGT TTTCATATGT ATAAACACTT TTGACTTTCA 60 

AATCTTAACG ATAATTCTAT TACAATACAA TCCCCTATTA GAATGATTTA TGTAATAAAA 12 0 

AAAGCGGAGT TTCCCCCAGC TTTTCTAAAC GACTACATAA AATATAAGAT TGCAATTAAA 180 

TGCAATAGTG ATGCTATTAC AATAAAAAIA TGCCAAATCA TATGAAAATA TGGTCTATTC 24 0 

TTTTGTGCAT AAAACCATGC ACCAATTGTA TAAGACACAC CACCTAAGAA AATGAATAAT 3 00 

ATnAATATrr ATnATCTacr. AATAAAAATA ATTGGTAACA AGATAATACC TACCCAGCCC 3 60 
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AAAATCCCCC AAAGTGTCGT TCCCCATAAT AA 4 52 
(2) INFORMATION FOR SEQ ID NO: 583: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1472 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 583: 

CGCTTTTTGC AATAAGTCAT TAGCCGCTTT TAAGCCTTCT TCTTTTCGAT CTACAACAAG 60 

TAPiPATPAAT GGCTTTAACG CTTCTTCTTT TTCACTTTCA AGCATATCTG GTTTTTGAAC 12 0 

CATTTCAAAT GGAGATTTCA ATCCATTATT ATCGCTCATT TCAATAATTG CATCATACTG 180 

TGCTTGTGAC ATACTTGCAA TAGCCTGTTT TGCATTTTCT TGAAGGAAAT ATAAGTTTTT 240 

CAATTTAGGA TGCTTATTTA ATGTACTTAA TGTAATCGGT GTAATGTCTT TCTCATAAGA 300 

CACTTCAATC ACTGTACTAT TTGTTCTACC AGGAATTGGT GGTTTTTCAT GAATATGCTT 360 

TGATACTTCT CCAATTCCAA CGACAGATTG ATTTTTCGTT CGATTATAAA AAATAATATT 42 0 

GTCGCCTTCT TCTAACTGAG TATAAAAATG ATAACCATTA CGTTTAATAC CGTTGTACGT 4 80 

GTGCGTATAA ATCGTATATT GGTTTCCAGG TTCAAATTCT TCAGTTTCAG CTAAAAAGAA 54 0 

ATAACGCGGT ATCTTAATTT CGCCTTTACC AAGACCACTT ATTAAATCAA ACTCTTCTGC 600 

AGTGATTTGA TTGAACAATG TCTCTTTCAT ATnACTTATA CGAAATTCCA AAGCTTCACT 660 

ACGCTTTAAA TAATCTGCTG TTAATGGTTT CAATTGTTCA TTAAAACGAA ACTGTACACG 720 

TATTTTATTT TGTGCACCTG TTTCAACACT AATAATTTCA CCACATCCAA GTAGTCCAGT 780 

ATCCGTCTGA ACTTGATAAA AGATGACTTG ATCTCCTACT TTAGCCTTTT TAAACGCTCT 84 0 

AAATCCTTGA GATGGGTTAA AATGTGCGCC TGATTCAAAT AAAGCTGTTT GTCCTACTAA 900 

CGGTTCATTA TGATTCCAAC GGTTATATCC ACAATTCAAC CAAAAATAAT TCGTTTCTGC 96 0 

TGTCATCTTA ATACTCCTTA ACCTGAATAA ATTTTAGAAA CACTATGAAT TACATTCTTT 102 0 

tAGTGTTTCT TATGCAGTTG GACGCGTATG CGAACAACTG TATACCCTTT GTTCACTGCG 108 0 

ATTTTAATCG CATTTCCTAT AACATTGTAG CGCCCAGGAC ATTAATTTAC GTCCCAGACC 114 0 

CTTATCGTTT TCACTTCTAA GTAAGTCGAA CTATTITGCT TTACAACAAG TGCGACTCTA 1200 

AATACAGTTG GACACACATA CGAGCAACTG TATACCTTTT AATCAGTTTT CTATATTTTA 126 0 

TTTATTATAT CTGTCTTAAT GATAAAAATT GTTACAAACA GTTTAACATA TTTAGCTACC 13 20 
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ATGATACCAC TATGCTTGCn TATCTCTATA GCGCCATTGA TACACATTTT TAAATATCTA 1440 

TACTGCCGTT AGAATTTTAT CATGTCTnAA TT 1472 

(2) INFORMATION FOR SEQ ID NO: 584: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 787 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 584: 

GGTAGTGAAT GGGGTTCAAG ACAACAGTAT TGGAAGTACG AATGAATCAC AGTTTTTAGG 60 

AAATTATATT GTAATTAAGC ACGCAGAAAA TGAGTATAGC TTAATAGCTC ATTTACACCA 120 

ATATTCAATC ATTGTGAATG AGGGGCAAAA TGTTAAATAT GGTGATATCA TTGGGAAGGT 180 

TGGGAATTCT GGCAATTClA CTGaACCTCA TATACATTTT CAAGTAATGA ATGATAAGAA 24 0 

TATTGAAGCA TGTACATCTT TAAAAATTCG ATTTATAAAT AATCGAGAAC TTATCAAAGG 3 00 

GGATGTGGTC TGCGGATTAC AAGCTGAATG ATGGCGATAC TTATAAAATC TCGACACTAT 360 

AAAAATGGTA TAGTGTCGAG ATTTTCTTGC TTATTTAGTT AATTCAAAGT GCACGCCGGA 420 

TTCATTAGAA GTCGACGTAT TTTTGTTTGT AATAGAGTAA CCGGTCATTG AAATTTTAGA 48 0 

TTCAATATCT GAAGCGGAAT TTGTAGATTC AGGATTATAG AAGCTACATT CATAAGTGTT 54 0 

ATCATCTTTC TTTTTAAGTA TAAACATACC TTTGGCTTTA ACTTCGACTT TAGTGTTGTT 600 

AATGTCAAAA GTTTGAGTGC TATTACTATA ATTAACACCA GCCCAAACCG ATTCATTATC 66 0 

TTTCACAACG GGGAAGTCAT CTTCTTGCTT AACGACGTnA CTTCATCTTT CTCTGTCTTA 72 0 

AAGACATCTT TAGATAAGCC TGGATACAAC ACATATCCAT ATTTATTGTC AGAATTAGAA 78 0 

TGCTTTT 787 
(2) INFORMATION FOR SEQ ID NO: 585: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 830 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 





(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 585: 

Ar;annr:A.ATA TTTArr.ATAA AGAGACCAGA AAAATAATAA AACACATGCT 
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TAAAATGAAG ACTATTTTTT ATTACAAGAA AATGTATCTA GTAAACTTAA AGTAGCAAGA 180 

CCTAATAAAT TTAATGCATG TTGTGCACCT TTTTTACCTT GGCCAGCTTC GAAATGTTTG 240 

TAAGCAGCTA CACTTAAAAT GCCTATCGTT GATAGTGATG CAAGGCGAGA AATGTTTTTA 3 00 

TTGATAAAGC TAGCTGAGTA TAAAGCAGCA GTAGTTGCTT CTGCAATGCC GACGTATTTT 360 

ACAAGTTCTT TTTGCAAGCC AAAAGTATGT TCAAACAGTT CAATCATACC CTTATCTTCT 420 

TGCAATTTAG GTTTACTGGC TTGGTATAGC TCTTTCGCAA GTTTTAAATT CGTTGCGTAA 4 80 

CGCAAAATCA TATTTAATTC CTCCCAATAT TTGATTTTTT GTGAAAGATG ATTACTTTAT 54 0 

CATTTTTACC CGTTTCTATA AAAATGAATC AATTATGTAA CGTATGTGTA GTTTAGGAAT 600 

GTTTGCTATG GAAATATAAT TCTGTTCACT CAAAATGTAT GAAATTAATG TGTAGTTTTG 660 

TCGAGTTGCT CTTTTAATTT GGTTAGATTG TTTTTTAGAG AAGCGGTACT ATTTTTAAGT 72 0 

GCATCAACAG ATTTACCTTC GTTTTGAGAC ATTGAGTTTA TTACAGCACG AAGTTCTGTT 7 80 

TCTAGTATGT CaGCGTCGCt TTAGCATTAG AACTTAaTAt TTAtAcTCTT 830 
(2) INFORMATION FOR SEQ ID NO: 586: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 412 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 586: 

TTAGGACGTT TTTACAATCA GTACAATGAG CTCATTGTTA TTAGTCCTTT AACGGCGTCT 60 

TTTAATGCTG GCGCTACATT TGGGCGATTT CATCATTTAA TTGATACTGA AACTTTAGCA 120 

AAATTAGAAC ATGAAAAAGG ACATTATTAT CAGAAGATGA TATGTGATGA CAATGTAGAA 180 

ATGATTTCTA TAAATAACAT ACCGAAATAT CCGAGAAATC ATAATGTATT AACTAATCAT 240 

GACTCATACG AATATTCATT GAATTTAGGA AGTAGTAATA GTTATTCAAA GTATGAGCTT 3 00 

ACCTTAGATG ATATTTATGT TGGTGCTACC TTTtAACAAA TTATATTTAT ATTCTAGCCm 3 60 

ACtAAATAAA AGGGkaCtaT TTGaATCaAA CmATaTGtAT TAACCTTTTT TA 412 
(2) INFORMATION FOR SEQ ID NO: 587: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4709 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 587: 

TTCAGTTTGA AATTAATCAT ATAAATTTCT TATGGGAGGG TTGATATCTT AATGATTAAC 60 

ATTATTTCAG CTATAGGATC TATTGGAACA TTTATTATGG CTTTATTTTA TTTTGTATCA 120 

GTTTCAGTTC AACTTTATCA AATGAAAATT AGCTTTCTGC CAGCTTTAGG TTTTAACCAA 180 

ATTTTATTAG AAAGGGAGGA GGATCAACTT AATATAATGA ATTCGGCAAC AGAAGAGCAT 24 0 

CATCATAAAG ATTATATTAA ACTATATAAT TTAGGTGOCG GTGCTGCTAA rAAAATTGCA 3 00 

ATAGAGGTTT TATTGGGGAA GGATArAGTC ATTCAGAAAA AATACGTGCA TATTkTACCT 3 60 

AGTAAAGAAG GGTACATGTT ACCAATTAAT AAAAATGTGT ACGAAGAATT AGAAAGAACG 420 

ATTGAsAACA ATGGTCATGA AGCTGATTTG AATGTACGTA TGACTTATTA TCATAATGTA 4 80 

AGTCGCAAAC AACAGGAAGT TATATTAAAA GGTCAAATCG ACCGTTTTAA TACTTATAAT 54 0 

AATAAAGAAA TTTATGATTT GCAGTTTATC TAAAAATTGA TTTAAGAGGG TAGTTGTTTA 600 

TTGCGPJ^J^J^J^. TATCATTCAA TTTTAATGAA ATAATGGCGT CATTACTATA AAATATTACT 660 

TTATGTTGTA ATGCATTTTT CTATAAGATA GAACTAAAAG GAGGGGCAAA GATGCAAATT 720 

AGACAAATAC ATCAACATGA CTTTGCTCAA GTGGACCAGT TAATTAGAAC GGCATTTGAA 780 

AATAGTGAAC ATGGTTATGG TAATGAATCA GAGCTAGTAG ACCAAATTCG TCTAAGTGAT 840 

ACGTATGACA ATACCTTAGA ATTAGTAGCT GTTCTTCAAA ATGAAGTTGT AGGGCACGGT 900 

TTACTAAGTG AAGTTTATCT TGATAACGAG GCACAACGGG AAATTGGATT AGTGTTAGCA 960 

CCrGTATCTG TTGATATTCA TCATCAAAAT AAAGGTATTG GGAAGCGATT GATTCAAGCA 1020 

TTAGAACGAG AAGCAATATT AAAAGGATAT AATTTTATCA GTGTATTAGG ATGGCCGACG 1080 

TATTATGCCA ATCTAGGATA TCAACGCGCA AGTATGTACG ACATTTATCC ACCATATGAT 1140 

GGTATACCAG ACGAAGCGTT TTTAATTAAA GAATTAAAAG TGAACAGTTT AGCGGGAAAA 1200 

ACAGGTACCA TAAATTACAC ATCTGCTTTT GAAAAAATAT GATTTCAAGC TAGGATTACA 1260 

TTAGGTAGAG TTCATATTAA TAATAAAAAA TGTTTGCAAT CAAATCGTAC GTTGTCGTTT 1320 

GTAATTCTTA AAATAGCAAT AAATAAAATG TTTGTTAGTA AAGTATTATT GTGGATAATA 13 8 0 

AAATATCGAT ACAAATTAAT TGCTATAATG CAaTTTTAGT GTATAATTCC ATTGACAGAG 144 0 

ATTAAATATA TCTTTAAAGG GTATATAGTT AATATAAAAT GACTTTTTAA AAAGAGGGAA 1500 

TAAAATGAAT ATGAAGAAAA AAGAAAAACA CGCAATTCGG AAAAAATCGA TTGGCGTGGC 1560 

TTCAGTGCTT GTAGGTACGT TAATCGGTTT TGGACTACTC AGCAGTAAAG AAGCAGATGC 162 0 

AAGTGAAAAT AGTGTTACGC AATCTGATAG CGCAAGTAAC GAAAGCAAAA GTAATGATTC 16 80 
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GTCAAACACT AATAATGGCG AAACGAGTGT GGCGCAAAAT CCAGCACAAC AGGAAACGAC 18 00 

ACAATCATCA TCAACAAATG CAACTACGGA AGAAACGCCG GTAACTGGTG AAGCTACTAC 186 0 

TACGACAACG AATCAAGCTA ATACACCGGC AACAACTCAA TCAAGCAATA CAAATGCGGA 192 0 

GGAATTAGTG AATCAAACAA GTAATGAAAC GACTTCTAAT GATACTAATA CAGTATCATC 1980 

TGTAAATTCA CCTCAAAATT CTACAAATGC GGAAAATGTT TCAACAACGC AAGATACTTC 204 0 

AACTGAAGCA ACACCTTCAA ACAATGAATC AGCTCCACAG AGTACAGATG CAAGTAATAA 2100 

AGATGTAGTT AATCAAGCGG TTAATACAAG TGCGCCTAGA ATGAGAGCAT TTAGTTTAGC 216 0 

GGCAGTAGCT GCAGATGCAC CGGTAGCTGG CACAGATATT ACGAATCAGT TGACGAATGT 2220 

GACAGTTGGT ATTGACTCTG GTACGACTGT GTATCCGCAC CAAGCAGGTT ATGTCAAACT 228 0 

GAATTATGGT TTTTCAGTGC CTAATTCTGC TGTTAAAGGT GACACATTCA AAATAACTGT 234 0 

ACCTAAAGAA TTAAACTTAA ATGGTGTAAC TTCAACTGCT AAAGTGCCAC CAATTATGGC 24 00 

TGGAGATCAA GTATTGGCAA ATGGTGTAAT CGATAGTGAT GGTAATGTTA TTTATACATT 24 6 0 

TACAGACTAT GTAAATACTA AAGATGATGT AAAAGCAACT TTGACCATGC CCGCTTATAT 252 0 

TGACCCTGAA AATGTTAAAA AGACAGGTAA TGTGACATTG GCTACTGGCA TAGGTAGTAC 258 0 

AACAGCAAAC AAAACAGTAT TAGTAGATTA TGAAAAATAT GGTAAGTTTT ATAACTTATC 264 0 

TATTAAAGGT ACAATTGACC AAATCGATAA AACAAATAAT ACGTATCGTC AGACAATTTA 270 0 

TGTCAATCCA AGTGGAGATA ACGTTATTGC GCCGGTTTTA ACAGGTAATT TAAAACCAAA 276 0 

TACGGATAGT AATGCATTAA TAGATCAGCA AAATACAAGT ATTAAAGTAT ATAAAGTAGA 282 0 

TAATGCAGCT GATTTATCTG AAAGTTACTT TGTGAATCCA GAAAACTTTG AGGATGTCAC 2880 

TAATAGTGTG AATATTACAT TCCCAAATCC AAATCAATAT AAAGTAGAGT TTAATACGCC 2 94 0 

TGATGATCAA ATTACAACAC CGTATATAGT AGTTGTTAAT GGTCATATTG ATCCGAATAG 3 000 

CAAAGGTGAT TTAGCTTTAC GTTCAACTTT ATATGGGTAT AACTCGAATA TAATTTGGCG 3 06 0 

CTCTATGTCA TGGGACAACG AAGTAGCATT TAATAACGGA TCAGGTTCTG GTGACGGTAT 3120 

CGATAAACCA GTTGTTCCTG AACAACCTGA TGAGCCTGGT GAAATTGAAC CAATTCCAGA 3180 

GGATTCAGAT TCTGACCCAG GTTCAGATTC TGGCAGCGAT TCTAATTCAG ATAGCGGTTC 3 24 0 

AGATTCGGGT AGTGATTCTA CATCAGATAG TGGTTCAGAT TCAGCGAGTG ATTCAGATTC 3300 

AGCAAGTGAT TCAGACTCAG CGAGTGATTC AGATTCAGCA AGCGATTCCG ACTCAGCGAG 3360 

CGATTCCGAC TCAGACAATG ACTCGGATTC AGATAGCGAT TCTGACTCAG ACAGTGACTC 342 0 

AGATTCCGAC AGTGACTCAG ATTCAGATAG CGATTCTGAC TCAGACAGTG ACTCGGATTC 348 0 
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CGATTCTGAC TCCX5ACAGTG ATTCCGACTC AGACAGCGAT TCAGATTCCG ACAGTGATTC 3 600 

CGACTCAGAT AGCGATTCCG ACTCAGATAG CGACTCAGAT TCAGACAGCG ATTCAGATTC 3 660 

AGACAGCGAT TCAGATTCAG ATAGCGATTC AGATTCCGAC AGTGACTCAG ATTCCGACAG 372 0 

TGACTCGGAT TCAGATAGCG ATTCAGATTC CGACAGTGAC TCAGATTCCG ACAGTGACTC 3 78 0 

AGACTCAGAC AGTGATTCGG ATTCAGCGAG TGATTCGGAT TCAGATAGTG ATTCCGACTC 3 84 0 

CGACAGTGAC TCGGATTCAG ATAGCGACTC AGACTCGGAT AGCGACTCGG ATTCAGATAG 3 900 

CGATTCGGAC TCAGATAGCG ATTCAGAATC AGACAGCGAT TCAGATTCAG ACAGCGACTC 3 96 0 

AGACAGTGAC TCAGATTCAG ATAGTGACTC GGATTCAGCG AGTGATTCAG ACTCAGGTAG 4 020 

TGACTCCGAT TCATCAAGTG ATTCCGACTC AGAAAGTGAT TCAAATAGCG ATTCCGACTC 4 0 80 

AGTTTCTAAC AATAATGTAG TTCCGCCTAA TTCACCTAAA AATGGTACTA ATGCTTCTAA 414 0 

TAAAAATGAG GCTAAAGATA GTAAAGAACC ATTACCAOAT ACAGGTTCTG AAGATGAAGC 4 2 00 

AAATACGTCA CTAATTTGGG GATTATTAGC ATCAATARGT TCATTACTAC TTTTCAOAAG 4260 

AAAAAAAGAA AATAAAGATA AGAAATAAGT AATAATGATA TTAAATTAAT CATATGATTC 4320 

ATGAAgnAAc rCCTTAAAAG GTGGCTTTTT TACTTGGATT TTCCAAATAT ATTGTTTGAA 4 3 80 

TATAATTAAT AATTAATTCA TCAACAGTTA ATTATTTTAA AAAGGTAGAT GTTATATAAT 4440 

TTGGCTTGGC GAAAAAATAG GGTGTAAGGT AGGTTGTTAA TTAGGGAAAA TTAAGGAGAA 4 500 

AATACAGTTG AAAAATAAAT TGCTAGTTTT ATCATTGGGA GCATTATGTG TATCACAAAT 4 560 

TTGGGAAAGT AATCGTGCGA GTGCAGTGGT TTCTGGGGAG AAGAATCCAT ATGTATCTAG 4620 

TCGTTGAAAC TGACTAATAA TAAAAATAAA TCTAGAACAG TAGAAGAGTA TAAGAAAAGA 4 680 

TTGGATGATT TCAATATGGT CCnTTCCCA 4709 
(2) INFORMATION FOR SEQ ID NO: 588: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1554 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 588: 

CTTTTTTAAt TAwCgGaAtA TTGtCaTgaT tAcAcTTCGt TAGGGTTTAc gTCGtAATTT 60 

CATTTaaTAA gCgCTTCAcC ATTAAAtGTG gTAmCCTTTA ATTCGCCAGT AGAAACATCA 12 0 

CAGTAACTAA GCGCAATTTC AGGTTGATTC ATAACAAAAC TTAAAATATA GTTATTTTGT 180 
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CGTCTAACCA TACCTTTCGT TTGTTTCGGA TCTTCCATCT GTTCACAAAT AGCTACTTTA 300 

TATCCATTAT TAACAAGTGT ATCTATATAA CTATCTGCAG AATGATACGG AACACCACAC 360 

ATCGGAATTG GATTTTCTTT TTTAGCATCT CTITTAGTTA AAGTAATTTC AAGTAyACGT 420 

GATGCCTCCT TGGCATCTTC ATAAAACATT TCATAGAAAT CACCTAGTCT AAAAAATAAT 4 80 

AAGCAATCTT GGTATTCTGA TTTTATTTTT AAATATTGCT GCATCATTGG TGTAACATTA 54 0 

GACATATTAT TTCTTCACAA CCCTTGTCTC TTTTTAAAAT TTGTCTTTAC AATATATTCG 6 00 

TTTGTAAGyT TTTTAATTAT TAATTATTTA ACTTATACAT TTTAACATAC TTACTTTTAC 660 

AAACCTATTC ATACCATATA ATCACGAAGC ATCTTAAATG TATAAGAAAA CGCCTCAAAC 7 20 

CTAATAAAAT GTGTCAATAG CATGTTTAGA ATTAAATTAA AATTCTAACA TTCAAGACAT 7 80 

TTAATTAAGT AAGGGCGTTC AATATTAAAA TGAACAATGA CTCTGTTTGA AATCATATAT 84 0 

CATAAAATTA TTTTATAAAC CTTTGAAGAA TACCACGTTT TTTTAGAGTA ATTAATAAGA 900 

AATAACTTAT AATAGATCCG ATAGCACTTG aGACTATGaA CGTAATCATT AACGGTTTAA 960 

TGAAGAAGTC TTGAAGCCCA AGGaAATATG CTAATGGtAT aCaAATTAAA cTTCCgATGA 1020 

CaCCAGTTCC aAGTACTTCa CCGACCGCGG CCaTAAATAT ATGTTTACGA TATnygTAAA 1080 

ACATACTAGC CAATAAAACT CCAATCATAC TACCCGGAAA TGCAAAAGst GTACCAGTAC 114 0 

CAAAAAGAAC TCTTAAAATT GATGATATAA GCGCTTGAGC TAATCCATAC CAAGGACCTA 1200 

CTATGACCGC ACTTAATACA TTTACAAAAT GCTGTACTGG TGCTGCCTTA ACTGGTCCTA 1260 

GAGGAATGAT GATAATACTG CTTAATACAA CATTTATTGC AATTAAAAGT GCAGTTATAG 1320 

CCAGTTTTCT TGATTTCATA TGATTGTTCT CCTTTTTGTT TGTAATTAAT CACTATGCTT 13 8 0 

GGCTTTATTA TGGTCATTTA AACGTGTTTC CATTGTTGAT ACAAACATTT TCAATAATTG 144 0 

ATTCGCTTCA TATTGTGAAG TTTGAAACTG TTCAACTATG GGCAATGTAT TTATTTCTGC 150 0 

TTCTAtACTC TGAATGGtAT GTTCCGACTG nTCCAGCGCA TTTGTTTCCC GnAA 1554 
(2) INFORMATION FOR SEQ ID NO: 589: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 638 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 589: 

TTTTGACGAA AAAAGTTGAT TTACAAATAT ATAAACGTTG TGATTTCAAT GTTTGTATAG 6 0 
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TATATACATG ACAGCAACTT GGGAAAAAAA GGAAGGTAAC GAAGGTTTAT TAACTGTTAC 180 

TGTTCCTGCA GAAAAAGTAA ACAAAGCrCT GAGAGATCCC CTCATAATTT CCCCAAAGCG 24 0 

TAACCATGTG TGAATAAATT TTGAGCTAOT AGGGTTGCAG CCACGAGTAA GTCTTCCCTT 3 00 

GTTATTGTGT AGCCAGAATG CCGCAAAACT TCCATGCCTA AGCGAACTGT TGAGAGTACG 36 0 

TTTCGATTTC TGACTGTGTT AGCCTGGAAG TGCTTGTCCC AACCTTGTTT CTGAGCATGA 42 0 

ACGsCCGCAA GCCAACATGT TAGTTGAAGC ATCAGGGCGA TTAGCAGCAT GmTATCAAAA 4 80 

CGCTCTGAGC TGCTCGTTCG GCTATGGCGT AGGCcTAGTC CGTAGgCAGG ACTTTTCAAG 54 0 

TCTCGGAAGG yTTCTTCAAT CTGCATTCGC TTCGAATAGA TATTAACAAG TTGTTTGGGT 6 00 

GTTcGAATTk CAACArGTaA GTtAGtTGCT AGAnCCCA 638 
(2) INFORMATION FOR SEQ ID NO: 590: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1242 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 590: 

AAAATATTCC CGTACATTTT GATGTCTGTA GGGGCTTTTT TGACTTTAGG ATTTGTCATT 6 0 

TTTTCAATTC ATAAAGGGAG ACGAACGAAA AATGAATCAG CACGTAAAAG TAACATTTGA 12 0 

TTTTACTAAT TATAATTACG GCACATATGA CTTAGCAGTA CCAGCATATT TACCGATAAA 180 

AAACTTAATA GCTTTAGTAT TGGATAGTTT GGACATTTCA ATATTTGATG TCAATACACA 24 0 

AATTAAAGTG ATGACGAAAG GTCAATTACT TGTTGAAAAT GATCGACTCA TTGATTATCA 3 00 

AATCGCTGAT GGAGATATTT TGAAGTTACT ATAGGAGGGA AAATAGATGG TTAAAAATCA 36 0 

TAACCCTAAA AATGAAATGC aAGATATGTT AACGCCTTTA GATGCTGAAG AAGCAGCTAA 420 

AACAAAATTA CGCTTAGATA TGAGAGAGAT TCCTAAGTCT TCAATTAAAC CAGAACATTT 4 80 

TCATTTAATG TACTTATTAG AACAACATTC TCCATATTTT ATAGATGCTG AATTAACTGA 54 0 

ACTACGTGAC aGTTTCCaAA TACATtATGA CATTAATGAc AATCATACAC CTTTTGATAA 600 

TATTAAATCA TTTACTAAAA ATGAAAAATT ACGTTACTTA CTCAATATCA AAAATTTAGA 66 0 

AGAAgTAAAT CGTACACGCT ACACATTTGT GTTGGCACCA GATGAATTAT TTTTCACAAG 72 0 

AGATGGATTA CCCATTGCTA AAACAAGAGG GTTACAAAAT GTTGTTGATC CATTACCTGT 78 0 

GTCAGAAGCT GAATTTTTAA CAAGATATAA AGCGCTGGTT ATCTGTGCAT TCAATGAGAA 84 0 
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AACTAAAGTT ATTGAAGCGG CAACGTTAGA TTTACTAACG GCATTTTTAG ATGAACAGTA 96 0 

TCAGAAACAA GAACAAGATT ATAGTCAAAA TTATGCATAT GTACGCAAAG TAGGACATAC 102 0 

CGTTTTCAAA TGGGTTGCTA TCGGTATGAC AACGTTAAGT GTTTTATTAA TTGCATTCTT 10 80 

AGCCTTTTTA TATTTTTCAG TAATGAAGCA TAATGAGCGC ATTGAAAAAG GATACCAAGC 114 0 

ATTTGTAAAG GATGtTATAC GCAAGTACTA AATACGTATG ATGATTTAGA TGGTaAAAAt 120 0 

TgaTAAAGAG GCACTTTACA TTTATGCCAA AAGTTATATC CA 1242 
(2) INFORMATION FOR SEQ ID NO: 591: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 744 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 591: 



TTCCAGATAG AGCCTTAGTT GCTGCCGCTG AATTGTCTGA TCGTTACATC ACTGATCGTT 60 

TTTTTACCAG ATAAAGCGAT TGATTTAGTT GACCAAGCAT GTGCAACAAT TCGTACGGAA 120 

ATGGGATCAA ATCCAACTGA ATTGGATCAA GTTAATAGAC GTGTCATGCA ATTAGAAATT 180 

GAAGAAAGCG CACTTAAAAA TGAATCTGAC AATGCGAGCA AACAGAGATT ACAAGAACTA 240 

CAAGAAGAGC TTGCCAATGA AAAAGAGAAA CAAGCAGCAC TTCAATCTCG TGTAGAATCA 300 

GAAAAAGAAA AAATAGCAAA TTTACAAGAA AAACGTGCGC AACTAGATGA AAGTAGACAA 360 

GCGTTGGAAG ATGCACAAAC AAATAACAAT TTAGAAAAAG CTGCTGAACT ACAATATGGA 420 

ACAATTCCTC AATTGGAAAA AGAACTTAGA GAATTAGAGG ATAATTTCCA AGATGAGCAA 480 

GGTGAAGATA CAGATCGAAT GATTCGTGAA GTTGTAACAG ACGAAGAAAT TGGCGATATT 540 

GTCAGCCAAT GGACAGGCAT ACCAGTTTCA AAATTAGTTG AAACAGAACG TGAAAAATTA 600 

CTTCACTTAA GTGACATCTT GCATAAACGT GTTGTAGGTC AAGATAAAGC GGTTGACCTG 660 

GTTTCAGATG CAGTAGTTAG AGCAAGAGCA GGTATTAAAG TnCAAACAGA CCTATTGGTA 720 

GTTTCTnATT CCTAGGTCCn ACTG 
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(2) INFORMATION FOR SEQ ID NO: 592: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1449 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
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